What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f |
| SJIS-WIN | 繞??????違 | 11100011100001010011111100111111001111110011111100111111001111111000100011100001 | e3853f3f3f3f3f3f88e1 |
| EUC-JP | 繞??????違 | 11100101111001010011111100111111001111110011111100111111001111111011000011100011 | e5e53f3f3f3f3f3fb0e3 |
| UTF-8 | 繞섎맧짯連곕끇違 | 111001111011100110011110111011001000010010001110111010111010011110100111111011001010011110101111111011111010011010011010111010101011001110010101111010111000000110000111111010011000000110010101 | e7b99eec848eeba7a7eca7afefa69aeab395eb8187e98195 |
| UHC | 繞섎맧짯連곕끇違 | 11101001101001001001100011101011100100001011000011000010101011011110011011100110101100001110101110000101101110111110101011011110 | e9a498eb90b0c2ade6e6b0eb85bbeade |

SJIS-WIN, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
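
The same kind of conversion can be sketched in Python. This is only an illustration, not the code behind this page: the function name `encode_table` and the mapping from the charset labels to Python codec names are assumptions (SJIS-WIN is mapped to `cp932` as noted above, and UHC to `cp949`). Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is how the `3f` runs in the results above would arise.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the codec cannot encode.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hex_string in encode_table("違"):
    print(label, bits, hex_string)
```

For the single character 違, this sketch should give `88e1` for SJIS-WIN, `b0e3` for EUC-JP, and `e98195` for UTF-8, matching the trailing bytes of those rows above; other codecs or conversion libraries may differ in how they handle unmappable characters.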