To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 樗??造?堤? 10010010100101000011111100111111100100011010001000111111100100101110011100111111 92943f3f91a23f92e73f
EUC-JP 樗??造?堤? 11000011111101000011111100111111110000101010010000111111110001001110100100111111 c3f43f3fc2a43fc4e93f
UTF-8 樗곌짊造렊堤렩 111001101010100010010111111010101011001110001100111011001010011110001010111010011000000010100000111010111010000010001010111001011010000010100100111010111010000010101001 e6a897eab38ceca78ae980a0eba08ae5a0a4eba0a9
UHC 樗곌짊造렊堤렩 1110111011000000101100001110101011000001111110111111000011100011100011101010000111110000101001111000111010110111 eec0b0eac1fbf0e38ea1f0a78eb7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)