What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???萸?????懊 001111110011111100111111111001001100111000111111001111110011111100111111001111111001110011100011 3f3f3fe4ce3f3f3f3f3f9ce3
EUC-JP ???萸??洹??懊 0011111100111111001111111110100011010000001111110011111110001111110001111011101000111111001111111101100011100101 3f3f3fe8d03f3f8fc7ba3f3fd8e5
UTF-8 閱묐똻萸쇠린洹욌윣懊 111010011001011010110001111010111010110010010000111010111001100010111011111010001001000010111000111011001000011110100000111010111010011010110000111001101011010010111001111011001001101010001100111011001001110010100011111001101000011110001010 e996b1ebac90eb98bbe890b8ec87a0eba6b0e6b4b9ec9a8cec9ca3e6878a
UHC 閱묐똻萸쇠린洹욌윣懊 1110011011110011100100011110101110001100100000011110101110101101101111001110100010111000101100001110101010110111100111101110101110011111101001001110011111111000 e6f391eb8c81ebadbce8b8b0eab79eeb9fa4e7f8

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
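
The kind of conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932`, `euc_jp`, and `cp949` codecs are close stand-ins for SJIS-WIN, EUC-JP, and UHC, and that unencodable characters are replaced with `?` (0x3f), as the table rows suggest. The names `CHARSETS` and `encode_rows` are hypothetical. Because codec tables differ between implementations, results for some characters may not match the table above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These codecs are assumed to approximate the page's charsets; they are not
# guaranteed to be byte-for-byte identical for every character.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, concatenated.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("あ"):
        print(label, bits, hex_string)
```

For example, `encode_rows("あ")` yields `e38182` for UTF-8 and `82a0` for SJIS-WIN, while ISO-8859-1 falls back to `3f` because it has no hiragana.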