What bit string is a character encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 šHšB™þ—zšBᾗO 1001101001001000100110100100001010011001111111101001011101111010100110100100001011100001101111101001011101001111 9a489a4299fe977a9a42e1be974f
SJIS-WIN ?H?B???z?B???O 0011111101001000001111110100001000111111001111110011111101111010001111110100001000111111001111110011111101001111 3f483f423f3f3f7a3f423f3f3f4f
EUC-JP ?H?B?þ?z?Bá??O 001111110100100000111111010000100011111110001111101010011101000000111111011110100011111101000010100011111010101110100001001111110011111101001111 3f483f423f8fa9d03f7a3f428faba13f3f4f
UTF-8 šHšB™þ—zšBᾗO 1100001010011010010010001100001010011010010000101100001010011001110000111011111011000010100101110111101011000010100110100100001011000011101000011100001010111110110000101001011101001111 c29a48c29a42c299c3bec2977ac29a42c3a1c2bec2974f
UHC ?H?B?þ?z?B?¾?O 00111111010010000011111101000010001111111010100110101101001111110111101000111111010000100011111110101000111110100011111101001111 3f483f423fa9ad3f7a3f423fa8fa3f4f
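
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the `CODECS` mapping from the table's labels to Python codec names and the helper `to_bit_strings` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes in the rows above.

```python
# Hypothetical mapping from the labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    # Render every byte as exactly eight binary digits.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("þ", codec)
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("þ", "utf-8")` yields the two bytes c3 be, the same sequence that appears inside the UTF-8 row of the table.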

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
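
To illustrate how these charsets differ, the sketch below encodes one Japanese character with each of them. It assumes Python's built-in `cp932` and `euc_jp` codecs stand in for SJIS-Win and EUC-JP; the helper name `hex_bytes` is made up for this example.

```python
def hex_bytes(text, codec):
    """Return the encoded bytes of text as a lowercase hex string."""
    return text.encode(codec).hex()


sample = "あ"  # HIRAGANA LETTER A (U+3042)
for codec in ("cp932", "euc_jp", "utf-8"):
    print(codec, hex_bytes(sample, codec))
# cp932 82a0
# euc_jp a4a2
# utf-8 e38182
```

The same character maps to a different byte sequence in each charset, which is why text decoded with the wrong one turns into unreadable symbols.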