Which bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???z???zB 001111110011111100111111011110100011111100111111001111110111101001000010 3f3f3f7a3f3f3f7a42
SJIS-WIN 諸褐┻z諸褐┻zB 100011111001010010001010100011001000010010110011011110101000111110010100100010101000110010000100101100110111101001000010 8f948a8c84b37a8f948a8c84b37a42
EUC-JP 諸褐┻z諸褐┻zB 101111011111010010110011111011001010100010110101011110101011110111110100101100111110110010101000101101010111101001000010 bdf4b3eca8b57abdf4b3eca8b57a42
UTF-8 諸褐┻z諸褐┻zB 111010001010101110111000111010001010010010010000111000101001010010111011011110101110100010101011101110001110100010100100100100001110001010010100101110110111101001000010 e8abb8e8a490e294bb7ae8abb8e8a490e294bb7a42
UHC 諸褐┻z諸褐┻zB 111100001011001111001010111010001010011010110101011110101111000010110011110010101110100010100110101101010111101001000010 f0b3cae8a6b57af0b3cae8a6b57a42

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
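
The rows of the table above can be reproduced roughly as follows. This is only a sketch in Python, not the code behind this page: the function name `encode_row` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survived the round trip.
    shown = raw.decode(codec)
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "諸褐┻zB"
    for label, codec in CODECS.items():
        shown, bits, hexed = encode_row(sample, codec)
        print(label, shown, bits, hexed)
```

The binary column is simply each byte of the hexadecimal column written as eight bits, so "z" (0x7a) appears as 01111010 in every charset listed here.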