To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????z????zB 0011111100111111001111110011111101111010001111110011111100111111001111110111101001000010 3f3f3f3f7a3f3f3f3f7a42
SJIS-WIN ????z????zB 0011111100111111001111110011111101111010001111110011111100111111001111110111101001000010 3f3f3f3f7a3f3f3f3f7a42
EUC-JP ????z????zB 0011111100111111001111110011111101111010001111110011111100111111001111110111101001000010 3f3f3f3f7a3f3f3f3f7a42
UTF-8 철횚횧챦z철횚횧챦zB 111011001011001010100000111011011001101010011010111011011001101010100111111011001011000110100110011110101110110010110010101000001110110110011010100110101110110110011010101001111110110010110001101001100111101001000010 ecb2a0ed9a9aed9aa7ecb1a67aecb2a0ed9a9aed9aa7ecb1a67a42
UHC 철횚횧챦z철횚횧챦zB 11000011101101101100001110010100110000111001111011000011101011110111101011000011101101101100001110010100110000111001111011000011101011110111101001000010 c3b6c394c39ec3af7ac3b6c394c39ec3af7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)