What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???Jz???JzB | 0011111100111111001111110100101001111010001111110011111100111111010010100111101001000010 | 3f3f3f4a7a3f3f3f4a7a42 |
| SJIS-WIN | ???Jz???JzB | 0011111100111111001111110100101001111010001111110011111100111111010010100111101001000010 | 3f3f3f4a7a3f3f3f4a7a42 |
| EUC-JP | ???Jz???JzB | 0011111100111111001111110100101001111010001111110011111100111111010010100111101001000010 | 3f3f3f4a7a3f3f3f4a7a42 |
| UTF-8 | 쩌횈체JzB | 1110110010101001100011001110110110011010100010001110110010110010101101000100101001111010111011001010100110001100111011011001101010001000111011001011001010110100010010100111101001000010 | eca98ced9a88ecb2b44a7aeca98ced9a88ecb2b44a7a42 |
| UHC | 쩌횈체Jz쩌횈체JzB | 1100001010111100110000111000011011000011101111000100101001111010110000101011110011000011100001101100001110111100010010100111101001000010 | c2bcc386c3bc4a7ac2bcc386c3bc4a7a42 |

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
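
A conversion of this kind can be sketched in a few lines of Python. This is only an illustration, not the page's actual implementation: the function name `to_bit_strings` is hypothetical, and mapping the labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which would explain the `3f` bytes in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters
    data = text.encode(codec, errors="replace")
    # Render every byte as 8 binary digits, most significant bit first
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed mapping from the page's charset labels to Python codec names
CODECS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]

for label, codec in CODECS:
    binary, hexadecimal = to_bit_strings("Jz", codec)
    print(label, binary, hexadecimal)
```

Because "J" and "z" are ASCII characters, all five charsets encode them to the same two bytes, `4a7a`; the rows differ only for non-ASCII input.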