Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????C?????????????? 001111110011111100111111001111110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????C?????????????? 001111110011111100111111001111110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ?????????C?????????????? 001111110011111100111111001111110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 챙짹혲챙짠쨔챙철혲C챙짹혻챠혱짹챙짠혵챙짹혲챙짠 11101100101100011001100111101100101001111011100111101101100110001011001011101100101100011001100111101100101001111010000011101100101010001001010011101100101100011001100111101100101100101010000011101101100110001011001001000011111011001011000110011001111011001010011110111001111011011001100010111011111011001011000110100000111011011001100010110001111011001010011110111001111011001011000110011001111011001010011110100000111011011001100010110101111011001011000110011001111011001010011110111001111011011001100010110010111011001011000110011001111011001010011110100000 ecb199eca7b9ed98b2ecb199eca7a0eca894ecb199ecb2a0ed98b243ecb199eca7b9ed98bbecb1a0ed98b1eca7b9ecb199eca7a0ed98b5ecb199eca7b9ed98b2ecb199eca7a0
UHC 챙짹혲챙짠쨔챙철혲C챙짹혻챠혱짹챙짠혵챙짹혲챙짠 1100001110101100110000101011000111000010100110011100001110101100110000101010011111000010101110011100001110101100110000111011011011000010100110010100001111000011101011001100001010110001110000101010000011000011101011011100001010011000110000101011000111000011101011001100001010100111110000101001110011000011101011001100001010110001110000101001100111000011101011001100001010100111 c3acc2b1c299c3acc2a7c2b9c3acc3b6c29943c3acc2b1c2a0c3adc298c2b1c3acc2a7c29cc3acc2b1c299c3acc2a7

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
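
The kind of conversion shown in the table can be approximated with a short Python sketch. This is not the converter's own code. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is consistent with the runs of `3f` in the table.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption, not taken from the converter itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("C").items():
        print(label, bits, hex_string)
```

Calling `convert` with a character outside a charset's repertoire, such as a Hangul syllable for ISO-8859-1, yields `3f` for that charset, while the charsets that do contain the character return its actual byte sequence.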