To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????nB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110111001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6e42
SJIS-WIN ?????????????????????nB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110111001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6e42
EUC-JP ?????????????????????nB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110111001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6e42
UTF-8 챙철짠챠혳혢챙짼혵챠혳혡챙짹짢챙짠쨍챙짠혲nB 1110110010110001100110011110110010110010101000001110110010100111101000001110110010110001101000001110110110011000101100111110110110011000101000101110110010110001100110011110110010100111101111001110110110011000101101011110110010110001101000001110110110011000101100111110110110011000101000011110110010110001100110011110110010100111101110011110110010100111101000101110110010110001100110011110110010100111101000001110110010101000100011011110110010110001100110011110110010100111101000001110110110011000101100100110111001000010 ecb199ecb2a0eca7a0ecb1a0ed98b3ed98a2ecb199eca7bced98b5ecb1a0ed98b3ed98a1ecb199eca7b9eca7a2ecb199eca7a0eca88decb199eca7a0ed98b26e42
UHC 챙철짠챠혳혢챙짼혵챠혳혡챙짹짢챙짠쨍챙짠혲nB 1100001110101100110000111011011011000010101001111100001110101101110000101001101011000010100010111100001110101100110000101011001011000010100111001100001110101101110000101001101011000010100010101100001110101100110000101011000111000010101010001100001110101100110000101010011111000010101110001100001110101100110000101010011111000010100110010110111001000010 c3acc3b6c2a7c3adc29ac28bc3acc2b2c29cc3adc29ac28ac3acc2b1c2a8c3acc2a7c2b8c3acc2a7c2996e42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)