Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
EUC-JP ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
UTF-8 셔섦셍섰셔롎렽섟셔렣렼섟셍렱렼섹셍섧셍렯렼섬B 11101100100001011001010011101100100001001010011011101100100001011000110111101100100001001011000011101100100001011001010011101011101000011000111011101011101000001011110111101100100001001001111111101100100001011001010011101011101000001010001111101011101000001011110011101100100001001001111111101100100001011000110111101011101000001011000111101011101000001011110011101100100001001011100111101100100001011000110111101100100001001010011111101100100001011000110111101011101000001010111111101011101000001011110011101100100001001010110001000010 ec8594ec84a6ec858dec84b0ec8594eba18eeba0bdec849fec8594eba0a3eba0bcec849fec858deba0b1eba0bcec84b9ec858dec84a7ec858deba0afeba0bcec84ac42
UHC 셔섦셍섰셔롎렽섟셔렣렼섟셍렱렼섹셍섧셍렯렼섬B 101111001100010110111100101101001011110011000100101111001011100110111100110001011000111011010100100011101100010110111100101100001011110011000101100011101011010010001110110001001011110010110000101111001100010010001110101111101000111011000100101111001011110110111100110001001011110010110101101111001100010010001110101111001000111011000100101111001011011001000010 bcc5bcb4bcc4bcb9bcc58ed48ec5bcb0bcc58eb48ec4bcb0bcc48ebe8ec4bcbdbcc4bcb5bcc48ebc8ec4bcb642

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
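
A conversion like the one in the table can be approximated with a short Python sketch. This is not the tool's own implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and `encode_row` and the sample input are hypothetical names chosen for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of `3f` in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
from __future__ import annotations

# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text: str, codec: str) -> tuple[str, str]:
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the
    # target charset cannot represent.
    data = text.encode(codec, errors="replace")
    # Render each byte as 8 binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "셔B"  # hypothetical short input
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_row(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each printed line follows the table's column order: charset, character, binary bit string, hexadecimal bit string.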