To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
EUC-JP ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
UTF-8 셔섀셍섟셍셉셔렯렼롙롚셔섀셍섟셍셉셔렯렼롙롘^ 11101100100001011001010011101100100001001000000011101100100001011000110111101100100001001001111111101100100001011000110111101100100001011000100111101100100001011001010011101011101000001010111111101011101000001011110011101011101000011001100111101011101000011001101011101100100001011001010011101100100001001000000011101100100001011000110111101100100001001001111111101100100001011000110111101100100001011000100111101100100001011001010011101011101000001010111111101011101000001011110011101011101000011001100111101011101000011001100001011110 ec8594ec8480ec858dec849fec858dec8589ec8594eba0afeba0bceba199eba19aec8594ec8480ec858dec849fec858dec8589ec8594eba0afeba0bceba199eba1985e
UHC 셔섀셍섟셍셉셔렯렼롙롚셔섀셍섟셍셉셔렯렼롙롘^ 101111001100010110111100101010001011110011000100101111001011000010111100110001001011110011000001101111001100010110001110101111001000111011000100100011101101110110001110110111101011110011000101101111001010100010111100110001001011110010110000101111001100010010111100110000011011110011000101100011101011110010001110110001001000111011011101100011101101110001011110 bcc5bca8bcc4bcb0bcc4bcc1bcc58ebc8ec48edd8edebcc5bca8bcc4bcb0bcc4bcc1bcc58ebc8ec48edd8edc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)