What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????懿リ????竊??怨リ?孃??竊 0011111100111111001111110011111100111111001111111001110011110010100000111000101000111111001111110011111100111111111000101000011000111111001111111000100110000101100000111000101000111111100110110110111100111111001111111110001010000110 3f3f3f3f3f3f9cf2838a3f3f3f3fe2863f3f8985838a3f9b6f3f3fe286
EUC-JP ???嫄??懿リ????竊??怨リ?孃??竊 00111111001111110011111110001111101110101010000100111111001111111101100011110100101001011110101000111111001111110011111100111111111000111110011000111111001111111011000111100101101001011110101000111111110101011101000000111111001111111110001111100110 3f3f3f8fbaa13f3fd8f4a5ea3f3f3f3fe3e63f3fb1e5a5ea3fd5d03f3fe3e6
UTF-8 列룸뱪嫄쀤퐰懿リ괌呂얜벉竊뺝푻怨リ텊孃뉖봾竊 111011111010011010011100111010111010001110111000111010111011000110101010111001011010101110000100111011001000000010100100111011011001000010110000111001101000011110111111111000111000001110101010111010101011010010001100111011111010011010000000111011001001011010011100111010111011001010001001111001111010101110001010111010111011101010011101111011011001000110111011111001101000000010101000111000111000001110101010111011011000010110001010111001011010110110000011111010111000100110010110111010111011010010111110111001111010101110001010 efa69ceba3b8ebb1aae5ab84ec80a4ed90b0e687bfe383aaeab48cefa680ec969cebb289e7ab8aebba9ded91bbe680a8e383aaed858ae5ad83eb8996ebb4bee7ab8a
UHC 列룸뱪嫄쀤퐰懿リ괌呂얜벉竊뺝푻怨リ텊孃뉖봾竊 1110011011101010101101111110101110010011100100001110101010110001100101111110010010111101100110011110101111110011101010111110101010110001101000011110010111111011101111101110101110010011101011001110111110111100100101011110010110111110100001111110101010110011101010111110101010110110100001111110010110111110100001111110101110010100100001011110111110111100 e6eab7eb9390eab197e4bd99ebf3abeab1a1e5fbbeeb93acefbc95e5be87eab3abeab687e5be87eb9485efbc

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
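
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the converter's actual implementation. The function name `encode_all` and the `CHARSETS` mapping are hypothetical. The codec names are Python's own: `cp932` stands in for SJIS-WIN (as the note above states), and `cp949` is assumed to correspond to UHC. Replacing unencodable characters with `?` (0x3f) is also an assumption, chosen because it is consistent with the runs of `3f` in the table.

```python
# Map the charset labels used in the table to Python codec names.
# "cp949" for UHC is an assumption about what the tool uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for characters
        # that the target charset cannot represent.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexa) in encode_all("リ").items():
        print(label, bits, hexa)
```

For the single katakana character リ, this sketch yields e383aa in UTF-8, 838a in SJIS-WIN, a5ea in EUC-JP, and 3f in ISO-8859-1, which agrees with the byte sequences for that character in the table.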