What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????IG??????IR 00111111001111110011111100111111001111110011111101001001010001110011111100111111001111110011111100111111001111110100100101010010 3f3f3f3f3f3f49473f3f3f3f3f3f4952
SJIS-WIN 褶カ誾滂スェIG褶カ誾滂スェIR 11100101111101111011011011111011101001111001111111101111101111011010101001001001010001111110010111110111101101101111101110100111100111111110111110111101101010100100100101010010 e5f7b6fba79fefbdaa4947e5f7b6fba79fefbdaa4952
EUC-JP 褶カ誾滂スェIG褶カ誾滂スェIR 111010101111100110001110101101101000111111011110101001001101111011110001100011101011110110001110101010100100100101000111111010101111100110001110101101101000111111011110101001001101111011110001100011101011110110001110101010100100100101010010 eaf98eb68fdea4def18ebd8eaa4947eaf98eb68fdea4def18ebd8eaa4952
UTF-8 褶カ誾滂スェIG褶カ誾滂スェIR 11101000101001001011011011101111101111011011011011101000101010101011111011100110101110111000001011101111101111011011110111101111101111011010101001001001010001111110100010100100101101101110111110111101101101101110100010101010101111101110011010111011100000101110111110111101101111011110111110111101101010100100100101010010 e8a4b6efbdb6e8aabee6bb82efbdbdefbdaa4947e8a4b6efbdb6e8aabee6bb82efbdbdefbdaa4952
UHC 褶?誾滂??IG褶?誾滂??IR 11100011101010000011111111101011110111011101101110110101001111110011111101001001010001111110001110101000001111111110101111011101110110111011010100111111001111110100100101010010 e3a83febdddbb53f3f4947e3a83febdddbb53f3f4952

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
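
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual converter (which is not shown). The codec names in `CODECS` and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), matching the `3f` bytes in the ISO-8859-1 and UHC rows. Python's mapping tables may differ from the converter's for rare characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あIG").items():
        print(label, bits, hex_string)
```

Each row is built from the raw bytes alone, so the binary and hexadecimal columns always describe the same byte sequence.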