To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?TIznf?TIzn^}Y?TIznf?TIzn^}bE 0011111101010100010010010111101001101110011001100011111101010100010010010111101001101110010111100111110101011001001111110101010001001001011110100110111001100110001111110101010001001001011110100110111001011110011111010110001001000101 3f54497a6e663f54497a6e5e7d593f54497a6e663f54497a6e5e7d6245
SJIS-WIN 咤TIznf咤TIzn^}Y咤TIznf咤TIzn^}bE 100110100100001001010100010010010111101001101110011001101001101001000010010101000100100101111010011011100101111001111101010110011001101001000010010101000100100101111010011011100110011010011010010000100101010001001001011110100110111001011110011111010110001001000101 9a4254497a6e669a4254497a6e5e7d599a4254497a6e669a4254497a6e5e7d6245
EUC-JP 咤TIznf咤TIzn^}Y咤TIznf咤TIzn^}bE 110100111010001101010100010010010111101001101110011001101101001110100011010101000100100101111010011011100101111001111101010110011101001110100011010101000100100101111010011011100110011011010011101000110101010001001001011110100110111001011110011111010110001001000101 d3a354497a6e66d3a354497a6e5e7d59d3a354497a6e66d3a354497a6e5e7d6245
UTF-8 咤TIznf咤TIzn^}Y咤TIznf咤TIzn^}bE 11100101100100101010010001010100010010010111101001101110011001101110010110010010101001000101010001001001011110100110111001011110011111010101100111100101100100101010010001010100010010010111101001101110011001101110010110010010101001000101010001001001011110100110111001011110011111010110001001000101 e592a454497a6e66e592a454497a6e5e7d59e592a454497a6e66e592a454497a6e5e7d6245
UHC 咤TIznf咤TIzn^}Y咤TIznf咤TIzn^}bE 111101101110001101010100010010010111101001101110011001101111011011100011010101000100100101111010011011100101111001111101010110011111011011100011010101000100100101111010011011100110011011110110111000110101010001001001011110100110111001011110011111010110001001000101 f6e354497a6e66f6e354497a6e5e7d59f6e354497a6e66f6e354497a6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)