To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert".


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 繹??巍?????Lh繹??巍?????L 11100011100010000011111100111111100110111101100100111111001111110011111100111111001111110100110001101000111000111000100000111111001111111001101111011001001111110011111100111111001111110011111101001100 e3883f3f9bd93f3f3f3f3f4c68e3883f3f9bd93f3f3f3f3f4c
EUC-JP 繹??巍?????Lh繹??巍?????L 11100101111010000011111100111111110101101101101100111111001111110011111100111111001111110100110001101000111001011110100000111111001111111101011011011011001111110011111100111111001111110011111101001100 e5e83f3fd6db3f3f3f3f3f4c68e5e83f3fd6db3f3f3f3f3f4c
UTF-8 繹먮젾巍띾떯溜뗧뼢Lh繹먮젾巍띾떯溜뗧뼢L 111001111011100110111001111010111010100010101110111011001010000010111110111001011011011110001101111010111001110110111110111010111001011010101111111011111010011110001011111010111001011110100111111010111011110010100010010011000110100011100111101110011011100111101011101010001010111011101100101000001011111011100101101101111000110111101011100111011011111011101011100101101010111111101111101001111000101111101011100101111010011111101011101111001010001001001100 e7b9b9eba8aeeca0bee5b78deb9dbeeb96afefa78beb97a7ebbca24c68e7b9b9eba8aeeca0bee5b78deb9dbeeb96afefa78beb97a7ebbca24c
UHC 繹먮젾巍띾떯溜뗧뼢Lh繹먮젾巍띾떯溜뗧뼢L 111001101011101010010000111010111010000010110000111010001110010010001101111010111000101110111111111010101111111010001011111001111001011010100101010011000110100011100110101110101001000011101011101000001011000011101000111001001000110111101011100010111011111111101010111111101000101111100111100101101010010101001100 e6ba90eba0b0e8e48deb8bbfeafe8be796a54c68e6ba90eba0b0e8e48deb8bbfeafe8be796a54c

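SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively. A character that a charset cannot represent is replaced with "?" (0x3f), which is why 3f appears repeatedly in the table.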
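
A conversion like the one in the table can be approximated with a few lines of Python. This is only a sketch: the page does not say how it is implemented, the helper name `to_bit_strings` is hypothetical, and the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed to be the closest equivalents. Python's mapping tables and its "?" substitution may differ from the page's converter for some characters.

```python
# Hypothetical helper: encode text in a charset and show the bytes
# as a binary string and as a hexadecimal string.
def to_bit_strings(text, codec):
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "Lh"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For pure ASCII input such as "Lh", every charset listed here should give the same bytes (4c 68). Differences appear only for characters outside ASCII.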