To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????h 0011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f68
SJIS-WIN 鄧玲ケソ驕吝濤h 11111011101110011001011111100110101110011011111111101001100000011001100111100101100111111011011101101000 fbb997e6b9bfe98199e59fb768
EUC-JP 鄧玲ケソ驕吝濤h 10001111111000101100011111001110111010001000111010111001100011101011111111110001111000011101001011100111110111101011100101101000 8fe2c7cee88eb98ebff1e1d2e7deb968
UTF-8 鄧玲ケソ驕吝濤h 11101001100001001010011111100111100011101011001011101111101111011011100111101111101111011011111111101001101010011001010111100101100100001001110111100110101111111010010001101000 e984a7e78eb2efbdb9efbdbfe9a995e5909de6bfa468
UHC 鄧玲??驕吝濤h 11010100111110001101011010111100001111110011111111001110111101101101011111110000110101001010011001101000 d4f8d6bc3f3fcef6d7f0d4a668

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)