Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???????	00111111001111110011111100111111001111110011111100111111	3f3f3f3f3f3f3f
SJIS-WIN	襍ｦ譚楢伏迴撮	11101000101101011010011011100110100111011001001111101000100101011001101011100111100011111000111001000010	e8b5a6e69d93e8959ae78f8e42
EUC-JP	襍ｦ譚楢伏迴撮	1111000010110111100011101010011011101011111111011100011011101010110010011111101011101101111011111011101110100011	f0b78ea6ebfdc6eac9faedefbba3
UTF-8	襍ｦ譚楢伏迴撮	111010001010010110001101111011111011110110100110111010001010110110011010111001101010010110100010111001001011110010001111111010001011111110110100111001101001001010101110	e8a58defbda6e8ad9ae6a5a2e4bc8fe8bfb4e692ae
UHC	??譚楢伏?撮	0011111100111111110100111100100111101010111110011101110011010001001111111111010111001001	3f3fd3c9eaf9dcd13ff5c9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)