Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??k	001111110011111101101011	3f3f6b
SJIS-WIN	鋠汐k	1111101111010101100011101010110001101011	fbd58eac6b
EUC-JP	鋠汐k	100011111110010011000111101111001010111001101011	8fe4c7bcae6b
UTF-8	鋠汐k	11101001100010111010000011100110101100011001000001101011	e98ba0e6b1906b
UHC	?汐k	00111111111000001011000101101011	3fe0b16b

SJIS-Win, EUC-JP: Classic Japanese character encodings, used mainly on Windows (SJIS-Win, also known as CP932) and UNIX (EUC-JP).
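
The rows in the table above can be approximated offline with a short script. This is only a sketch, not the code behind this page: it assumes Python's codec names `cp932` and `cp949` are close enough stand-ins for SJIS-WIN and UHC, and the helpers `to_bits` and `encode_row` are hypothetical names chosen for illustration. Characters a charset cannot represent are replaced with "?" (byte 3f), which is what the ISO-8859-1 and UHC rows show.

```python
# Sketch: encode one string under several charsets and show the raw bytes.
# Assumption: the Python codec names below stand in for the labels used in
# the table (cp932 for SJIS-WIN, cp949 for UHC).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Return the bytes as one continuous string of 0s and 1s."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_row(text: str, codec: str) -> tuple:
    """Encode text with codec; unencodable characters become '?'."""
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip.
    shown = raw.decode(codec, errors="replace")
    return shown, to_bits(raw), raw.hex()


if __name__ == "__main__":
    print("Charset", "Character", "Binary", "Hexadecimal", sep="\t")
    for label, codec in CHARSETS.items():
        shown, bits, hexa = encode_row("鋠汐k", codec)
        print(label, shown, bits, hexa, sep="\t")
```

The same text yields a different byte sequence under each charset, and only the ASCII letter "k" (6b) is encoded identically in all of them.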