Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???	001111110011111100111111	3f3f3f
SJIS-WIN	髀瀧罰	111010011000111110010001111010111001010010110001	e98f91eb94b1
EUC-JP	髀瀧罰	111100011110111111000010111011011100100010110011	f1efc2edc8b3
UTF-8	髀瀧罰	111010011010101110000000111001111000000010100111111001111011110110110000	e9ab80e780a7e7bdb0
UHC	?瀧罰	0011111111010110111010011101101111101011	3fd6e9dbeb

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
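
The rows of the table can be approximated offline with a few lines of Python. This is only a sketch, not the converter's actual implementation: the function name charcodes and the CODECS mapping are made up for illustration, and the choice of Python codec for each charset (latin-1 for ISO-8859-1, cp932 for SJIS-WIN, cp949 for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?", which is how the ISO-8859-1 and UHC rows appear to have been produced.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def charcodes(text, codec):
    """Encode text and return (bit string, hex string).

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        bits, hexa = charcodes("髀瀧罰", codec)
        print(label, bits, hexa, sep="\t")
```

Each output line has the same tab-separated layout as the table, minus the Character column. Passing a different string to charcodes shows how the byte length changes from one charset to another.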