Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	???B	00111111001111110011111101000010	3f3f3f42
SJIS-WIN	陋吝址B	11101000100110111001100111100101100110101010110001000010	e89b99e59aac42
EUC-JP	陋吝址B	11101111111110111101001011100111110101001010111001000010	effbd2e7d4ae42
UTF-8	陋吝址B	11101001100110011000101111100101100100001001110111100101100111011000000001000010	e9998be5909de59d8042
UHC	陋吝址B	11010111101100001101011111110000111100101010001101000010	d7b0d7f0f2a342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)