Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???	001111110011111100111111	3f3f3f
SJIS-WIN	?硯遂	0011111110001100101001011001000010001011	3f8ca5908b
EUC-JP	?硯遂	0011111110111000101001111011111111101011	3fb8a7bfeb
UTF-8	썍硯遂	111011001000110110001101111001111010000110101111111010011000000110000010	ec8d8de7a1afe98182
UHC	썍硯遂	100110110111101011100110110110111110001011000100	9b7ae6dbe2c4

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
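
The table rows can be approximated in Python. This is a minimal sketch, not the page's actual implementation: the helper name encode_row and the mapping from the page's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is why the ISO-8859-1 row shows only question marks.

```python
# Charset labels from the table mapped to Python codec names.
# The mapping is an assumption about which codecs match the page's labels.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary string, hex string) for one charset.

    Hypothetical helper: unencodable characters become "?" instead of
    raising an error.
    """
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives a round trip through the charset.
    shown = raw.decode(codec, errors="replace")
    # Render every byte as 8 binary digits, with no separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        shown, bits, hexed = encode_row("썍硯遂", codec)
        print(label, shown, bits, hexed, sep="\t")
```

Using errors="replace" rather than the default strict mode keeps the loop running for charsets that lack some of the input characters, at the cost of silently losing those characters.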