Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	汐昭ｾ	11110010101111111000111010101100100011111011101010111110	f2bf8eac8fbabe
EUC-JP	?汐昭ｾ	00111111101111001010111010111110101111001000111010111110	3fbcaebebc8ebe
UTF-8	汐昭ｾ	111011101000011110110110111001101011000110010000111001101001100010101101111011111011110110111110	ee87b6e6b190e698adefbdbe
UHC	?汐昭?	001111111110000010110001111000011011100100111111	3fe0b1e1b93f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)