Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	兆?凋濫	10010010100110110011111110010010100111001001011110010100	929b3f929c9794
EUC-JP	兆?凋濫	11000011111110110011111111000011111111001100110111110100	c3fb3fc3fccdf4
UTF-8	兆렊凋濫	111001011000010110000110111010111010000010001010111001011000011110001011111001101011111110101011	e58586eba08ae5878be6bfab
UHC	兆렊凋濫	1111000010111100100011101010000111110000101111011101010110111101	f0bc8ea1f0bdd5bd

SJIS-WIN, EUC-JP: Legacy character encodings for Japanese, mainly used on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
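
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The codec names are Python's, and the mapping from the table's charset labels to them (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as is the helper name describe. Characters that a charset cannot represent are replaced with "?", which is how the 3f bytes in the table would arise.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Return (text as it survives the codec, bit string, hex string)."""
    # Characters the codec cannot represent are encoded as "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    shown = raw.decode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        shown, bits, hexed = describe("兆렊凋濫", codec)
        print(label, shown, bits, hexed, sep="\t")
```

Each printed line follows the table's column order: charset, character, binary bit string, hexadecimal bit string. The bit string is always eight times as long as the byte count, so the hexadecimal column is just a more compact view of the same bytes.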