Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????	00111111001111110011111100111111	3f3f3f3f
SJIS-WIN	兄ゆ?嚥	10001100010110101000001011100100001111111001101010001011	8c5a82e43f9a8b
EUC-JP	兄ゆ?嚥	10110111101110111010010011100110001111111101001111101011	b7bba4e63fd3eb
UTF-8	兄ゆ굢嚥	111001011000010110000100111000111000001010000110111010101011010110100010111001011001101010100101	e58584e38286eab5a2e59aa5
UHC	兄ゆ굢嚥	1111101011111100101010101110011010000010100010011110011010111111	fafcaae68289e6bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)