Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	酊????酊	1110011111000010001111110011111100111111001111111110011111000010	e7c23f3f3f3fe7c2
EUC-JP	酊芚???酊	11101110110001001000111111010111101110110011111100111111001111111110111011000100	eec48fd7bb3f3f3feec4
UTF-8	酊芚렣쯔갱酊	111010011000010110001010111010001000101010011010111010111010000010100011111011001010111110010100111010101011000010110001111010011000010110001010	e9858ae88a9aeba0a3ecaf94eab0b1e9858a
UHC	酊芚렣쯔갱酊	111011111111100011010100111011001000111010110100110000101110101010110000101110111110111111111000	eff8d4ec8eb4c2eab0bbeff8

SJIS-Win, EUC-JP: Legacy character encodings used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (byte 3f) in the table.
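
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name describe and the mapping of the table's charset labels to Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumptions made for illustration. It encodes the text in a charset, substitutes "?" for anything the charset cannot represent, and returns the decoded text together with the bytes as a binary string and as a hexadecimal string.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def describe(text, codec):
    """Return (decoded text, bit string, hex string) for text in codec."""
    # errors="replace" turns unencodable characters into "?" (byte 0x3f).
    raw = text.encode(codec, errors="replace")
    shown = raw.decode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "\u914a"  # the first character of the table's input
    for label, codec in CHARSETS.items():
        shown, bits, hexa = describe(sample, codec)
        # Only the ASCII columns are printed, so any terminal can show them.
        print(label, bits, hexa, sep="\t")
```

For the UTF-8 row this yields e9858a for the first character, and for ISO-8859-1 it yields 3f, matching the start of the corresponding rows above. Other rows may differ in detail, since codec tables vary slightly between implementations.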