Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	?斐?漿?棒	001111111001010011100011001111111001111111110111001111111001011001011111	3f94e33f9ff73f965f
EUC-JP	?斐?漿?棒	001111111100100011100101001111111101111011111001001111111100101111000000	3fc8e53fdef93fcbc0
UTF-8	뤶斐쟎漿찊棒	111010111010010010110110111001101001011010010000111011001001111110001110111001101011110010111111111011001011000010001010111001101010001110010010	eba4b6e69690ec9f8ee6bcbfecb08ae6a392
UHC	뤶斐쟎漿찊棒	100011111110010011011101111011001100000011110011111011011110110010101001100011101101110011101010	8fe4ddecc0f3edeca98edcea

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)