Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	﨟ｼ霎樣飴螳	1111101110011101101111001110100010111110100111101110100110001000101110011110010110101110	fb9dbce8be9ee988b9e5ae
EUC-JP	?ｼ霎樣飴螳	0011111110001110101111001111000011000000110111001110101110110000101110111110101010110000	3f8ebcf0c0dcebb0bbeab0
UTF-8	﨟ｼ霎樣飴螳	111011111010100010011111111011111011110110111100111010011001110010001110111001101010100010100011111010011010001110110100111010001001111010110011	efa89fefbdbce99c8ee6a8a3e9a3b4e89eb3
UHC	???樣飴螳	001111110011111100111111111001011100011011101100110001111101001111011001	3f3f3fe5c6ecc7d3d9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)