Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??\|}??\|{^	001111110011111101111100011111010011111100111111011111000111101101011110	3f3f7c7d3f3f7c7b5e
SJIS-WIN	蛭?\|}蛭?\|{^	1001010101100111001111110111110001111101100101010110011100111111011111000111101101011110	95673f7c7d95673f7c7b5e
EUC-JP	蛭?\|}蛭?\|{^	1100100111001000001111110111110001111101110010011100100000111111011111000111101101011110	c9c83f7c7dc9c83f7c7b5e
UTF-8	蛭샷\|}蛭샷\|{^	1110100010011011101011011110110010000011101101110111110001111101111010001001101110101101111011001000001110110111011111000111101101011110	e89badec83b77c7de89badec83b77c7b5e
UHC	蛭샷\|}蛭샷\|{^	11110010111101001011110010100110011111000111110111110010111101001011110010100110011111000111101101011110	f2f4bca67c7df2f4bca67c7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)