Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	?????	0011111100111111001111110011111100111111	3f3f3f3f3f
SJIS-WIN	逶門嶋鮨冗	11100111100110111001011011100101100100111000100011101001101111011000111111100111	e79b96e59388e9bd8fe7
EUC-JP	逶門嶋鮨冗	11101101111110111100110011100111110001011110100011110010101111111011111011101001	edfbcce7c5e8f2bfbee9
UTF-8	逶門嶋鮨冗	111010011000000010110110111010011001011010000000111001011011011010001011111010011010111010101000111001011000011010010111	e980b6e99680e5b68be9aea8e58697
UHC	?門嶋?冗	0011111111011010101001101101001111110111001111111110100110110111	3fdaa6d3f73fe9b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)