Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????^	00111111001111110011111100111111001111110011111101011110	3f3f3f3f3f3f5e
SJIS-WIN	単端尊辿卒息^	10010010010100001001001001011011100100011011100010010010010010001001000110110010100100011010011101011110	9250925b91b8924891b291a75e
EUC-JP	単端尊辿卒息^	11000011101100011100001110111100110000101011101011000011101010011100001010110100110000101010100101011110	c3b1c3bcc2bac3a9c2b4c2a95e
UTF-8	単端尊辿卒息^	11100101100011011001100011100111101010111010111111100101101100001000101011101000101111101011111111100101100011011001001011100110100000011010111101011110	e58d98e7abafe5b08ae8bebfe58d92e681af5e
UHC	?端尊?卒息^	0011111111010011101011101111000011101110001111111111000011101111111000111101001101011110	3fd3aef0ee3ff0efe3d35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)