Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	哀??悠∽Ⅷ	10001000101000110011111100111111100101110100100110000001111001001000011101011011	88a33f3f974981e4875b
EUC-JP	哀??悠∽?	101100001010010100111111001111111100110110101010101000101110011000111111	b0a53f3fcdaaa2e63f
UTF-8	哀노뀽悠∽Ⅷ	111001011001001110000000111010111000010110111000111010111000000010111101111001101000001010100000111000101000100010111101111000101000010110100111	e59380eb85b8eb80bde682a0e288bde285a7
UHC	哀노뀽悠∽Ⅷ	111001001110111010110011111010111000010110110011111010101110110110100001111011111010010110110111	e4eeb3eb85b3eaeda1efa5b7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)