Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????	001111110011111100111111001111110011111100111111	3f3f3f3f3f3f
SJIS-WIN	鱈巽蔵竪属測	100100100100110010010010010001101001000110100000100100100100011110010001101011101001000110101010	924c924691a0924791ae91aa
EUC-JP	鱈巽蔵竪属測	110000111010110111000011101001111100001010100010110000111010100011000010101100001100001010101100	c3adc3a7c2a2c3a8c2b0c2ac
UTF-8	鱈巽蔵竪属測	111010011011000110001000111001011011011110111101111010001001010010110101111001111010101110101010111001011011000110011110111001101011100010101100	e9b188e5b7bde894b5e7abaae5b19ee6b8ac
UHC	?巽?竪?測	001111111110000111011110001111111110001010110101001111111111011010110100	3fe1de3fe2b53ff6b4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)