Character and Charcode - Check how computer recognize characters

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	????????^	001111110011111100111111001111110011111100111111001111110011111101011110	3f3f3f3f3f3f3f3f5e
SJIS-WIN	薰･遲掵薰･遲掉^	111110111001111010100101111001111010110110011101011111011111101110011110101001011110011110101101100111010111101101011110	fb9ea5e7ad9d7dfb9ea5e7ad9d7b5e
EUC-JP	?･遲掵?･遲掉^	001111111000111010100101111011101010111111011001110111100011111110001110101001011110111010101111110110011101110001011110	3f8ea5eeafd9de3f8ea5eeafd9dc5e
UTF-8	薰･遲掵薰･遲掉^	11101000100101101011000011101111101111011010010111101001100000011011001011100110100011101011010111101000100101101011000011101111101111011010010111101001100000011011001011100110100011101000100101011110	e896b0efbda5e981b2e68eb5e896b0efbda5e981b2e68e895e
UHC	薰?遲?薰?遲掉^	1111110110111001001111111111001011000000001111111111110110111001001111111111001011000000110100111111110001011110	fdb93ff2c03ffdb93ff2c0d3fc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)