Character and Charcode - Check how computers recognize characters

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	?????????^	00111111001111110011111100111111001111110011111100111111001111110011111101011110	3f3f3f3f3f3f3f3f3f5e
SJIS-WIN	???陰??淫??^	001111110011111100111111100010010100000100111111001111111000100011111010001111110011111101011110	3f3f3f89413f3f88fa3f3f5e
EUC-JP	渶??陰??淫??^	1000111111000111111011010011111100111111101100011010001000111111001111111011000011111100001111110011111101011110	8fc7ed3f3fb1a23f3fb0fc3f3f5e
UTF-8	渶⑹꽍陰곁랭淫앹넩^	11100110101110001011011011100010100100011011100111101010101111011000110111101001100110011011000011101010101100111000000111101011100111101010110111100110101101111010101111101100100101011011100111101011100001001010100101011110	e6b8b6e291b9eabd8de999b0eab381eb9eade6b7abec95b9eb84a95e
UHC	渶⑹꽍陰곁랭淫앹넩^	11100111101101111010100111101100100001001001110111101011111001001011000011100111101101111010100111101011111000101001110111101100100001101010100101011110	e7b7a9ec849debe4b0e7b7a9ebe29dec86a95e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
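
The conversion shown in the table can be approximated with a short script. This is a minimal sketch, not the tool's actual implementation: the function name `char_codes` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 3f), which matches the runs of 3f visible in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def char_codes(text):
    """Return {charset label: (binary bit string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    # Print one tab-separated row per charset, like the table above.
    for label, (bits, hex_string) in char_codes("あ^").items():
        print(label, bits, hex_string, sep="\t")
```

Calling `char_codes("あ^")` yields one entry per charset. The trailing `^` encodes to the single byte 5e in all five charsets, while `あ` takes one, two, or three bytes depending on the charset.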