Character and Charcode - Check how computers recognize characters

Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."

Charset	Character	Bit string (binary)	Bit string (hexadecimal)
ISO-8859-1	???????????????B	00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010	3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN	藥??逸??愿?????勇??B	1110010101011010001111110011111110001000111011010011111100111111100111001100001100111111001111110011111100111111001111111001011101000101001111110011111101000010	e55a3f3f88ed3f3f9cc33f3f3f3f3f97453f3f42
EUC-JP	藥??逸??愿?????勇??B	1110100110111011001111110011111110110000111011110011111100111111110110001100010100111111001111110011111100111111001111111100110110100110001111110011111101000010	e9bb3f3fb0ef3f3fd8c53f3f3f3f3fcda63f3f42
UTF-8	藥띲끏逸썽툣愿녶톹栒욱뫝勇싰큷B	11101000100101111010010111101011100111011011001011101011100000011000111111101001100000001011100011101100100011011011110111101101100010001010001111100110100001001011111111101011100001011011011011101101100001101011100111100110101000001001001011101100100110101011000111101011101010111001110111100101100010111000011111101100100010111011000011101101100000011011011101000010	e897a5eb9db2eb818fe980b8ec8dbded88a3e684bfeb85b6ed86b9e6a092ec9ab1ebab9de58b87ec8bb0ed81b742
UHC	藥띲끏逸썽툣愿녶톹栒욱뫝勇싰큷B	11100101101101111000110111100011100001011011111111101100111011111011110111101001101110001001101011101010101101001000011011100101101101111000110111100010111000111011111111101101100100011011110111101001101110001001101011101010101101001000011001000010	e5b78de385bfecefbde9b89aeab486e5b78de2e3bfed91bde9b89aeab48642
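
A table like the one above can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names encode_row and charcode_table. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption, not taken from the original tool.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (raw bytes, binary string, hex string) for text in one codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec lacks.
    raw = text.encode(codec, errors="replace")
    # Render every byte as 8 binary digits and concatenate them.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return raw, bits, raw.hex()


def charcode_table(text):
    """Build one (charset, character, binary, hex) row per charset."""
    rows = []
    for label, codec in CODECS.items():
        raw, bits, hexstr = encode_row(text, codec)
        # Decode the bytes again to show what survived the conversion.
        shown = raw.decode(codec, errors="replace")
        rows.append((label, shown, bits, hexstr))
    return rows


if __name__ == "__main__":
    # Hypothetical input; any short string works.
    for row in charcode_table("あB"):
        print("\t".join(row))
```

The script prints one tab-separated row per charset in the same column order as the table. The codec's own replacement behavior may differ from the original tool for some characters, so rows are not guaranteed to match byte for byte.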

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).