To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 藥?1??い? 11100101010110100011111110000010010100000011111100111111100000101010001000111111 e55a3f82503f3f82a23f
EUC-JP 藥?1??い? 11101001101110110011111110100011101100010011111100111111101001001010010000111111 e9bb3fa3b13f3fa4a43f
UTF-8 藥먨1溫롨い溫 111010001001011110100101111010111010100010101000111011111011110010010001111001101011101010101011111010111010000110101000111000111000000110000100111001101011101010101011 e897a5eba8a8efbc91e6baabeba1a8e38184e6baab
UHC 藥먨1溫롨い溫 1110010110110111100100001110010110100011101100011110100010101110100011101110100010101010101001001110100010101110 e5b790e5a3b1e8ae8ee8aaa4e8ae

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)