To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 鈺??湲??宋 11111011110001000011111100111111100111111101000100111111001111111001000101110110 fbc43f3f9fd13f3f9176
EUC-JP 鈺??湲??宋 1000111111100011110101010011111100111111110111101101001100111111001111111100000111010111 8fe3d53f3fded33f3fc1d7
UTF-8 鈺쎈맢湲썽쪛宋 111010011000100010111010111011001000111010001000111010111010011110100010111001101011100110110010111011001000110110111101111011001010101010011011111001011010111010001011 e988baec8e88eba7a2e6b9b2ec8dbdecaa9be5ae8b
UHC 鈺쎈맢湲썽쪛宋 1110100010101101101111011110101110010000101011101110101010111000101111011110100110100101100101001110000111100100 e8adbdeb90aeeab8bde9a594e1e4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)