To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 額??悅??深 10001010011110100011111100111111111110101011110100111111001111111001000001011011 8a7a3f3ffabd3f3f905b
EUC-JP 額?????深 101100111101101100111111001111110011111100111111001111111011111110111100 b3db3f3f3f3f3fbfbc
UTF-8 額뗮궇悅뚪괮深 111010011010000110001101111010111001011110101110111010101011011010000111111001101000001010000101111010111001101010101010111010101011010010101110111001101011011110110001 e9a18deb97aeeab687e68285eb9aaaeab4aee6b7b1
UHC 額뗮궇悅뚪괮深 1110010011111110100010111110110110000010101000001110011011101101100011001110100110000010010101011110010010100010 e4fe8bed82a0e6ed8ce98255e4a2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)