To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 樗烽?鞨??? 10010010100101001110000010000010001111111110100011100000001111110011111100111111 9294e0823fe8e03f3f3f
EUC-JP 樗烽?鞨??? 11000011111101001101111111100010001111111111000011100010001111110011111100111111 c3f4dfe23ff0e23f3f3f
UTF-8 樗烽렒鞨렜잴샴 111001101010100010010111111001111000001110111101111010111010000010010010111010011001111010101000111010111010000010011100111011001001111010110100111011001000001110110100 e6a897e783bdeba092e99ea8eba09cec9eb4ec83b4
UHC 樗烽렒鞨렜잴샴 1110111011000000110111001110101110001110101001111100101011101010100011101010111011000000111010101011110010100100 eec0dceb8ea7caea8eaec0eabca4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)