To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 蒸???量頂? 10001111111101100011111100111111001111111001011111001010100100101011100000111111 8ff63f3f3f97ca92b83f
EUC-JP 蒸???量頂? 10111110111110000011111100111111001111111100111011001100110001001011101000111111 bef83f3f3fceccc4ba3f
UTF-8 蒸븃렓당量頂렋 111010001001001010111000111010111011100010000011111010111010000010010011111010111000101110111001111010011000011110001111111010011010000010000010111010111010000010001011 e892b8ebb883eba093eb8bb9e9878fe9a082eba08b
UHC 蒸븃렓당量頂렋 1111000111111010101110101110100010001110101010001011010011100111110101011110000111110000101000101000111010100010 f1fabae88ea8b4e7d5e1f0a28ea2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)