What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 艾?????罐? 11100100100010000011111100111111001111110011111100111111111000111010001100111111 e4883f3f3f3f3fe3a33f
EUC-JP 艾?????罐堉 111001111110100000111111001111110011111100111111001111111110011010100101100011111011011111111101 e7e83f3f3f3f3fe6a58fb7fd
UTF-8 艾싳엺柳룟넼罐堉 111010001000100110111110111011001000101110110011111011001001011110111010111011111010011110001001111010111010001110011111111010111000010010111100111001111011110110010000111001011010000010001001 e889beec8bb3ec97baefa789eba39feb84bce7bd90e5a089
UHC 艾싳엺柳룟넼罐堉 11100100111101011001101011101100100111101000110011101010111101111011011111100101100001101011011011001110101110001110101110111100 e4f59aec9e8ceaf7b7e586b6ceb8ebbc

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table.
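
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation: the helper name `encode_row` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions, and Python's conversion tables may differ from the page's for some characters. Unencodable characters are replaced with "?" so that they show up as 3f, as in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Hypothetical helper. Characters the codec cannot represent are
    replaced with "?" (byte 0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "\u827e"  # the first character of the table's input
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_row(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.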