To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???WD???WD^ 0011111100111111001111110101011101000100001111110011111100111111010101110100010001011110 3f3f3f57443f3f3f57445e
SJIS-WIN ??あWD??あWD^ 00111111001111111000001010100000010101110100010000111111001111111000001010100000010101110100010001011110 3f3f82a057443f3f82a057445e
EUC-JP ??あWD??あWD^ 00111111001111111010010010100010010101110100010000111111001111111010010010100010010101110100010001011110 3f3fa4a257443f3fa4a257445e
UTF-8 룴횕あWD룴횕あWD^ 1110101110100011101101001110110110011010100101011110001110000001100000100101011101000100111010111010001110110100111011011001101010010101111000111000000110000010010101110100010001011110 eba3b4ed9a95e381825744eba3b4ed9a95e3818257445e
UHC 룴횕あWD룴횕あWD^ 1000111110101001110000111000111110101010101000100101011101000100100011111010100111000011100011111010101010100010010101110100010001011110 8fa9c38faaa257448fa9c38faaa257445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)