To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ????桎??? 001111110011111100111111001111111001111001111110001111110011111100111111 3f3f3f3f9e7e3f3f3f
EUC-JP ????桎??? 001111110011111100111111001111111101101111011111001111110011111100111111 3f3f3f3fdbdf3f3f3f
UTF-8 섹셩셸양桎렱섹셨 111011001000010010111001111011001000010110101001111011001000010110111000111011001001011010010001111001101010000110001110111010111010000010110001111011001000010010111001111011001000010110101000 ec84b9ec85a9ec85b8ec9691e6a18eeba0b1ec84b9ec85a8
UHC 섹셩셸양桎렱섹셨 10111100101111011011110011001101101111001101000010111110111001111111001011101110100011101011111010111100101111011011110011001100 bcbdbccdbcd0bee7f2ee8ebebcbdbccc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)