To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 瘟??押?????瘟?????蘖 1110000110001001001111110011111110001001100111110011111100111111001111110011111100111111111000011000100100111111001111110011111100111111001111111001111101010000 e1893f3f899f3f3f3f3f3fe1893f3f3f3f3f9f50
EUC-JP 瘟??押??孼??瘟?????蘖 11100001111010010011111100111111101100101010000100111111001111111000111110111010110000110011111100111111111000011110100100111111001111110011111100111111001111111101110110110001 e1e93f3fb2a13f3f8fbac33f3fe1e93f3f3f3f3fddb1
UTF-8 瘟룩큹押띈꽦孼껇퀕瘟룩큹呂잒쒼蘖 111001111001100010011111111010111010001110101001111011011000000110111001111001101000101010111100111010111001110110001000111010101011110110100110111001011010110110111100111010101011101110000111111011011000000010010101111001111001100010011111111010111010001110101001111011011000000110111001111011111010011010000000111011001001111010010010111011001001001010111100111010001001100010010110 e7989feba3a9ed81b9e68abceb9d88eabda6e5adbceabb87ed8095e7989feba3a9ed81b9efa680ec9e92ec92bce89896
UHC 瘟룩큹押띈꽦孼껇퀕瘟룩큹呂잒쒼蘖 1110100010110000101101111110100010110100100010001110010011100011101101101110100010000100101100011110010111101101100000111110100010110011100010101110100010110000101101111110100010110100100010001110010111111011100111111110100010111110101100001110010111101110 e8b0b7e8b488e4e3b6e884b1e5ed83e8b38ae8b0b7e8b488e5fb9fe8beb0e5ee

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)