To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 瘟??耶?????}瘟??耶?????{^ 11100001100010010011111100111111100101101110101100111111001111110011111100111111001111110111110111100001100010010011111100111111100101101110101100111111001111110011111100111111001111110111101101011110 e1893f3f96eb3f3f3f3f3f7de1893f3f96eb3f3f3f3f3f7b5e
EUC-JP 瘟??耶?????}瘟??耶?????{^ 11100001111010010011111100111111110011001110110100111111001111110011111100111111001111110111110111100001111010010011111100111111110011001110110100111111001111110011111100111111001111110111101101011110 e1e93f3fcced3f3f3f3f3f7de1e93f3fcced3f3f3f3f3f7b5e
UTF-8 瘟루윮耶섋쒼呂묉쓳}瘟루윮耶섋쒼呂묉쓳{^ 111001111001100010011111111010111010001110101000111011001001110010101110111010001000000010110110111011001000010010001011111011001001001010111100111011111010011010000000111010111010110010001001111011001001001110110011011111011110011110011000100111111110101110100011101010001110110010011100101011101110100010000000101101101110110010000100100010111110110010010010101111001110111110100110100000001110101110101100100010011110110010010011101100110111101101011110 e7989feba3a8ec9caee880b6ec848bec92bcefa680ebac89ec93b37de7989feba3a8ec9caee880b6ec848bec92bcefa680ebac89ec93b37b5e
UHC 瘟루윮耶섋쒼呂묉쓳}瘟루윮耶섋쒼呂묉쓳{^ 111010001011000010110111111001111001111110101101111001011010110110011000111010001011111010110000111001011111101110010001111001101001110110010001011111011110100010110000101101111110011110011111101011011110010110101101100110001110100010111110101100001110010111111011100100011110011010011101100100010111101101011110 e8b0b7e79fade5ad98e8beb0e5fb91e69d917de8b0b7e79fade5ad98e8beb0e5fb91e69d917b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)