To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????E 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 晶ウトセワセヘコ晶ウトセワセヘテE 100011111011101110110011110001001111000011101100101111101101110011110000111011001011111011001101101110101000111110111011101100111100010011110001100011101011111011011100111100011000111010111110110011011100001101000101 8fbbb3c4f0ecbedcf0ecbecdba8fbbb3c4f18ebedcf18ebecdc345
EUC-JP 晶ウト?セワ?セヘコ晶ウト?セワ?セヘテE 10111110101111011000111010110011100011101100010000111111100011101011111010001110110111000011111110001110101111101000111011001101100011101011101010111110101111011000111010110011100011101100010000111111100011101011111010001110110111000011111110001110101111101000111011001101100011101100001101000101 bebd8eb38ec43f8ebe8edc3f8ebe8ecd8ebabebd8eb38ec43f8ebe8edc3f8ebe8ecd8ec345
UTF-8 晶ウトセワセヘコ晶ウトセワセヘテE 11100110100110011011011011101111101111011011001111101111101111101000010011101110100000101010101111101111101111011011111011101111101111101001110011101110100000101010101111101111101111011011111011101111101111101000110111101111101111011011101011100110100110011011011011101111101111011011001111101111101111101000010011101110100001001000100111101111101111011011111011101111101111101001110011101110100001001000100111101111101111011011111011101111101111101000110111101111101111101000001101000101 e699b6efbdb3efbe84ee82abefbdbeefbe9cee82abefbdbeefbe8defbdbae699b6efbdb3efbe84ee8489efbdbeefbe9cee8489efbdbeefbe8defbe8345
UHC 晶?????????晶?????????E 1110111111011100001111110011111100111111001111110011111100111111001111110011111100111111111011111101110000111111001111110011111100111111001111110011111100111111001111110011111101000101 efdc3f3f3f3f3f3f3f3f3fefdc3f3f3f3f3f3f3f3f3f45

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)