What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????踰???iП?????????^ 00111111001111110011111100111111001111110011111111100110111110100011111100111111001111111000001010001001100001000101000000111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3fe6fa3f3f3f828984503f3f3f3f3f3f3f3f3f5e
EUC-JP ??????踰???iП??????孼??^ 001111110011111100111111001111110011111100111111111011001111110000111111001111110011111110100011111010011010011110110001001111110011111100111111001111110011111100111111100011111011101011000011001111110011111101011110 3f3f3f3f3f3fecfc3f3f3fa3e9a7b13f3f3f3f3f3f8fbac33f3f5e
UTF-8 遼깅젨驪낂솈踰녔뤃溜iП溜졿칲料뚮껙孼대젫^ 111011111010011110000011111010101011100110000101111011001010000010101000111011111010011010000111111010111000001010000010111011001000011010001000111010001011100010110000111010111000010110010100111010111010010010000011111011111010011110001011111011111011110110001001110100001001111111101111101001111000101111101100101000011011111111101100101110011011001011101111101001101011111011101011100110101010111011101010101110111001100111100101101011011011110011101011100011001000000011101100101000001010101101011110 efa783eab985eca0a8efa687eb8282ec8688e8b8b0eb8594eba483efa78befbd89d09fefa78beca1bfecb9b2efa6beeb9aaeeabb99e5adbceb8c80eca0ab5e
UHC 遼깅젨驪낂솈踰녔뤃溜iП溜졿칲料뚮껙孼대젫^ 11101001101011001011000111101011101000001010000011100110101011111000010111101001100110011000110011101011101100101011001111100110100011111011010011101010111111101010001111101001101011001011000111101010111111101010000011100110101011111000010111101000111101111000110011101011101100101011001111100101111011011011010011101011101000001010001101011110 e9acb1eba0a0e6af85e9998cebb2b3e68fb4eafea3e9acb1eafea0e6af85e8f78cebb2b3e5edb4eba0a35e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
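
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation. The helper name `encode_bits` and the `CODECS` mapping are hypothetical, and the codec names are the ones Python uses (`cp932` for SJIS-Win, `cp949` for UHC). It assumes that characters missing from a charset are replaced with `?` (0x3f), which matches the `3f` runs in the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Render every byte as eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    # Print one row per charset for a short sample string.
    for label, codec in CODECS.items():
        binary, hexadecimal = encode_bits("П^", codec)
        print(label, binary, hexadecimal)
```

For the sample string, the ISO-8859-1 row starts with `3f` because that charset has no Cyrillic letters, while every row ends with `5e` for the ASCII character `^`.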