To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 æ¿¡ë¡«ëþºDæ¿¡ë¡«ëþºD^ 111001101011111110100001111010111010000110101011111010111111111010111010010001001110011010111111101000011110101110100001101010111110101111111110101110100100010001011110 e6bfa1eba1abebfeba44e6bfa1eba1abebfeba445e
SJIS-WIN ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
EUC-JP æ¿¡ë¡?ëþºDæ¿¡ë¡?ëþºD^ 1000111110101001110000011000111110100010110001001000111110100010110000101000111110101011101100111000111110100010110000100011111110001111101010111011001110001111101010011101000010001111101000101110101101000100100011111010100111000001100011111010001011000100100011111010001011000010100011111010101110110011100011111010001011000010001111111000111110101011101100111000111110101001110100001000111110100010111010110100010001011110 8fa9c18fa2c48fa2c28fabb38fa2c23f8fabb38fa9d08fa2eb448fa9c18fa2c48fa2c28fabb38fa2c23f8fabb38fa9d08fa2eb445e
UTF-8 æ¿¡ë¡«ëþºDæ¿¡ë¡«ëþºD^ 110000111010011011000010101111111100001010100001110000111010101111000010101000011100001010101011110000111010101111000011101111101100001010111010010001001100001110100110110000101011111111000010101000011100001110101011110000101010000111000010101010111100001110101011110000111011111011000010101110100100010001011110 c3a6c2bfc2a1c3abc2a1c2abc3abc3bec2ba44c3a6c2bfc2a1c3abc2a1c2abc3abc3bec2ba445e
UHC æ¿¡?¡??þºDæ¿¡?¡??þºD^ 101010011010000110100010101011111010001010101110001111111010001010101110001111110011111110101001101011011010100010101100010001001010100110100001101000101010111110100010101011100011111110100010101011100011111100111111101010011010110110101000101011000100010001011110 a9a1a2afa2ae3fa2ae3f3fa9ada8ac44a9a1a2afa2ae3fa2ae3f3fa9ada8ac445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)