To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ª¿¡ë¡à¢ízª¿¡ë¡à¢ízB 1000111110101010101111111010000111101011101000011110000010001111101000101110110101111010100011111010101010111111101000011110101110100001111000001000111110100010111011010111101001000010 8faabfa1eba1e08fa2ed7a8faabfa1eba1e08fa2ed7a42
SJIS-WIN ????????¢?z????????¢?zB 00111111001111110011111100111111001111110011111100111111001111111000000110010001001111110111101000111111001111110011111100111111001111110011111100111111001111111000000110010001001111110111101001000010 3f3f3f3f3f3f3f3f81913f7a3f3f3f3f3f3f3f3f81913f7a42
EUC-JP ?ª¿¡ë¡à?¢íz?ª¿¡ë¡à?¢ízB 0011111110001111101000101110110010001111101000101100010010001111101000101100001010001111101010111011001110001111101000101100001010001111101010111010001000111111101000011111000110001111101010111011111101111010001111111000111110100010111011001000111110100010110001001000111110100010110000101000111110101011101100111000111110100010110000101000111110101011101000100011111110100001111100011000111110101011101111110111101001000010 3f8fa2ec8fa2c48fa2c28fabb38fa2c28faba23fa1f18fabbf7a3f8fa2ec8fa2c48fa2c28fabb38fa2c28faba23fa1f18fabbf7a42
UTF-8 ª¿¡ë¡à¢ízª¿¡ë¡à¢ízB 11000010100011111100001010101010110000101011111111000010101000011100001110101011110000101010000111000011101000001100001010001111110000101010001011000011101011010111101011000010100011111100001010101010110000101011111111000010101000011100001110101011110000101010000111000011101000001100001010001111110000101010001011000011101011010111101001000010 c28fc2aac2bfc2a1c3abc2a1c3a0c28fc2a2c3ad7ac28fc2aac2bfc2a1c3abc2a1c3a0c28fc2a2c3ad7a42
UHC ?ª¿¡?¡????z?ª¿¡?¡????zB 00111111101010001010001110100010101011111010001010101110001111111010001010101110001111110011111100111111001111110111101000111111101010001010001110100010101011111010001010101110001111111010001010101110001111110011111100111111001111110111101001000010 3fa8a3a2afa2ae3fa2ae3f3f3f3f7a3fa8a3a2afa2ae3fa2ae3f3f3f3f7a42

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
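
The conversion shown in the table can be approximated offline with a short Python sketch. The codec names used here (`cp932` for SJIS-Win, `cp949` for UHC) are Python's own and are an assumption about which mapping tables correspond to this page; the helper names `to_bit_strings` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), as in the table, but the exact results for unmappable characters may differ from the page's converter.

```python
# Map the labels used in the table to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("zB").items():
        print(label, binary, hexadecimal)
```

ASCII characters such as "z" and "B" encode to the same single byte in all five charsets, which is why every row of the table ends in `7a...42`; the rows differ only in how the non-ASCII characters are handled.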