To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 »óìªH»óìªD»óìªH»óìªDB 10001111101110111111001111101100101010100100100010001111101110111111001111101100101010100100010010001111101110111111001111101100101010100100100010001111101110111111001111101100101010100100010001000010 8fbbf3ecaa488fbbf3ecaa448fbbf3ecaa488fbbf3ecaa4442
SJIS-WIN ?????H?????D?????H?????DB 00111111001111110011111100111111001111110100100000111111001111110011111100111111001111110100010000111111001111110011111100111111001111110100100000111111001111110011111100111111001111110100010001000010 3f3f3f3f3f483f3f3f3f3f443f3f3f3f3f483f3f3f3f3f4442
EUC-JP ??óìªH??óìªD??óìªH??óìªDB 00111111001111111000111110101011110100011000111110101011110000001000111110100010111011000100100000111111001111111000111110101011110100011000111110101011110000001000111110100010111011000100010000111111001111111000111110101011110100011000111110101011110000001000111110100010111011000100100000111111001111111000111110101011110100011000111110101011110000001000111110100010111011000100010001000010 3f3f8fabd18fabc08fa2ec483f3f8fabd18fabc08fa2ec443f3f8fabd18fabc08fa2ec483f3f8fabd18fabc08fa2ec4442
UTF-8 »óìªH»óìªD»óìªH»óìªDB 110000101000111111000010101110111100001110110011110000111010110011000010101010100100100011000010100011111100001010111011110000111011001111000011101011001100001010101010010001001100001010001111110000101011101111000011101100111100001110101100110000101010101001001000110000101000111111000010101110111100001110110011110000111010110011000010101010100100010001000010 c28fc2bbc3b3c3acc2aa48c28fc2bbc3b3c3acc2aa44c28fc2bbc3b3c3acc2aa48c28fc2bbc3b3c3acc2aa4442
UHC ????ªH????ªD????ªH????ªDB 0011111100111111001111110011111110101000101000110100100000111111001111110011111100111111101010001010001101000100001111110011111100111111001111111010100010100011010010000011111100111111001111110011111110101000101000110100010001000010 3f3f3f3fa8a3483f3f3f3fa8a3443f3f3f3fa8a3483f3f3f3fa8a34442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)