To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 檍??議??飮??D檍??議??飮??D^ 100111101111100000111111001111111000101101100011001111110011111110011111010110100011111100111111010001001001111011111000001111110011111110001011011000110011111100111111100111110101101000111111001111110100010001011110 9ef83f3f8b633f3f9f5a3f3f449ef83f3f8b633f3f9f5a3f3f445e
EUC-JP 檍??議??飮??D檍??議??飮??D^ 110111001111101000111111001111111011010111000100001111110011111111011101101110110011111100111111010001001101110011111010001111110011111110110101110001000011111100111111110111011011101100111111001111110100010001011110 dcfa3f3fb5c43f3fddbb3f3f44dcfa3f3fb5c43f3fddbb3f3f445e
UTF-8 檍용낄議뗦에飮뉗젴D檍용낄議뗦에飮뉗젴D^ 111001101010101010001101111011001001101010101001111010111000001010000100111010001010110110110000111010111001011110100110111011001001011110010000111010011010001110101110111010111000100110010111111011001010000010110100010001001110011010101010100011011110110010011010101010011110101110000010100001001110100010101101101100001110101110010111101001101110110010010111100100001110100110100011101011101110101110001001100101111110110010100000101101000100010001011110 e6aa8dec9aa9eb8284e8adb0eb97a6ec9790e9a3aeeb8997eca0b444e6aa8dec9aa9eb8284e8adb0eb97a6ec9790e9a3aeeb8997eca0b4445e
UHC 檍용낄議뗦에飮뉗젴D檍용낄議뗦에飮뉗젴D^ 111001011110010110111111111010111011001110100101111011001010000110001011111001101011111110100001111010111110011010000111111011001010000010101000010001001110010111100101101111111110101110110011101001011110110010100001100010111110011010111111101000011110101111100110100001111110110010100000101010000100010001011110 e5e5bfebb3a5eca18be6bfa1ebe687eca0a844e5e5bfebb3a5eca18be6bfa1ebe687eca0a8445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)