To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 澳??澳??俉??v澳??澳??俉??vB 111000000101001100111111001111111110000001010011001111110011111111111010011000010011111100111111011101101110000001010011001111110011111111100000010100110011111100111111111110100110000100111111001111110111011001000010 e0533f3fe0533f3ffa613f3f76e0533f3fe0533f3ffa613f3f7642
EUC-JP 澳??澳??俉??v澳??澳??俉??vB 1101111110110100001111110011111111011111101101000011111100111111100011111011000110111011001111110011111101110110110111111011010000111111001111111101111110110100001111110011111110001111101100011011101100111111001111110111011001000010 dfb43f3fdfb43f3f8fb1bb3f3f76dfb43f3fdfb43f3f8fb1bb3f3f7642
UTF-8 澳랃슛澳묕쉈俉드눀v澳랃슛澳묕쉈俉드눀vB 111001101011111010110011111010111001111010000011111011001000101010011011111001101011111010110011111010111010110010010101111011001000100110001000111001001011111110001001111010111001001110011100111010111000100010000000011101101110011010111110101100111110101110011110100000111110110010001010100110111110011010111110101100111110101110101100100101011110110010001001100010001110010010111111100010011110101110010011100111001110101110001000100000000111011001000010 e6beb3eb9e83ec8a9be6beb3ebac95ec8988e4bf89eb939ceb888076e6beb3eb9e83ec8a9be6beb3ebac95ec8988e4bf89eb939ceb88807642
UHC 澳랃슛澳묕쉈俉드눀v澳랃슛澳묕쉈俉드눀vB 111001111111111010001101111011111011110110111000111001111111111010010001111011111011110110100101111001111110101110110101111001011000011110100001011101101110011111111110100011011110111110111101101110001110011111111110100100011110111110111101101001011110011111101011101101011110010110000111101000010111011001000010 e7fe8defbdb8e7fe91efbda5e7ebb5e587a176e7fe8defbdb8e7fe91efbda5e7ebb5e587a17642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)