To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????d??????m??????j 001111110011111100111111001111110011111100111111011001000011111100111111001111110011111100111111001111110110110100111111001111110011111100111111001111110011111101101010 3f3f3f3f3f3f643f3f3f3f3f3f6d3f3f3f3f3f3f6a
SJIS-WIN 澳??曜??d澳??曜??m澳??曜??j 111000000101001100111111001111111001011101101010001111110011111101100100111000000101001100111111001111111001011101101010001111110011111101101101111000000101001100111111001111111001011101101010001111110011111101101010 e0533f3f976a3f3f64e0533f3f976a3f3f6de0533f3f976a3f3f6a
EUC-JP 澳??曜??d澳??曜??m澳??曜??j 110111111011010000111111001111111100110111001011001111110011111101100100110111111011010000111111001111111100110111001011001111110011111101101101110111111011010000111111001111111100110111001011001111110011111101101010 dfb43f3fcdcb3f3f64dfb43f3fcdcb3f3f6ddfb43f3fcdcb3f3f6a
UTF-8 澳랃슈曜뱄슴d澳랃슈曜댐슬m澳랃슈曜뱄스j 111001101011111010110011111010111001111010000011111011001000101010001000111001101001101110011100111010111011000110000100111011001000101010110100011001001110011010111110101100111110101110011110100000111110110010001010100010001110011010011011100111001110101110001100100100001110110010001010101011000110110111100110101111101011001111101011100111101000001111101100100010101000100011100110100110111001110011101011101100011000010011101100100010101010010001101010 e6beb3eb9e83ec8a88e69b9cebb184ec8ab464e6beb3eb9e83ec8a88e69b9ceb8c90ec8aac6de6beb3eb9e83ec8a88e69b9cebb184ec8aa46a
UHC 澳랃슈曜뱄슴d澳랃슈曜댐슬m澳랃슈曜뱄스j 111001111111111010001101111011111011110110110100111010001111100010111001111011111011110110111111011001001110011111111110100011011110111110111101101101001110100011111000101101001110111110111101101111010110110111100111111111101000110111101111101111011011010011101000111110001011100111101111101111011011101001101010 e7fe8defbdb4e8f8b9efbdbf64e7fe8defbdb4e8f8b4efbdbd6de7fe8defbdb4e8f8b9efbdba6a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)