To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????姐??拌¬????姐??拌¬B 00111111001111110011111100111111100010001011011100111111001111111001110101100001100000011100101000111111001111110011111100111111100010001011011100111111001111111001110101100001100000011100101001000010 3f3f3f3f88b73f3f9d6181ca3f3f3f3f88b73f3f9d6181ca42
EUC-JP 焌???姐?邕拌¬焌???姐?邕拌¬B 100011111100100111101000001111110011111100111111101100001011100100111111100011111110000111101101110110011100001010100010110011001000111111001001111010000011111100111111001111111011000010111001001111111000111111100001111011011101100111000010101000101100110001000010 8fc9e83f3f3fb0b93f8fe1edd9c2a2cc8fc9e83f3f3fb0b93f8fe1edd9c2a2cc42
UTF-8 焌띳렰렡姐렒邕拌¬焌띳렰렡姐렒邕拌¬B 11100111100001001000110011101011100111011011001111101011101000001011000011101011101000001010000111100101101001111001000011101011101000001001001011101001100000101001010111100110100010111000110011101111101111111010001011100111100001001000110011101011100111011011001111101011101000001011000011101011101000001010000111100101101001111001000011101011101000001001001011101001100000101001010111100110100010111000110011101111101111111010001001000010 e7848ceb9db3eba0b0eba0a1e5a790eba092e98295e68b8cefbfa2e7848ceb9db3eba0b0eba0a1e5a790eba092e98295e68b8cefbfa242
UHC 焌띳렰렡姐렒邕拌¬焌띳렰렡姐렒邕拌¬B 11110001111000001011011011110001100011101011110110001110101100101110111010111011100011101010011111101000101110111101101011100101101000011111111011110001111000001011011011110001100011101011110110001110101100101110111010111011100011101010011111101000101110111101101011100101101000011111111001000010 f1e0b6f18ebd8eb2eebb8ea7e8bbdae5a1fef1e0b6f18ebd8eb2eebb8ea7e8bbdae5a1fe42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
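
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `to_bit_strings` and the mapping of the table's charset labels to Python codec names (for example SJIS-Win to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the 3f bytes in the rows above, although the exact mapping tables of Python's codecs may differ from those of the tool for some characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text):
    """Return {label: (binary string, hex string)} for each charset.

    Characters that cannot be encoded in a charset become "?" (0x3f).
    """
    rows = {}
    for label, codec in CHARSETS.items():
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    # Print one row per charset for a short sample string.
    for label, (binary, hexadecimal) in to_bit_strings("あB").items():
        print(label, binary, hexadecimal)
```

With this sketch, the ASCII letter "B" yields 01000010 (42) in every charset, while a Japanese character such as "あ" yields different byte sequences in SJIS-Win, EUC-JP, and UTF-8 and falls back to 3f in ISO-8859-1.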