To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN セャ璽セュ鴆爾竺セャ謝セャ璽セュ鴆爾竺セャ謝B 101111101010110010001110101000111011111010101101111010011110111110001110101000101000111010110001101111101010110010001110110100111011111010101100100011101010001110111110101011011110100111101111100011101010001010001110101100011011111010101100100011101101001101000010 beac8ea3beade9ef8ea28eb1beac8ed3beac8ea3beade9ef8ea28eb1beac8ed342
EUC-JP セャ璽セュ鴆爾竺セャ謝セャ璽セュ鴆爾竺セャ謝B 100011101011111010001110101011001011110010100101100011101011111010001110101011011111001011110001101111001010010010111100101100111000111010111110100011101010110010111100110101011000111010111110100011101010110010111100101001011000111010111110100011101010110111110010111100011011110010100100101111001011001110001110101111101000111010101100101111001101010101000010 8ebe8eacbca58ebe8eadf2f1bca4bcb38ebe8eacbcd58ebe8eacbca58ebe8eadf2f1bca4bcb38ebe8eacbcd542
UTF-8 セャ璽セュ鴆爾竺セャ謝セャ璽セュ鴆爾竺セャ謝B 11101111101111011011111011101111101111011010110011100111100100101011110111101111101111011011111011101111101111011010110111101001101101001000011011100111100010001011111011100111101010111011101011101111101111011011111011101111101111011010110011101000101011001001110111101111101111011011111011101111101111011010110011100111100100101011110111101111101111011011111011101111101111011010110111101001101101001000011011100111100010001011111011100111101010111011101011101111101111011011111011101111101111011010110011101000101011001001110101000010 efbdbeefbdace792bdefbdbeefbdade9b486e788bee7abbaefbdbeefbdace8ac9defbdbeefbdace792bdefbdbeefbdade9b486e788bee7abbaefbdbeefbdace8ac9d42
UHC ??璽???爾竺??謝??璽???爾竺??謝B 00111111001111111101111111011110001111110011111100111111111011001011001111110101111001110011111100111111110111101111001100111111001111111101111111011110001111110011111100111111111011001011001111110101111001110011111100111111110111101111001101000010 3f3fdfde3f3f3fecb3f5e73f3fdef33f3fdfde3f3f3fecb3f5e73f3fdef342
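
The rows above can be approximated with a short Python sketch. This is not necessarily how the page itself computes its output. The helper name encode_row and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the ISO-8859-1 and UHC rows.

```python
def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec lacks.
    raw = text.encode(codec, errors="replace")
    # Each byte is rendered as 8 bits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "\uff7eB"  # halfwidth katakana SE followed by "B"
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

The trailing "B" encodes to the single byte 42 in every charset listed, which is why each hexadecimal string in the table ends in 42.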

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).