To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN セ、竺セャ社セュ爾セ、詔ニセ、竺セャ社セュ爾セ、証ム^ 1011111010100100100011101011000110111110101011001000111011010000101111101010110110001110101000101011111010100100100011111101100111000110101111101010010010001110101100011011111010101100100011101101000010111110101011011000111010100010101111101010010010001111110110001101000101011110 bea48eb1beac8ed0bead8ea2bea48fd9c6bea48eb1beac8ed0bead8ea2bea48fd8d15e
EUC-JP セ、竺セャ社セュ爾セ、詔ニセ、竺セャ社セュ爾セ、証ム^ 1000111010111110100011101010010010111100101100111000111010111110100011101010110010111100110100101000111010111110100011101010110110111100101001001000111010111110100011101010010010111110110110111000111011000110100011101011111010001110101001001011110010110011100011101011111010001110101011001011110011010010100011101011111010001110101011011011110010100100100011101011111010001110101001001011111011011010100011101101000101011110 8ebe8ea4bcb38ebe8eacbcd28ebe8eadbca48ebe8ea4bedb8ec68ebe8ea4bcb38ebe8eacbcd28ebe8eadbca48ebe8ea4beda8ed15e
UTF-8 セ、竺セャ社セュ爾セ、詔ニセ、竺セャ社セュ爾セ、証ム^ 11101111101111011011111011101111101111011010010011100111101010111011101011101111101111011011111011101111101111011010110011100111101001001011111011101111101111011011111011101111101111011010110111100111100010001011111011101111101111011011111011101111101111011010010011101000101010011001010011101111101111101000011011101111101111011011111011101111101111011010010011100111101010111011101011101111101111011011111011101111101111011010110011100111101001001011111011101111101111011011111011101111101111011010110111100111100010001011111011101111101111011011111011101111101111011010010011101000101010001011110011101111101111101001000101011110 efbdbeefbda4e7abbaefbdbeefbdace7a4beefbdbeefbdade788beefbdbeefbda4e8a994efbe86efbdbeefbda4e7abbaefbdbeefbdace7a4beefbdbeefbdade788beefbdbeefbda4e8a8bcefbe915e
UHC ??竺??社??爾??詔???竺??社??爾????^ 00111111001111111111010111100111001111110011111111011110111001000011111100111111111011001011001100111111001111111111000011011111001111110011111100111111111101011110011100111111001111111101111011100100001111110011111111101100101100110011111100111111001111110011111101011110 3f3ff5e73f3fdee43f3fecb33f3ff0df3f3f3ff5e73f3fdee43f3fecb33f3f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)