Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN セ、柴セュ爾セャ胥爾柴セ、柴セュ爾セャ胥爾柴B 101111101010010010001110110001001011111010101101100011101010001010111110101011001110001111101111100011101010001010001110110001001011111010100100100011101100010010111110101011011000111010100010101111101010110011100011111011111000111010100010100011101100010001000010 bea48ec4bead8ea2beace3ef8ea28ec4bea48ec4bead8ea2beace3ef8ea28ec442
EUC-JP セ、柴セュ爾セャ胥爾柴セ、柴セュ爾セャ胥爾柴B 100011101011111010001110101001001011110011000110100011101011111010001110101011011011110010100100100011101011111010001110101011001110011011110001101111001010010010111100110001101000111010111110100011101010010010111100110001101000111010111110100011101010110110111100101001001000111010111110100011101010110011100110111100011011110010100100101111001100011001000010 8ebe8ea4bcc68ebe8eadbca48ebe8eace6f1bca4bcc68ebe8ea4bcc68ebe8eadbca48ebe8eace6f1bca4bcc642
UTF-8 セ、柴セュ爾セャ胥爾柴セ、柴セュ爾セャ胥爾柴B 11101111101111011011111011101111101111011010010011100110100111111011010011101111101111011011111011101111101111011010110111100111100010001011111011101111101111011011111011101111101111011010110011101000100000111010010111100111100010001011111011100110100111111011010011101111101111011011111011101111101111011010010011100110100111111011010011101111101111011011111011101111101111011010110111100111100010001011111011101111101111011011111011101111101111011010110011101000100000111010010111100111100010001011111011100110100111111011010001000010 efbdbeefbda4e69fb4efbdbeefbdade788beefbdbeefbdace883a5e788bee69fb4efbdbeefbda4e69fb4efbdbeefbdade788beefbdbeefbdace883a5e788bee69fb442
UHC ??柴??爾??胥爾柴??柴??爾??胥爾柴B 001111110011111111100011110000110011111100111111111011001011001100111111001111111110000010100001111011001011001111100011110000110011111100111111111000111100001100111111001111111110110010110011001111110011111111100000101000011110110010110011111000111100001101000010 3f3fe3c33f3fecb33f3fe0a1ecb3e3c33f3fe3c33f3fecb33f3fe0a1ecb3e3c342

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
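
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The function name `encode_to_bits` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed equivalents of the charsets listed above. Replacing unencodable characters with `?` (byte 3f) is also an assumption, inferred from the 3f bytes in the ISO-8859-1 and UHC rows.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the note above states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed: UHC (Unified Hangul Code) as CP949
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f),
    which is an assumption about how the converter behaves.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "柴B"
    for label, codec in CHARSETS.items():
        bits, hexed = encode_to_bits(sample, codec)
        print(label, sample, bits, hexed)
```

Each output row has the same columns as the table: charset, character, binary bit string, and hexadecimal bit string. Multi-byte charsets produce a binary column that is a multiple of 8 bits long, one group of 8 per encoded byte.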