To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????z??????????zB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101111010001111110011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN シト酌芝捨柴酌芝シウzシト酌芝捨柴酌芝シウzB 1011110011000100100011101101111010001110110001011000111011001100100011101100010010001110110111101000111011000101101111001011001101111010101111001100010010001110110111101000111011000101100011101100110010001110110001001000111011011110100011101100010110111100101100110111101001000010 bcc48ede8ec58ecc8ec48ede8ec5bcb37abcc48ede8ec58ecc8ec48ede8ec5bcb37a42
EUC-JP シト酌芝捨柴酌芝シウzシト酌芝捨柴酌芝シウzB 10001110101111001000111011000100101111001110000010111100110001111011110011001110101111001100011010111100111000001011110011000111100011101011110010001110101100110111101010001110101111001000111011000100101111001110000010111100110001111011110011001110101111001100011010111100111000001011110011000111100011101011110010001110101100110111101001000010 8ebc8ec4bce0bcc7bccebcc6bce0bcc78ebc8eb37a8ebc8ec4bce0bcc7bccebcc6bce0bcc78ebc8eb37a42
UTF-8 シト酌芝捨柴酌芝シウzシト酌芝捨柴酌芝シウzB 111011111011110110111100111011111011111010000100111010011000010110001100111010001000101010011101111001101000110110101000111001101001111110110100111010011000010110001100111010001000101010011101111011111011110110111100111011111011110110110011011110101110111110111101101111001110111110111110100001001110100110000101100011001110100010001010100111011110011010001101101010001110011010011111101101001110100110000101100011001110100010001010100111011110111110111101101111001110111110111101101100110111101001000010 efbdbcefbe84e9858ce88a9de68da8e69fb4e9858ce88a9defbdbcefbdb37aefbdbcefbe84e9858ce88a9de68da8e69fb4e9858ce88a9defbdbcefbdb37a42
UHC ??酌芝捨柴酌芝??z??酌芝捨柴酌芝??zB 0011111100111111111011011100110011110010101110011101111011010111111000111100001111101101110011001111001010111001001111110011111101111010001111110011111111101101110011001111001010111001110111101101011111100011110000111110110111001100111100101011100100111111001111110111101001000010 3f3fedccf2b9ded7e3c3edccf2b93f3f7a3f3fedccf2b9ded7e3c3edccf2b93f3f7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)