Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????BB 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4242
SJIS-WIN シセシオシセシミシセシナシセシア疾鴃蒔BB 101111001011111010111100101101011011110010111110101111001101000010111100101111101011110011000101101111001011111010111100101100011000111010111110111010011110111010001110101010100100001001000010 bcbebcb5bcbebcd0bcbebcc5bcbebcb18ebee9ee8eaa4242
EUC-JP シセシオシセシミシセシナシセシア疾鴃蒔BB 10001110101111001000111010111110100011101011110010001110101101011000111010111100100011101011111010001110101111001000111011010000100011101011110010001110101111101000111010111100100011101100010110001110101111001000111010111110100011101011110010001110101100011011110011000000111100101111000010111100101011000100001001000010 8ebc8ebe8ebc8eb58ebc8ebe8ebc8ed08ebc8ebe8ebc8ec58ebc8ebe8ebc8eb1bcc0f2f0bcac4242
UTF-8 シセシオシセシミシセシナシセシア疾鴃蒔BB 1110111110111101101111001110111110111101101111101110111110111101101111001110111110111101101101011110111110111101101111001110111110111101101111101110111110111101101111001110111110111110100100001110111110111101101111001110111110111101101111101110111110111101101111001110111110111110100001011110111110111101101111001110111110111101101111101110111110111101101111001110111110111101101100011110011110010110101111101110100110110100100000111110100010010010100101000100001001000010 efbdbcefbdbeefbdbcefbdb5efbdbcefbdbeefbdbcefbe90efbdbcefbdbeefbdbcefbe85efbdbcefbdbeefbdbcefbdb1e796bee9b483e892944242
UHC ????????????????疾?蒔BB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111111110010111100000011111111100011110010000100001001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3ff2f03fe3c84242

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
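
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The function names `encode_row` and `encode_table` are hypothetical, and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what the table does for ISO-8859-1 and UHC.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("疾B").items():
        print(label, binary, hexadecimal)
```

Byte values produced by Python's codecs may differ from this page's output for characters where the two implementations use different mapping tables, so treat the sketch as a way to explore the idea rather than as a reference.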