To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN シト蛇芝者柴社芝斜柴蛇芝者柴社芝赦質 10111100110001001000111011010110100011101100010110001110110100101000111011000100100011101101000010001110110001011000111011001110100011101100010010001110110101101000111011000101100011101101001010001110110001001000111011010000100011101100010110001110110011011000111010111111 bcc48ed68ec58ed28ec48ed08ec58ece8ec48ed68ec58ed28ec48ed08ec58ecd8ebf
EUC-JP シト蛇芝者柴社芝斜柴蛇芝者柴社芝赦質 100011101011110010001110110001001011110011011000101111001100011110111100110101001011110011000110101111001101001010111100110001111011110011010000101111001100011010111100110110001011110011000111101111001101010010111100110001101011110011010010101111001100011110111100110011111011110011000001 8ebc8ec4bcd8bcc7bcd4bcc6bcd2bcc7bcd0bcc6bcd8bcc7bcd4bcc6bcd2bcc7bccfbcc1
UTF-8 シト蛇芝者柴社芝斜柴蛇芝者柴社芝赦質 111011111011110110111100111011111011111010000100111010001001101110000111111010001000101010011101111010001000000010000101111001101001111110110100111001111010010010111110111010001000101010011101111001101001011010011100111001101001111110110100111010001001101110000111111010001000101010011101111010001000000010000101111001101001111110110100111001111010010010111110111010001000101010011101111010001011010110100110111010001011001110101010 efbdbcefbe84e89b87e88a9de88085e69fb4e7a4bee88a9de6969ce69fb4e89b87e88a9de88085e69fb4e7a4bee88a9de8b5a6e8b3aa
UHC ??蛇芝者柴社芝斜柴蛇芝者柴社芝赦質 00111111001111111101111011101111111100101011100111101101101110101110001111000011110111101110010011110010101110011101111011011000111000111100001111011110111011111111001010111001111011011011101011100011110000111101111011100100111100101011100111011110111101011111001011110101 3f3fdeeff2b9edbae3c3dee4f2b9ded8e3c3deeff2b9edbae3c3dee4f2b9def5f2f5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)