To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????P}?????????P{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101000001111101001111110011111100111111001111110011111100111111001111110011111100111111010100000111101101011110 3f3f3f3f3f3f3f3f3f507d3f3f3f3f3f3f3f3f3f507b5e
SJIS-WIN シマシイ霪シユシ・P}シマシイ霪シユシ・P{^ 10111100110011111011110010110010111010001100010010111100110101011011110010100101010100000111110110111100110011111011110010110010111010001100010010111100110101011011110010100101010100000111101101011110 bccfbcb2e8c4bcd5bca5507dbccfbcb2e8c4bcd5bca5507b5e
EUC-JP シマシイ霪シユシ・P}シマシイ霪シユシ・P{^ 1000111010111100100011101100111110001110101111001000111010110010111100001100011010001110101111001000111011010101100011101011110010001110101001010101000001111101100011101011110010001110110011111000111010111100100011101011001011110000110001101000111010111100100011101101010110001110101111001000111010100101010100000111101101011110 8ebc8ecf8ebc8eb2f0c68ebc8ed58ebc8ea5507d8ebc8ecf8ebc8eb2f0c68ebc8ed58ebc8ea5507b5e
UTF-8 シマシイ霪シユシ・P}シマシイ霪シユシ・P{^ 1110111110111101101111001110111110111110100011111110111110111101101111001110111110111101101100101110100110011100101010101110111110111101101111001110111110111110100101011110111110111101101111001110111110111101101001010101000001111101111011111011110110111100111011111011111010001111111011111011110110111100111011111011110110110010111010011001110010101010111011111011110110111100111011111011111010010101111011111011110110111100111011111011110110100101010100000111101101011110 efbdbcefbe8fefbdbcefbdb2e99caaefbdbcefbe95efbdbcefbda5507defbdbcefbe8fefbdbcefbdb2e99caaefbdbcefbe95efbdbcefbda5507b5e
UHC ?????????P}?????????P{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101000001111101001111110011111100111111001111110011111100111111001111110011111100111111010100000111101101011110 3f3f3f3f3f3f3f3f3f507d3f3f3f3f3f3f3f3f3f507b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)