To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 霑ク邇門梭驍擾スソ霑ク邇門梭驍擾スソ^ 11101000101111111011100011100111100011101001011011100101100111101000100011101001100000101000111111101111101111011011111111101000101111111011100011100111100011101001011011100101100111101000100011101001100000101000111111101111101111011011111101011110 e8bfb8e78e96e59e88e9828fefbdbfe8bfb8e78e96e59e88e9828fefbdbf5e
EUC-JP 霑ク邇門梭驍擾スソ霑ク邇門梭驍擾スソ^ 11110000110000011000111010111000111011011110111011001100111001111101101111101000111100011110001010111110111100011000111010111101100011101011111111110000110000011000111010111000111011011110111011001100111001111101101111101000111100011110001010111110111100011000111010111101100011101011111101011110 f0c18eb8edeecce7dbe8f1e2bef18ebd8ebff0c18eb8edeecce7dbe8f1e2bef18ebd8ebf5e
UTF-8 霑ク邇門梭驍擾スソ霑ク邇門梭驍擾スソ^ 11101001100111001001000111101111101111011011100011101001100000101000011111101001100101101000000011100110101000101010110111101001101010011000110111100110100100111011111011101111101111011011110111101111101111011011111111101001100111001001000111101111101111011011100011101001100000101000011111101001100101101000000011100110101000101010110111101001101010011000110111100110100100111011111011101111101111011011110111101111101111011011111101011110 e99c91efbdb8e98287e99680e6a2ade9a98de693beefbdbdefbdbfe99c91efbdb8e98287e99680e6a2ade9a98de693beefbdbdefbdbf5e
UHC 霑?邇門梭驍擾??霑?邇門梭驍擾??^ 11101111110001010011111111101100110001001101101010100110110111101101110011111101101001001110100011110110001111110011111111101111110001010011111111101100110001001101101010100110110111101101110011111101101001001110100011110110001111110011111101011110 efc53fecc4daa6dedcfda4e8f63f3fefc53fecc4daa6dedcfda4e8f63f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)