To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 霑「蝣、譴カ霑「蟋急霑「蝣、譴カ霑「蟋宮^ 111010001011111110100010111001011010000010100100111001101010001010110110111010001011111110100010111001011010011110001011011111011110100010111111101000101110010110100000101001001110011010100010101101101110100010111111101000101110010110100111100010110111101101011110 e8bfa2e5a0a4e6a2b6e8bfa2e5a78b7de8bfa2e5a0a4e6a2b6e8bfa2e5a78b7b5e
EUC-JP 霑「蝣、譴カ霑「蟋急霑「蝣、譴カ霑「蟋宮^ 1111000011000001100011101010001011101010101000101000111010100100111011001010010010001110101101101111000011000001100011101010001011101010101010011011010111011110111100001100000110001110101000101110101010100010100011101010010011101100101001001000111010110110111100001100000110001110101000101110101010101001101101011101110001011110 f0c18ea2eaa28ea4eca48eb6f0c18ea2eaa9b5def0c18ea2eaa28ea4eca48eb6f0c18ea2eaa9b5dc5e
UTF-8 霑「蝣、譴カ霑「蟋急霑「蝣、譴カ霑「蟋宮^ 11101001100111001001000111101111101111011010001011101000100111011010001111101111101111011010010011101000101011011011010011101111101111011011011011101001100111001001000111101111101111011010001011101000100111111000101111100110100000001010010111101001100111001001000111101111101111011010001011101000100111011010001111101111101111011010010011101000101011011011010011101111101111011011011011101001100111001001000111101111101111011010001011101000100111111000101111100101101011101010111001011110 e99c91efbda2e89da3efbda4e8adb4efbdb6e99c91efbda2e89f8be680a5e99c91efbda2e89da3efbda4e8adb4efbdb6e99c91efbda2e89f8be5aeae5e
UHC 霑???譴?霑??急霑???譴?霑??宮^ 1110111111000101001111110011111100111111110011001011100100111111111011111100010100111111001111111101000011100001111011111100010100111111001111110011111111001100101110010011111111101111110001010011111100111111110011111110000001011110 efc53f3f3fccb93fefc53f3fd0e1efc53f3f3fccb93fefc53f3fcfe05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)