What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 霑ッ蜀怜コカ霑エ譫晁ソッ蜀怜コカ霑エ譁ス^ 11101000101111111010111111100101100001101001011111100101101110101011011011101000101111111011010011100110100111101001110111101000101111111010111111100101100001101001011111100101101110101011011011101000101111111011010011100110100101101011110101011110 e8bfafe58697e5bab6e8bfb4e69e9de8bfafe58697e5bab6e8bfb4e696bd5e
EUC-JP 霑ッ蜀怜コカ霑エ譫晁ソッ蜀怜コカ霑エ譁ス^ 1111000011000001100011101010111111101001111001101100111011100111100011101011101010001110101101101111000011000001100011101011010011101011111111101101101011101010100011101011111110001110101011111110100111100110110011101110011110001110101110101000111010110110111100001100000110001110101101001110101111110110100011101011110101011110 f0c18eafe9e6cee78eba8eb6f0c18eb4ebfedaea8ebf8eafe9e6cee78eba8eb6f0c18eb4ebf68ebd5e
UTF-8 霑ッ蜀怜コカ霑エ譫晁ソッ蜀怜コカ霑エ譁ス^ 11101001100111001001000111101111101111011010111111101000100111001000000011100110100000001001110011101111101111011011101011101111101111011011011011101001100111001001000111101111101111011011010011101000101011011010101111100110100110011000000111101111101111011011111111101111101111011010111111101000100111001000000011100110100000001001110011101111101111011011101011101111101111011011011011101001100111001001000111101111101111011011010011101000101011011000000111101111101111011011110101011110 e99c91efbdafe89c80e6809cefbdbaefbdb6e99c91efbdb4e8adabe69981efbdbfefbdafe89c80e6809cefbdbaefbdb6e99c91efbdb4e8ad81efbdbd5e
UHC 霑?蜀怜??霑??晁??蜀怜??霑?譁?^ 111011111100010100111111111101011011100111010110101110110011111100111111111011111100010100111111001111111111000011000101001111110011111111110101101110011101011010111011001111110011111111101111110001010011111111111100101001100011111101011110 efc53ff5b9d6bb3f3fefc53f3ff0c53f3ff5b9d6bb3f3fefc53ffca63f5e

SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
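
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption, and the helper names encode_row and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Encode text with one codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in convert("あ^").items():
        print(name, bits, hexstr)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.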