To what bit string is a character (or short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 霑ッ蛛憺杉蟲オ蟄コ霑ッ蛛憺杉蟲オ蟄ク^ 11101000101111111010111111100101100000011001110011101001100100001001100111100101101100111011010111100101101011011011101011101000101111111010111111100101100000011001110011101001100100001001100111100101101100111011010111100101101011011011100001011110 e8bfafe5819ce99099e5b3b5e5adbae8bfafe5819ce99099e5b3b5e5adb85e
EUC-JP 霑ッ蛛憺杉蟲オ蟄コ霑ッ蛛憺杉蟲オ蟄ク^ 11110000110000011000111010101111111010011110000111011000111010111011111111111001111010101011010110001110101101011110101010101111100011101011101011110000110000011000111010101111111010011110000111011000111010111011111111111001111010101011010110001110101101011110101010101111100011101011100001011110 f0c18eafe9e1d8ebbff9eab58eb5eaaf8ebaf0c18eafe9e1d8ebbff9eab58eb5eaaf8eb85e
UTF-8 霑ッ蛛憺杉蟲オ蟄コ霑ッ蛛憺杉蟲オ蟄ク^ 11101001100111001001000111101111101111011010111111101000100110111001101111100110100001101011101011100110100111011000100111101000100111111011001011101111101111011011010111101000100111111000010011101111101111011011101011101001100111001001000111101111101111011010111111101000100110111001101111100110100001101011101011100110100111011000100111101000100111111011001011101111101111011011010111101000100111111000010011101111101111011011100001011110 e99c91efbdafe89b9be686bae69d89e89fb2efbdb5e89f84efbdbae99c91efbdafe89b9be686bae69d89e89fb2efbdb5e89f84efbdb85e
UHC 霑?蛛憺杉蟲?蟄?霑?蛛憺杉蟲?蟄?^ 11101111110001010011111111110001110010001101001110111100110111111011010011110101111110010011111111110110110111100011111111101111110001010011111111110001110010001101001110111100110111111011010011110101111110010011111111110110110111100011111101011110 efc53ff1c8d3bcdfb4f5f93ff6de3fefc53ff1c8d3bcdfb4f5f93ff6de3f5e

SJIS-WIN, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
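
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the tool's own implementation. The function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration, and Python's codecs may differ from the tool's in a few edge-case mappings. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches what the ISO-8859-1 and UHC rows show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Unencodable characters become "?" (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "^"  # the last character of the table's input
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For example, `to_bit_strings("^", "utf-8")` returns `("01011110", "5e")`, the same trailing byte that ends every row of the table.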