To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????E 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 雋ゑスイ霑ェ諛亥シ該雋ゑスイ霑ェ諛亥シ鈎E 111010001011001010000010111011111011110110110010111010001011111110101010111001101000011110001000111001011011110010001010010110011110100010110010100000101110111110111101101100101110100010111111101010101110011010000111100010001110010110111100100010100110001001000101 e8b282efbdb2e8bfaae68788e5bc8a59e8b282efbdb2e8bfaae68788e5bc8a6245
EUC-JP 雋ゑスイ霑ェ諛亥シ該雋ゑスイ霑ェ諛亥シ鈎E 1111000010110100101001001111000110001110101111011000111010110010111100001100000110001110101010101110101111100111101100001110011110001110101111001011001110111010111100001011010010100100111100011000111010111101100011101011001011110000110000011000111010101010111010111110011110110000111001111000111010111100101100111100001101000101 f0b4a4f18ebd8eb2f0c18eaaebe7b0e78ebcb3baf0b4a4f18ebd8eb2f0c18eaaebe7b0e78ebcb3c345
UTF-8 雋ゑスイ霑ェ諛亥シ該雋ゑスイ霑ェ諛亥シ鈎E 11101001100110111000101111100011100000101001000111101111101111011011110111101111101111011011001011101001100111001001000111101111101111011010101011101000101010111001101111100100101110101010010111101111101111011011110011101000101010011011001011101001100110111000101111100011100000101001000111101111101111011011110111101111101111011011001011101001100111001001000111101111101111011010101011101000101010111001101111100100101110101010010111101111101111011011110011101001100010001000111001000101 e99b8be38291efbdbdefbdb2e99c91efbdaae8ab9be4baa5efbdbce8a9b2e99b8be38291efbdbdefbdb2e99c91efbdaae8ab9be4baa5efbdbce9888e45
UHC 雋ゑ??霑?諛亥?該雋ゑ??霑?諛亥??E 1111000111100110101010101111000100111111001111111110111111000101001111111110101110110000111110101010010000111111111110101011000111110001111001101010101011110001001111110011111111101111110001010011111111101011101100001111101010100100001111110011111101000101 f1e6aaf13f3fefc53febb0faa43ffab1f1e6aaf13f3fefc53febb0faa43f3f45

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
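
The conversion shown in the table above can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest Python equivalents, and the helper names `to_bit_strings` and `convert` are made up for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the ISO-8859-1 row above.

```python
# Sketch only: these Python codec names are assumed equivalents of the
# charsets listed in the table; the page's own converter may differ slightly.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset name: (binary, hexadecimal)} for every charset in CHARSETS."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("E").items():
        print(name, binary, hexadecimal)
```

For the letter "E", every charset listed here yields the single byte 45 (01000101), which is why each row of the table ends with those digits.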