To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 魘牙ョ、蠇懷シ州N}魘牙ョ、蠇懷シ州N{^ 11101001101101001000100111100101101011101010010011111011101000011001110011100101101111001000111101000010010011100111110111101001101101001000100111100101101011101010010011111011101000011001110011100101101111001000111101000010010011100111101101011110 e9b489e5aea4fba19ce5bc8f424e7de9b489e5aea4fba19ce5bc8f424e7b5e
EUC-JP 魘牙ョ、?懷シ州N}魘牙ョ、?懷シ州N{^ 1111001010110110101100101110011110001110101011101000111010100100001111111101100011100111100011101011110010111101101000110100111001111101111100101011011010110010111001111000111010101110100011101010010000111111110110001110011110001110101111001011110110100011010011100111101101011110 f2b6b2e78eae8ea43fd8e78ebcbda34e7df2b6b2e78eae8ea43fd8e78ebcbda34e7b5e
UTF-8 魘牙ョ、蠇懷シ州N}魘牙ョ、蠇懷シ州N{^ 1110100110101101100110001110011110001001100110011110111110111101101011101110111110111101101001001110100010100000100001111110011010000111101101111110111110111101101111001110010110110111100111100100111001111101111010011010110110011000111001111000100110011001111011111011110110101110111011111011110110100100111010001010000010000111111001101000011110110111111011111011110110111100111001011011011110011110010011100111101101011110 e9ad98e78999efbdaeefbda4e8a087e687b7efbdbce5b79e4e7de9ad98e78999efbdaeefbda4e8a087e687b7efbdbce5b79e4e7b5e
UHC ?牙???懷?州N}?牙???懷?州N{^ 001111111110010010110011001111110011111100111111111111001110001100111111111100011011011001001110011111010011111111100100101100110011111100111111001111111111110011100011001111111111000110110110010011100111101101011110 3fe4b33f3f3ffce33ff1b64e7d3fe4b33f3f3ffce33ff1b64e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)