Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?莎?莎慢指????莎?莎慢指???^ 001111111110010010110011001111111110010010110011100101101001110110001110011101110011111100111111001111110011111111100100101100110011111111100100101100111001011010011101100011100111011100111111001111110011111101011110 3fe4b33fe4b3969d8e773f3f3f3fe4b33fe4b3969d8e773f3f3f5e
EUC-JP 蔣莎?莎慢指???蔣莎?莎慢指???^ 10001111110110011011011011101000101101010011111111101000101101011100101111111101101110111101100000111111001111110011111110001111110110011011011011101000101101010011111111101000101101011100101111111101101110111101100000111111001111110011111101011110 8fd9b6e8b53fe8b5cbfdbbd83f3f3f8fd9b6e8b53fe8b5cbfdbbd83f3f3f5e
UTF-8 蔣莎렍莎慢指편렮쁩蔣莎렍莎慢指편렮쁠^ 11101000100101001010001111101000100011101000111011101011101000001000110111101000100011101000111011100110100001011010001011100110100011001000011111101101100011101011100011101011101000001010111011101100100000011010100111101000100101001010001111101000100011101000111011101011101000001000110111101000100011101000111011100110100001011010001011100110100011001000011111101101100011101011100011101011101000001010111011101100100000011010000001011110 e894a3e88e8eeba08de88e8ee685a2e68c87ed8eb8eba0aeec81a9e894a3e88e8eeba08de88e8ee685a2e68c87ed8eb8eba0aeec81a05e
UHC 蔣莎렍莎慢指편렮쁩蔣莎렍莎慢指편렮쁠^ 11101101111110001101111011101101100011101010001111011110111011011101100010110111111100101010011011000110111011011000111010111011101110111101111011101101111110001101111011101101100011101010001111011110111011011101100010110111111100101010011011000110111011011000111010111011101110111101110001011110 edf8deed8ea3deedd8b7f2a6c6ed8ebbbbdeedf8deed8ea3deedd8b7f2a6c6ed8ebbbbdc5e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
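
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the function name `encode_all` is hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be how the table treats them.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexed in encode_all("莎^"):
        print(label, bits, hexed)
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the two columns are always the same data in different notation.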