To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?莎?莎?基?????莎?莎?基????^ 001111111110010010110011001111111110010010110011001111111000101011101110001111110011111100111111001111110011111111100100101100110011111111100100101100110011111110001010111011100011111100111111001111110011111101011110 3fe4b33fe4b33f8aee3f3f3f3f3fe4b33fe4b33f8aee3f3f3f3f5e
EUC-JP 蔣莎?莎?基????蔣莎?莎?基????^ 10001111110110011011011011101000101101010011111111101000101101010011111110110100111100000011111100111111001111110011111110001111110110011011011011101000101101010011111111101000101101010011111110110100111100000011111100111111001111110011111101011110 8fd9b6e8b53fe8b53fb4f03f3f3f3f8fd9b6e8b53fe8b53fb4f03f3f3f3f5e
UTF-8 蔣莎렍莎렍基렑몇렢룁蔣莎렍莎렍基렑몇렢뢸^ 11101000100101001010001111101000100011101000111011101011101000001000110111101000100011101000111011101011101000001000110111100101100111111011101011101011101000001001000111101011101010101000011111101011101000001010001011101011101000111000000111101000100101001010001111101000100011101000111011101011101000001000110111101000100011101000111011101011101000001000110111100101100111111011101011101011101000001001000111101011101010101000011111101011101000001010001011101011101000101011100001011110 e894a3e88e8eeba08de88e8eeba08de59fbaeba091ebaa87eba0a2eba381e894a3e88e8eeba08de88e8eeba08de59fbaeba091ebaa87eba0a2eba2b85e
UHC 蔣莎렍莎렍基렑몇렢룁蔣莎렍莎렍基렑몇렢뢸^ 1110110111111000110111101110110110001110101000111101111011101101100011101010001111010000111100011000111010100110101110001110111010001110101100111011011111011110111011011111100011011110111011011000111010100011110111101110110110001110101000111101000011110001100011101010011010111000111011101000111010110011101101111101110001011110 edf8deed8ea3deed8ea3d0f18ea6b8ee8eb3b7deedf8deed8ea3deed8ea3d0f18ea6b8ee8eb3b7dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)