To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 藥??押??倭??}藥??押??倭??{^ 111001010101101000111111001111111000100110011111001111110011111110011000011000000011111100111111011111011110010101011010001111110011111110001001100111110011111100111111100110000110000000111111001111110111101101011110 e55a3f3f899f3f3f98603f3f7de55a3f3f899f3f3f98603f3f7b5e
EUC-JP 藥??押??倭??}藥??押??倭??{^ 111010011011101100111111001111111011001010100001001111110011111111001111110000010011111100111111011111011110100110111011001111110011111110110010101000010011111100111111110011111100000100111111001111110111101101011110 e9bb3f3fb2a13f3fcfc13f3f7de9bb3f3fb2a13f3fcfc13f3f7b5e
UTF-8 藥썹텥押띄윮倭좄뇮}藥썹텥押띄윮倭좄뇮{^ 111010001001011110100101111011001000110110111001111011011000010110100101111001101000101010111100111010111001110110000100111011001001110010101110111001011000000010101101111011001010001010000100111010111000011110101110011111011110100010010111101001011110110010001101101110011110110110000101101001011110011010001010101111001110101110011101100001001110110010011100101011101110010110000000101011011110110010100010100001001110101110000111101011100111101101011110 e897a5ec8db9ed85a5e68abceb9d84ec9caee580adeca284eb87ae7de897a5ec8db9ed85a5e68abceb9d84ec9caee580adeca284eb87ae7b5e
UHC 藥썹텥押띄윮倭좄뇮}藥썹텥押띄윮倭좄뇮{^ 111001011011011110111101111001111011011010011010111001001110001110110110111001111001111110101101111010001101111010100000111010001000011110010011011111011110010110110111101111011110011110110110100110101110010011100011101101101110011110011111101011011110100011011110101000001110100010000111100100110111101101011110 e5b7bde7b69ae4e3b6e79fade8dea0e887937de5b7bde7b69ae4e3b6e79fade8dea0e887937b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)