To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 荳茨スヲ逍セ訷匁差霑エ闕千明訷匁差閹コ 111001001011100010001000111011111011110110100110111001111001011010111110111110111010010010010110111001101000110110110111111010001011111110110100111010001000110110010000111001111001011010111110111110111010010010010110111001101000110110110111111010001000011010111010 e4b888efbda6e796befba496e68db7e8bfb4e88d90e796befba496e68db7e886ba
EUC-JP 荳茨スヲ逍セ訷匁差霑エ闕千明訷匁差閹コ 11101000101110101011000011110001100011101011110110001110101001101110110111110110100011101011111010001111110111011101010011001100111010001011101010111001111100001100000110001110101101001110111111101101110000001110100111001100110000001000111111011101110101001100110011101000101110101011100111101111111001101000111010111010 e8bab0f18ebd8ea6edf68ebe8fddd4cce8bab9f0c18eb4efedc0e9ccc08fddd4cce8bab9efe68eba
UTF-8 荳茨スヲ逍セ訷匁差霑エ闕千明訷匁差閹コ 111010001000110110110011111010001000110010101000111011111011110110111101111011111011110110100110111010011000000010001101111011111011110110111110111010001010100010110111111001011000110010000001111001011011011110101110111010011001110010010001111011111011110110110100111010011001011110010101111001011000110110000011111001101001100010001110111010001010100010110111111001011000110010000001111001011011011110101110111010011001011010111001111011111011110110111010 e88db3e88ca8efbdbdefbda6e9808defbdbee8a8b7e58c81e5b7aee99c91efbdb4e99795e58d83e6988ee8a8b7e58c81e5b7aee996b9efbdba
UHC 荳茨??逍???差霑?闕千明??差?? 11010100111001011110110110111100001111110011111111100001110011100011111100111111001111111111001110101100111011111100010100111111110011111111010011110100101101101101100110100101001111110011111111110011101011000011111100111111 d4e5edbc3f3fe1ce3f3f3ff3acefc53fcff4f4b6d9a53f3ff3ac3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)