To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 荳茨スソ赱シ﨟サ銜配荳茨スソ赱シ﨟サ銜配B 111001001011100010001000111011111011110110111111111001101101111110111100111110111001110110111011111001111111000010010100011110101110010010111000100010001110111110111101101111111110011011011111101111001111101110011101101110111110011111110000100101000111101001000010 e4b888efbdbfe6dfbcfb9dbbe7f0947ae4b888efbdbfe6dfbcfb9dbbe7f0947a42
EUC-JP 荳茨スソ赱シ?サ銜配荳茨スソ赱シ?サ銜配B 111010001011101010110000111100011000111010111101100011101011111111101100111000011000111010111100001111111000111010111011111011101111001011000111110110111110100010111010101100001111000110001110101111011000111010111111111011001110000110001110101111000011111110001110101110111110111011110010110001111101101101000010 e8bab0f18ebd8ebfece18ebc3f8ebbeef2c7dbe8bab0f18ebd8ebfece18ebc3f8ebbeef2c7db42
UTF-8 荳茨スソ赱シ﨟サ銜配荳茨スソ赱シ﨟サ銜配B 11101000100011011011001111101000100011001010100011101111101111011011110111101111101111011011111111101000101101011011000111101111101111011011110011101111101010001001111111101111101111011011101111101001100010101001110011101001100001011000110111101000100011011011001111101000100011001010100011101111101111011011110111101111101111011011111111101000101101011011000111101111101111011011110011101111101010001001111111101111101111011011101111101001100010101001110011101001100001011000110101000010 e88db3e88ca8efbdbdefbdbfe8b5b1efbdbcefa89fefbdbbe98a9ce9858de88db3e88ca8efbdbdefbdbfe8b5b1efbdbcefa89fefbdbbe98a9ce9858d42
UHC 荳茨??????銜配荳茨??????銜配B 1101010011100101111011011011110000111111001111110011111100111111001111110011111111111001111001111101101111010101110101001110010111101101101111000011111100111111001111110011111100111111001111111111100111100111110110111101010101000010 d4e5edbc3f3f3f3f3f3ff9e7dbd5d4e5edbc3f3f3f3f3f3ff9e7dbd542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)