To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 上韲ク社シセセシ簀爵淏骼ク社シセセシ簀灼^ 10001111111000111110100011101100101110001000111011010000101111001011111010111110101111001110001011000101100011101101110111111011010000101110100110001110101110001000111011010000101111001011111010111110101111001110001011000101100011101101110001011110 8fe3e8ecb88ed0bcbebebce2c58eddfb42e98eb88ed0bcbebebce2c58edc5e
EUC-JP 上韲ク社シセセシ簀爵淏骼ク社シセセシ簀灼^ 101111101110010111110000111011101000111010111000101111001101001010001110101111001000111010111110100011101011111010001110101111001110010011000111101111001101111110001111110001111101100111110001111011101000111010111000101111001101001010001110101111001000111010111110100011101011111010001110101111001110010011000111101111001101111001011110 bee5f0ee8eb8bcd28ebc8ebe8ebe8ebce4c7bcdf8fc7d9f1ee8eb8bcd28ebc8ebe8ebe8ebce4c7bcde5e
UTF-8 上韲ク社シセセシ簀爵淏骼ク社シセセシ簀灼^ 11100100101110001000101011101001100111111011001011101111101111011011100011100111101001001011111011101111101111011011110011101111101111011011111011101111101111011011111011101111101111011011110011100111101100001000000011100111100010001011010111100110101101111000111111101001101010101011110011101111101111011011100011100111101001001011111011101111101111011011110011101111101111011011111011101111101111011011111011101111101111011011110011100111101100001000000011100111100000011011110001011110 e4b88ae99fb2efbdb8e7a4beefbdbcefbdbeefbdbeefbdbce7b080e788b5e6b78fe9aabcefbdb8e7a4beefbdbcefbdbeefbdbeefbdbce7b080e781bc5e
UHC 上??社?????爵淏??社?????灼^ 110111111011111000111111001111111101111011100100001111110011111100111111001111110011111111101101110010011111101111001000001111110011111111011110111001000011111100111111001111110011111100111111111011011100011101011110 dfbe3f3fdee43f3f3f3f3fedc9fbc83f3fdee43f3f3f3f3fedc75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)