To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 上ュ昌キシキ上チシシ上ュ昌キシキ晶ヌ質 1000111111100011111100101011110010101101100011111011100110110111101111001011011110001111111000111100000110111100101111001000111111100011111100101011110010101101100011111011100110110111101111001011011110001111101110111111010010001110110001111000111010111111 8fe3f2bcad8fb9b7bcb78fe3c1bcbc8fe3f2bcad8fb9b7bcb78fbbf48ec78ebf
EUC-JP 上?ュ昌キシキ上チシシ上?ュ昌キシキ晶?ヌ質 1011111011100101001111111000111010101101101111101011101110001110101101111000111010111100100011101011011110111110111001011000111011000001100011101011110010001110101111001011111011100101001111111000111010101101101111101011101110001110101101111000111010111100100011101011011110111110101111010011111110001110110001111011110011000001 bee53f8eadbebb8eb78ebc8eb7bee58ec18ebc8ebcbee53f8eadbebb8eb78ebc8eb7bebd3f8ec7bcc1
UTF-8 上ュ昌キシキ上チシシ上ュ昌キシキ晶ヌ質 111001001011100010001010111011101000011110110011111011111011110110101101111001101001100010001100111011111011110110110111111011111011110110111100111011111011110110110111111001001011100010001010111011111011111010000001111011111011110110111100111011111011110110111100111001001011100010001010111011101000011110110011111011111011110110101101111001101001100010001100111011111011110110110111111011111011110110111100111011111011110110110111111001101001100110110110111011101000110010111101111011111011111010000111111010001011001110101010 e4b88aee87b3efbdade6988cefbdb7efbdbcefbdb7e4b88aefbe81efbdbcefbdbce4b88aee87b3efbdade6988cefbdb7efbdbcefbdb7e699b6ee8cbdefbe87e8b3aa
UHC 上??昌???上???上??昌???晶??質 1101111110111110001111110011111111110011111000110011111100111111001111111101111110111110001111110011111100111111110111111011111000111111001111111111001111100011001111110011111100111111111011111101110000111111001111111111001011110101 dfbe3f3ff3e33f3f3fdfbe3f3f3fdfbe3f3ff3e33f3f3fefdc3f3ff2f5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)