To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????g????}????g????{^ 001111110011111100111111001111110110011100111111001111110011111100111111011111010011111100111111001111110011111101100111001111110011111100111111001111110111101101011110 3f3f3f3f673f3f3f3f7d3f3f3f3f673f3f3f3f7b5e
SJIS-WIN 上ェショg硝緕キ煮}上ェショg硝緕キ煮{^ 1000111111100011101010101011110010101110011001111000111111001001111000111000111010110111100011101100111101111101100011111110001110101010101111001010111001100111100011111100100111100011100011101011011110001110110011110111101101011110 8fe3aabcae678fc9e38eb78ecf7d8fe3aabcae678fc9e38eb78ecf7b5e
EUC-JP 上ェショg硝緕キ煮}上ェショg硝緕キ煮{^ 10111110111001011000111010101010100011101011110010001110101011100110011110111110110010111110010111101110100011101011011110111100110100010111110110111110111001011000111010101010100011101011110010001110101011100110011110111110110010111110010111101110100011101011011110111100110100010111101101011110 bee58eaa8ebc8eae67becbe5ee8eb7bcd17dbee58eaa8ebc8eae67becbe5ee8eb7bcd17b5e
UTF-8 上ェショg硝緕キ煮}上ェショg硝緕キ煮{^ 1110010010111000100010101110111110111101101010101110111110111101101111001110111110111101101011100110011111100111101000011001110111100111101101111001010111101111101111011011011111100111100001011010111001111101111001001011100010001010111011111011110110101010111011111011110110111100111011111011110110101110011001111110011110100001100111011110011110110111100101011110111110111101101101111110011110000101101011100111101101011110 e4b88aefbdaaefbdbcefbdae67e7a19de7b795efbdb7e785ae7de4b88aefbdaaefbdbcefbdae67e7a19de7b795efbdb7e785ae7b5e
UHC 上???g硝??煮}上???g硝??煮{^ 110111111011111000111111001111110011111101100111111101011010011000111111001111111110110110110100011111011101111110111110001111110011111100111111011001111111010110100110001111110011111111101101101101000111101101011110 dfbe3f3f3f67f5a63f3fedb47ddfbe3f3f3f67f5a63f3fedb47b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)