What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 上ォシアシ・昭ロシエ上ォシアシ・昭ロシエB 1000111111100011101010111011110010110001111100101111011110111100101001011000111110111010110110111011110010110100100011111110001110101011101111001011000111110010111101111011110010100101100011111011101011011011101111001011010001000010 8fe3abbcb1f2f7bca58fbadbbcb48fe3abbcb1f2f7bca58fbadbbcb442
EUC-JP 上ォシア?シ・昭ロシエ上ォシア?シ・昭ロシエB 10111110111001011000111010101011100011101011110010001110101100010011111110001110101111001000111010100101101111101011110010001110110110111000111010111100100011101011010010111110111001011000111010101011100011101011110010001110101100010011111110001110101111001000111010100101101111101011110010001110110110111000111010111100100011101011010001000010 bee58eab8ebc8eb13f8ebc8ea5bebc8edb8ebc8eb4bee58eab8ebc8eb13f8ebc8ea5bebc8edb8ebc8eb442
UTF-8 上ォシアシ・昭ロシエ上ォシアシ・昭ロシエB 11100100101110001000101011101111101111011010101111101111101111011011110011101111101111011011000111101110100010001010111011101111101111011011110011101111101111011010010111100110100110001010110111101111101111101001101111101111101111011011110011101111101111011011010011100100101110001000101011101111101111011010101111101111101111011011110011101111101111011011000111101110100010001010111011101111101111011011110011101111101111011010010111100110100110001010110111101111101111101001101111101111101111011011110011101111101111011011010001000010 e4b88aefbdabefbdbcefbdb1ee88aeefbdbcefbda5e698adefbe9befbdbcefbdb4e4b88aefbdabefbdbcefbdb1ee88aeefbdbcefbda5e698adefbe9befbdbcefbdb442
UHC 上??????昭???上??????昭???B 110111111011111000111111001111110011111100111111001111110011111111100001101110010011111100111111001111111101111110111110001111110011111100111111001111110011111100111111111000011011100100111111001111110011111101000010 dfbe3f3f3f3f3f3fe1b93f3f3fdfbe3f3f3f3f3f3fe1b93f3f3f42

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
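
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, as are the helper names `encode_row` and `encode_table`. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the behaviour seen in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes '?' for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, input, binary, hexadecimal.
    sample = "上B"
    for label, (bits, hex_string) in encode_table(sample).items():
        print(label, sample, bits, hex_string)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.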