To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??維??矣??檍?????釉?????應 11100001100111110011111100111111100010001101101100111111001111111110000111100001001111110011111110011110111110000011111100111111001111110011111100111111111001111101011000111111001111110011111100111111001111111001110011100100 e19f3f3f88db3f3fe1e13f3f9ef83f3f3f3f3fe7d63f3f3f3f3f9ce4
EUC-JP 癲??維??矣??檍??庾??釉??孼??應 1110001010100001001111110011111110110000110111010011111100111111111000101110001100111111001111111101110011111010001111110011111110001111101111001100111000111111001111111110111011011000001111110011111110001111101110101100001100111111001111111101100011100110 e2a13f3fb0dd3f3fe2e33f3fdcfa3f3f8fbcce3f3feed83f3f8fbac33f3fd8e6
UTF-8 癲녴굚維끾룚矣섑벖檍우뼚庾양븦釉먮짗孼뽮꼍應 111001111001100110110010111010111000010110110100111010101011010110011010111001111011011010101101111010111000000110111110111010111010001110011010111001111001111110100011111011001000010010010001111010111011001010010110111001101010101010001101111011001001101010110000111010111011110010011010111001011011101010111110111011001001011010010001111010111011100010100110111010011000011110001001111010111010100010101110111011001010011110010111111001011010110110111100111010111011110110101110111010101011110010001101111001101000011110001001 e799b2eb85b4eab59ae7b6adeb81beeba39ae79fa3ec8491ebb296e6aa8dec9ab0ebbc9ae5babeec9691ebb8a6e98789eba8aeeca797e5adbcebbdaeeabc8de68789
UHC 癲녴굚維끾룚矣섑벖檍우뼚庾양븦釉먮짗孼뽮꼍應 1110111110100110100001101110001110000010100000101110101110101011100001011110011010001111100101101110101111111000100110001110110110010011101101001110010111100101101111111110110010010110101000001110101011101100101111101110011110010101100011111110101110111000100100001110101110100011100111101110010111101101100101101110101010110010101111011110101111101011 efa686e38282ebab85e68f96ebf898ed93b4e5e5bfec96a0eaecbee7958febb890eba39ee5ed96eab2bdebeb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)