To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??宜??揄?????維??淫??域??魏 1110000110011111001111110011111110001011010110000011111100111111100111011000100100111111001111110011111100111111001111111000100011011011001111110011111110001000111110100011111100111111100010001110011000111111001111111110100110110000 e19f3f3f8b583f3f9d893f3f3f3f3f88db3f3f88fa3f3f88e63f3fe9b0
EUC-JP 癲??宜??揄?????維??淫??域??魏 1110001010100001001111110011111110110101101110010011111100111111110110011110100100111111001111110011111100111111001111111011000011011101001111110011111110110000111111000011111100111111101100001110100000111111001111111111001010110010 e2a13f3fb5b93f3fd9e93f3f3f3f3fb0dd3f3fb0fc3f3fb0e83f3ff2b2
UTF-8 癲덈챶宜룬씘揄우물醴븐뼦維쏉쬊淫볝렊域뱀쉸魏 111001111001100110110010111010111000110110001000111011001011000110110110111001011010111010011100111010111010001110101100111011001001010010011000111001101000111110000100111011001001101010110000111010111010110010111100111011111010011010110111111010111011100010010000111010111011110010100110111001111011011010101101111011001000111110001001111011001010110010001010111001101011011110101011111010111011001110011101111010111010000010001010111001011001111110011111111010111011000110000000111011001000100110111000111010011010110110001111 e799b2eb8d88ecb1b6e5ae9ceba3acec9498e68f84ec9ab0ebacbcefa6b7ebb890ebbca6e7b6adec8f89ecac8ae6b7abebb39deba08ae59f9febb180ec89b8e9ad8f
UHC 癲덈챶宜룬씘揄우물醴븐뼦維쏉쬊淫볝렊域뱀쉸魏 1110111110100110100010001110101110101010100000111110101111110001101101111110100110011101101011011110101011110001101111111110110010111001101100001110011111100100101110101110110010010110101010011110101110101011100110111110111110100110101000001110101111100010100100111110001110001110101000011110011010110100101110011110110010011010100011101110101011100000 efa688ebaa83ebf1b7e99dadeaf1bfecb9b0e7e4baec96a9ebab9befa6a0ebe293e38ea1e6b4b9ec9a8eeae0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)