To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???爰??淫?????已??兢宥??臾??^ 0011111100111111001111111110000010100111001111110011111110001000111110100011111100111111001111110011111100111111100110111101111100111111001111111001100101011101100101110100011100111111001111111110010001101011001111110011111101011110 3f3f3fe0a73f3f88fa3f3f3f3f3f9bdf3f3f995d97473f3fe46b3f3f5e
EUC-JP ???爰??淫?????已??兢宥??臾??^ 0011111100111111001111111110000010101001001111110011111110110000111111000011111100111111001111110011111100111111110101101110000100111111001111111101000110111110110011011010100000111111001111111110011111001100001111110011111101011110 3f3f3fe0a93f3fb0fc3f3f3f3f3fd6e13f3fd1becda83f3fe7cc3f3f5e
UTF-8 捻뀀뿣爰껃뵱淫뗫빝列띕뜉已녔걗兢宥붶틪臾먯쪕^ 11101111101001101010010011101011100000001000000011101011101111111010001111100111100010001011000011101010101110111000001111101011101101011011000111100110101101111010101111101011100101111010101111101011101110011001110111101111101001101001110011101011100111011001010111101011100111001000100111100101101101111011001011101011100001011001010011101010101100011001011111100101100001011010001011100101101011101010010111101011101101101011011011101101100010111010101011101000100001111011111011101011101010001010111111101100101010101001010101011110 efa6a4eb8080ebbfa3e788b0eabb83ebb5b1e6b7abeb97abebb99defa69ceb9d95eb9c89e5b7b2eb8594eab197e585a2e5aea5ebb6b6ed8baae887beeba8afecaa955e
UHC 捻뀀뿣爰껃뵱淫뗫빝列띕뜉已녔걗兢宥붶틪臾먯쪕^ 111001101111011110110010111010111001011110100011111010101011101010000011111001011001010010101111111010111110001010001011111010111001010110111011111001101110101010110110111010111000110110001100111011001010101110110011111001101000000110000010110100001110011111101010111010011001010011100100101110101001010011101011101011001001000011101100101001011000111101011110 e6f7b2eb97a3eaba83e594afebe28beb95bbe6eab6eb8d8cecabb3e68182d0e7eae994e4ba94ebac90eca58f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)