To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???爰??淫??嚴щ?疑??儒??椰??B 0011111100111111001111111110000010100111001111110011111110001000111110100011111100111111100110101000111010000100100010110011111110001011010111100011111100111111100011101111001000111111001111111001111010111101001111110011111101000010 3f3f3fe0a73f3f88fa3f3f9a8e848b3f8b5e3f3f8ef23f3f9ebd3f3f42
EUC-JP ???爰??淫??嚴щ?疑??儒??椰??B 0011111100111111001111111110000010101001001111110011111110110000111111000011111100111111110100111110111010100111111010110011111110110101101111110011111100111111101111001111010000111111001111111101110010111111001111110011111101000010 3f3f3fe0a93f3fb0fc3f3fd3eea7eb3fb5bf3f3fbcf43f3fdcbf3f3f42
UTF-8 捻뀀뿣爰껃뵱淫뚭괵嚴щㅏ疑귞뫀儒좏돪椰꾨푾B 111011111010011010100100111010111000000010000000111010111011111110100011111001111000100010110000111010101011101110000011111010111011010110110001111001101011011110101011111010111001101010101101111010101011010010110101111001011001101010110100110100011000100111100011100001011000111111100111100101101001000111101010101101111001111011101011101010111000000011100101100001001001001011101100101000101000111111101011100011111010101011100110101001001011000011101010101111101010100011101101100100011011111001000010 efa6a4eb8080ebbfa3e788b0eabb83ebb5b1e6b7abeb9aadeab4b5e59ab4d189e3858fe79691eab79eebab80e58492eca28feb8faae6a4b0eabea8ed91be42
UHC 捻뀀뿣爰껃뵱淫뚭괵嚴щㅏ疑귞뫀儒좏돪椰꾨푾B 11100110111101111011001011101011100101111010001111101010101110101000001111100101100101001010111111101011111000101000110011101010101100011010110011100101111100011010110011101011101001001011111111101011111101111000001011100111100100011010010011101010111000111010000011101101100010011010110111100101101010111000010011101011101111101000100101000010 e6f7b2eb97a3eaba83e594afebe28ceab1ace5f1aceba4bfebf782e791a4eae3a0ed89ade5ab84ebbe8942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)