To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??蹂〓?語??苡????????? 001111110011111100111111100010111000001100111111001111111110011011111000100000011010110000111111100011001110101000111111001111111110010010001111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f8b833f3fe6f881ac3f8cea3f3fe48f3f3f3f3f3f3f3f3f3f
EUC-JP ???泣??蹂〓?語??苡????????? 001111110011111100111111101101011110001100111111001111111110110011111010101000101010111000111111101110001110110000111111001111111110011111101111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3fb5e33f3fecfaa2ae3fb8ec3f3fe7ef3f3f3f3f3f3f3f3f3f
UTF-8 捻꿔꺂泣볢땻蹂〓븶語ⓥ뫗苡며춯琉우뒗列룸뿥履 111011111010011010100100111010101011111110010100111010101011101010000010111001101011001110100011111010111011001110100010111010111001010110111011111010001011100110000010111000111000000010010011111010111011100010110110111010001010101010011110111000101001001110100101111010111010101110010111111010001000101110100001111010111010100110110000111011001011011010101111111011111010011110001100111011001001101010110000111010111001001010010111111011111010011010011100111010111010001110111000111010111011111110100101111011111010011110011111 efa6a4eabf94eaba82e6b3a3ebb3a2eb95bbe8b982e38093ebb8b6e8aa9ee293a5ebab97e88ba1eba9b0ecb6afefa78cec9ab0eb9297efa69ceba3b8ebbfa5efa79f
UHC 捻꿔꺂泣볢땻蹂〓븶語ⓥ뫗苡며춯琉우뒗列룸뿥履 1110011011110111101100101110001110000011101010111110101111101000100100111110100010001011100100011110101110110011101000011110101110010101100111111110010111011110101010001110001010010001101110011110110010111110101110001110011110101101100011001110101110100100101111111110110010001010100101001110011011101010101101111110101110010111101001011110110010101010 e6f7b2e383abebe893e88b91ebb3a1eb959fe5dea8e291b9ecbeb8e7ad8ceba4bfec8a94e6eab7eb97a5ecaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)