Which bit string is a character (or a short string of characters) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???艤??議?????唯??轅?????? 0011111100111111001111111110010001111110001111110011111110001011011000110011111100111111001111110011111100111111100101110100001000111111001111111110011101110110001111110011111100111111001111110011111100111111 3f3f3fe47e3f3f8b633f3f3f3f3f97423f3fe7763f3f3f3f3f3f
EUC-JP ???艤??議?????唯??轅?????瑗 00111111001111110011111111100111110111110011111100111111101101011100010000111111001111110011111100111111001111111100110110100011001111110011111111101101110101110011111100111111001111110011111100111111100011111100110011000000 3f3f3fe7df3f3fb5c43f3f3f3f3fcda33f3fedd73f3f3f3f3f8fccc0
UTF-8 捻뚭엥艤욕쉽議우퐭捻뚭였唯덄솈轅깅닅捻뚭여瑗 111011111010011010100100111010111001101010101101111011001001011110100101111010001000100110100100111011001001101010010101111011001000100110111101111010001010110110110000111011001001101010110000111011011001000010101101111011111010011010100100111010111001101010101101111011001001100010000000111001011001010010101111111010111000110110000100111011001000011010001000111010001011110110000101111010101011100110000101111010111000101110000101111011111010011010100100111010111001101010101101111011001001011110101100111001111001000110010111 efa6a4eb9aadec97a5e889a4ec9a95ec89bde8adb0ec9ab0ed90adefa6a4eb9aadec9880e594afeb8d84ec8688e8bd85eab985eb8b85efa6a4eb9aadec97ace79197
UHC 捻뚭엥艤욕쉽議우퐭捻뚭였唯덄솈轅깅닅捻뚭여瑗 1110011011110111100011001110101010111111101010001110101111111010101111111110010110111101101100011110110010100001101111111110110010111101100101101110011011110111100011001110101010111111101101001110101011100110100010001110011110011001100011001110101010111111101100011110101110001000100011101110011011110111100011001110101010111111101010011110101010111100 e6f78ceabfa8ebfabfe5bdb1eca1bfecbd96e6f78ceabfb4eae688e7998ceabfb1eb888ee6f78ceabfa9eabc

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
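
The conversion shown in the table can be approximated offline with a few lines of Python. This is only a sketch, not the code behind this page: the function name `to_bit_strings` and the mapping of the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions, and Python's codecs may differ from this tool for some characters. Characters that a charset cannot represent are replaced by "?" (0x3f), which is the behaviour visible in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("艤", codec)
        print(label, binary, hexadecimal)
```

For the character 艤, the UTF-8 line should show `e889a4`, the same byte sequence that appears for that character in the UTF-8 row of the table.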