To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????±????????±? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011000100111111001111110011111100111111001111110011111100111111001111111011000100111111 3f3f3f3f3f3f3f3f3f3f3fb13f3f3f3f3f3f3f3fb13f
SJIS-WIN ???異??醫??壤?±艤??轅??壤?±? 001111110011111100111111100010001101100100111111001111111110011111001110001111110011111110011010110111110011111110000001011111011110010001111110001111110011111111100111011101100011111100111111100110101101111100111111100000010111110100111111 3f3f3f88d93f3fe7ce3f3f9adf3f817de47e3f3fe7763f3f9adf3f817d3f
EUC-JP ???異??醫??壤?±艤??轅??壤?±? 001111110011111100111111101100001101101100111111001111111110111011010000001111110011111111010100111000010011111110100001110111101110011111011111001111110011111111101101110101110011111100111111110101001110000100111111101000011101111000111111 3f3f3fb0db3f3feed03f3fd4e13fa1dee7df3f3fedd73f3fd4e13fa1de3f
UTF-8 捻뚭엽異쇔쉽醫귙뀏壤깆±艤욜솈轅깊뮎壤깆±栒 11101111101001101010010011101011100110101010110111101100100101111011110111100111100101011011000011101100100001111001010011101100100010011011110111101001100001101010101111101010101101111001100111101011100000001000111111100101101000111010010011101010101110011000011011000010101100011110100010001001101001001110110010011010100111001110110010000110100010001110100010111101100001011110101010111001100010101110101110101110100011101110010110100011101001001110101010111001100001101100001010110001111001101010000010010010 efa6a4eb9aadec97bde795b0ec8794ec89bde986abeab799eb808fe5a3a4eab986c2b1e889a4ec9a9cec8688e8bd85eab98aebae8ee5a3a4eab986c2b1e6a092
UHC 捻뚭엽異쇔쉽醫귙뀏壤깆±艤욜솈轅깊뮎壤깆±栒 1110011011110111100011001110101010111111101100011110110010110110101111001110010110111101101100011110110010100010100000101110001110000101100010101110010110111101101100011110110010100001101111101110101111111010101111111110011110011001100011001110101010111111101100011110110110010010100110111110010110111101101100011110110010100001101111101110001011100011 e6f78ceabfb1ecb6bce5bdb1eca282e3858ae5bdb1eca1beebfabfe7998ceabfb1ed929be5bdb1eca1bee2e3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)