To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????±??? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110110001001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fb13f3f3f
SJIS-WIN ???肄??醫??永?????轅??壤?±??? 001111110011111100111111111000111110010100111111001111111110011111001110001111110011111110001001011010010011111100111111001111110011111100111111111001110111011000111111001111111001101011011111001111111000000101111101001111110011111100111111 3f3f3fe3e53f3fe7ce3f3f89693f3f3f3f3fe7763f3f9adf3f817d3f3f3f
EUC-JP ???肄??醫??永??瑗??轅??壤?±??? 0011111100111111001111111110011011100111001111110011111111101110110100000011111100111111101100011100101000111111001111111000111111001100110000000011111100111111111011011101011100111111001111111101010011100001001111111010000111011110001111110011111100111111 3f3f3fe6e73f3feed03f3fb1ca3f3f8fccc03f3fedd73f3fd4e13fa1de3f3f3f
UTF-8 捻뚭여肄뽩쉽醫묒뒓永띔쐽瑗뉒솈轅깅츉壤깆±痢믣쉽 1110111110100110101001001110101110011010101011011110110010010111101011001110100010000010100001001110101110111101101010011110110010001001101111011110100110000110101010111110101110101100100100101110101110010010100100111110011010110000101110001110101110011101100101001110110010010000101111011110011110010001100101111110101110001001100100101110110010000110100010001110100010111101100001011110101010111001100001011110110010111000100010011110010110100011101001001110101010111001100001101100001010110001111011111010011110100101111010111010111110100011111011001000100110111101 efa6a4eb9aadec97ace88284ebbda9ec89bde986abebac92eb9293e6b0b8eb9d94ec90bde79197eb8992ec8688e8bd85eab985ecb889e5a3a4eab986c2b1efa7a5ebafa3ec89bd
UHC 捻뚭여肄뽩쉽醫묒뒓永띔쐽瑗뉒솈轅깅츉壤깆±痢믣쉽 111001101111011110001100111010101011111110101001111011001011110110010110111001011011110110110001111011001010001010010001111011001000101010010000111001111011010110110110111010101011111010100011111010101011110010000111111001111001100110001100111010101011111110110001111010111010111010000101111001011011110110110001111011001010000110111110111011001011100010010010111001011011110110110001 e6f78ceabfa9ecbd96e5bdb1eca291ec8a90e7b5b6eabea3eabc87e7998ceabfb1ebae85e5bdb1eca1beecb892e5bdb1

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
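
The same kind of conversion can be reproduced offline. The following is a minimal sketch in Python, not the tool's actual implementation (which this page does not describe). The function name encode_table and the mapping from the table's charset labels to Python codec names are assumptions made for illustration; in particular, cp949 is used as Python's codec for UHC. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears to be what the table above shows for unmappable input.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, as noted above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("±"):
        print(label, bits, hex_string)
```

For the single character "±", this gives b1 in ISO-8859-1, 817d in SJIS-WIN, a1de in EUC-JP, and c2b1 in UTF-8, which agrees with the bytes for "±" in the corresponding rows of the table.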