To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????×??? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111111010111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fd73f3f3f
SJIS-WIN ??????醫??永??魏??轅??壤?×援℡? 0011111100111111001111110011111100111111001111111110011111001110001111110011111110001001011010010011111100111111111010011011000000111111001111111110011101110110001111110011111110011010110111110011111110000001011111101000100110000111100001111000010000111111 3f3f3f3f3f3fe7ce3f3f89693f3fe9b03f3fe7763f3f9adf3f817e898787843f
EUC-JP ??????醫??永??魏??轅??壤?×援?? 00111111001111110011111100111111001111110011111111101110110100000011111100111111101100011100101000111111001111111111001010110010001111110011111111101101110101110011111100111111110101001110000100111111101000011101111110110001111001110011111100111111 3f3f3f3f3f3feed03f3fb1ca3f3ff2b23f3fedd73f3fd4e13fa1dfb1e73f3f
UTF-8 捻뚭염栒듿쉽醫묓뭷永띔쑤魏섊솈轅깅츉壤깆×援℡쉽 1110111110100110101001001110101110011010101011011110110010010111101111001110011010100000100100101110101110010011101111111110110010001001101111011110100110000110101010111110101110101100100100111110101110101101101101111110011010110000101110001110101110011101100101001110110010010001101001001110100110101101100011111110110010000100100010101110110010000110100010001110100010111101100001011110101010111001100001011110110010111000100010011110010110100011101001001110101010111001100001101100001110010111111001101000111110110100111000101000010010100001111011001000100110111101 efa6a4eb9aadec97bce6a092eb93bfec89bde986abebac93ebadb7e6b0b8eb9d94ec91a4e9ad8fec848aec8688e8bd85eab985ecb889e5a3a4eab986c397e68fb4e284a1ec89bd
UHC 捻뚭염栒듿쉽醫묓뭷永띔쑤魏섊솈轅깅츉壤깆×援℡쉽 111001101111011110001100111010101011111110110000111000101110001110001010111001011011110110110001111011001010001010010001111011011001001010000110111001111011010110110110111010101011111010100101111010101110000010011000111001111001100110001100111010101011111110110001111010111010111010000101111001011011110110110001111011001010000110111111111010101011010110100010111001011011110110110001 e6f78ceabfb0e2e38ae5bdb1eca291ed9286e7b5b6eabea5eae098e7998ceabfb1ebae85e5bdb1eca1bfeab5a2e5bdb1

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
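
The rows of the table above can be approximated offline with a short script. The following is only a sketch under stated assumptions, not the converter's actual code: the helper names (`CHARSETS`, `encode_row`, `convert`) are hypothetical, Python's `cp932` and `cp949` codecs are assumed to stand in for SJIS-Win and UHC, and unencodable characters are replaced with "?" (0x3f), which appears to be how the table treats them.

```python
# Hypothetical helpers for illustration; not the converter's own implementation.

# Table label -> Python codec name (cp932 ~ SJIS-Win, cp949 ~ UHC: assumptions).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become b"?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, bit string, hex string.
    for label, (bits, hexstr) in convert("×").items():
        print(label, bits, hexstr)
```

For the multiplication sign "×", this yields `d7` for ISO-8859-1, `817e` for SJIS-Win, `a1df` for EUC-JP and `c397` for UTF-8, which matches the byte sequences for that character in the table.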