To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????×??????v}B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111111010111001111110011111100111111001111110011111100111111011101100111110101000010 3f3f3f3f3f3f3f3f3f3f3fd73f3f3f3f3f3f767d42
SJIS-WIN ??????轅??壤?×裕??醫??v}B 0011111100111111001111110011111100111111001111111110011101110110001111110011111110011010110111110011111110000001011111101001011101010100001111110011111111100111110011100011111100111111011101100111110101000010 3f3f3f3f3f3fe7763f3f9adf3f817e97543f3fe7ce3f3f767d42
EUC-JP ??????轅??壤?×裕??醫??v}B 0011111100111111001111110011111100111111001111111110110111010111001111110011111111010100111000010011111110100001110111111100110110110101001111110011111111101110110100000011111100111111011101100111110101000010 3f3f3f3f3f3fedd73f3fd4e13fa1dfcdb53f3feed03f3f767d42
UTF-8 捻뚭엽六쀧솈轅겹궨壤깆×裕덂쉽醫꾪뮎v}B 1110111110100110101001001110101110011010101011011110110010010111101111011110111110100111100100011110110010000000101001111110110010000110100010001110100010111101100001011110101010110010101110011110101010110110101010001110010110100011101001001110101010111001100001101100001110010111111010001010001110010101111010111000110110000010111011001000100110111101111010011000011010101011111010101011111010101010111010111010111010001110011101100111110101000010 efa6a4eb9aadec97bdefa791ec80a7ec8688e8bd85eab2b9eab6a8e5a3a4eab986c397e8a395eb8d82ec89bde986abeabeaaebae8e767d42
UHC 捻뚭엽六쀧솈轅겹궨壤깆×裕덂쉽醫꾪뮎v}B 111001101111011110001100111010101011111110110001111010111011101110010111111001111001100110001100111010101011111110110000111000111000001010111010111001011011110110110001111011001010000110111111111010111010111010001000111001011011110110110001111011001010001010000100111011011001001010011011011101100111110101000010 e6f78ceabfb1ebbb97e7998ceabfb0e382bae5bdb1eca1bfebae88e5bdb1eca284ed929b767d42

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
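
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the code behind this page. The function name `encode_table` is hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the table above.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    Hypothetical helper: the codec names are Python's, chosen as assumed
    equivalents of ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8 and UHC.
    """
    rows = []
    for charset in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        raw = text.encode(charset, errors="replace")
        # Each byte becomes 8 binary digits, concatenated without separators
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((charset, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for charset, bits, hexstr in encode_table("×v}B"):
        print(charset, bits, hexstr)
```

For the multiplication sign "×", this yields d7 in ISO-8859-1, 817e in CP932, a1df in EUC-JP, and c397 in UTF-8, the same byte sequences that appear for that character in the rows above.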