To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????×? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101011100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fd73f
SJIS-WIN ???鎰??轅??壤??泣??魏??壤?×? 0011111100111111001111111110100001001100001111110011111111100111011101100011111100111111100110101101111100111111001111111000101110000011001111110011111111101001101100000011111100111111100110101101111100111111100000010111111000111111 3f3f3fe84c3f3fe7763f3f9adf3f3f8b833f3fe9b03f3f9adf3f817e3f
EUC-JP ???鎰??轅??壤??泣??魏??壤?×瑗 00111111001111110011111111101111101011010011111100111111111011011101011100111111001111111101010011100001001111110011111110110101111000110011111100111111111100101011001000111111001111111101010011100001001111111010000111011111100011111100110011000000 3f3f3fefad3f3fedd73f3fd4e13f3fb5e33f3ff2b23f3fd4e13fa1df8fccc0
UTF-8 捻뚭엽鎰쏁솈轅댁럞壤깆쥜泣ㅷ독魏곹뮎壤깆×瑗 1110111110100110101001001110101110011010101011011110110010010111101111011110100110001110101100001110110010001111100000011110110010000110100010001110100010111101100001011110101110001100100000011110101110011111100111101110010110100011101001001110101010111001100001101110110010100101100111001110011010110011101000111110001110000101101101111110101110001111100001011110100110101101100011111110101010110011101110011110101110101110100011101110010110100011101001001110101010111001100001101100001110010111111001111001000110010111 efa6a4eb9aadec97bde98eb0ec8f81ec8688e8bd85eb8c81eb9f9ee5a3a4eab986eca59ce6b3a3e385b7eb8f85e9ad8feab3b9ebae8ee5a3a4eab986c397e79197
UHC 捻뚭엽鎰쏁솈轅댁럞壤깆쥜泣ㅷ독魏곹뮎壤깆×瑗 1110011011110111100011001110101010111111101100011110110011110000100110111110011110011001100011001110101010111111101101001110110010001110100000011110010110111101101100011110110010100010100100011110101111101000101001001110011110110101101101101110101011100000100000011110110110010010100110111110010110111101101100011110110010100001101111111110101010111100 e6f78ceabfb1ecf09be7998ceabfb4ec8e81e5bdb1eca291ebe8a4e7b5b6eae081ed929be5bdb1eca1bfeabc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)