What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 縡???制耿??鬱??狡?訂? 111000110111000100111111001111110011111110010000101001111110001111010100001111110011111110011111010101000011111100111111111000001100001000111111100100101111100100111111 e3713f3f3f90a7e3d43f3f9f543f3fe0c23f92f93f
EUC-JP 縡???制耿??鬱??狡?訂? 111001011101001000111111001111110011111111000000101010011110011011010110001111110011111111011101101101010011111100111111111000001100010000111111110001001111101100111111 e5d23f3f3fc0a9e6d63f3fddb53f3fe0c43fc4fb3f
UTF-8 縡렕亐렕制耿렱렟鬱讀렲狡㉢訂렦 111001111011100010100001111010111010000010010101111001001011101010010000111010111010000010010101111001011000100010110110111010001000000010111111111010111010000010110001111010111010000010011111111010011010110010110001111011111010010110011010111010111010000010110010111001111000101110100001111000111000100110100010111010001010100010000010111010111010000010100110 e7b8a1eba095e4ba90eba095e588b6e880bfeba0b1eba09fe9acb1efa59aeba0b2e78ba1e389a2e8a882eba0a6
UHC 縡렕亐렕制耿렱렟鬱讀렲狡㉢訂렦 111011101010110110001110101010101110101010100111100011101010101011110000101001001100110011101010100011101011111010001110101100001110101010100110110101001110011010001110101111111100111011101010101010001011001111101111111101001000111010110101 eead8eaaeaa78eaaf0a4ccea8ebe8eb0eaa6d4e68ebfceeaa8b3eff48eb5

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
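
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The names CHARSETS, to_bit_strings, and convert are hypothetical, and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" turns every unencodable character into b"?" (0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("制").items():
        print(label, binary, hexadecimal)
```

For the single character 制, this sketch should give e588b6 for UTF-8, 90a7 for SJIS-WIN, and c0a9 for EUC-JP, the same byte values that appear for that character in the table rows above.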