What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?C?Cf?C?C^}Y?C?Cf?C?C^}bE 00111111010000110011111101000011011001100011111101000011001111110100001101011110011111010101100100111111010000110011111101000011011001100011111101000011001111110100001101011110011111010110001001000101 3f433f43663f433f435e7d593f433f43663f433f435e7d6245
SJIS-WIN 茱C茱Cf茱C茱C^}Y茱C茱Cf茱C茱C^}bE 111001001010001101000011111001001010001101000011011001101110010010100011010000111110010010100011010000110101111001111101010110011110010010100011010000111110010010100011010000110110011011100100101000110100001111100100101000110100001101011110011111010110001001000101 e4a343e4a34366e4a343e4a3435e7d59e4a343e4a34366e4a343e4a3435e7d6245
EUC-JP 茱C茱Cf茱C茱C^}Y茱C茱Cf茱C茱C^}bE 111010001010010101000011111010001010010101000011011001101110100010100101010000111110100010100101010000110101111001111101010110011110100010100101010000111110100010100101010000110110011011101000101001010100001111101000101001010100001101011110011111010110001001000101 e8a543e8a54366e8a543e8a5435e7d59e8a543e8a54366e8a543e8a5435e7d6245
UTF-8 茱C茱Cf茱C茱C^}Y茱C茱Cf茱C茱C^}bE 1110100010001100101100010100001111101000100011001011000101000011011001101110100010001100101100010100001111101000100011001011000101000011010111100111110101011001111010001000110010110001010000111110100010001100101100010100001101100110111010001000110010110001010000111110100010001100101100010100001101011110011111010110001001000101 e88cb143e88cb14366e88cb143e88cb1435e7d59e88cb143e88cb14366e88cb143e88cb1435e7d6245
UHC 茱C茱Cf茱C茱C^}Y茱C茱Cf茱C茱C^}bE 111000101011110001000011111000101011110001000011011001101110001010111100010000111110001010111100010000110101111001111101010110011110001010111100010000111110001010111100010000110110011011100010101111000100001111100010101111000100001101011110011111010110001001000101 e2bc43e2bc4366e2bc43e2bc435e7d59e2bc43e2bc4366e2bc43e2bc435e7d6245

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
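
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code this page runs. The function name `encode_to_bits` and the `CHARSETS` mapping are hypothetical, and the mapping of the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters a charset cannot represent are replaced with "?" (hex 3f), similar to what the ISO-8859-1 row shows.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
# These pairings are assumptions; the page's converter may differ in edge cases.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary, hexadecimal) strings for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bit string.
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits("茱C", codec)
        print(label, binary, hexadecimal)
```

For example, `encode_to_bits("茱C", "utf-8")` yields the hexadecimal string `e88cb143`, which matches the first four bytes of the UTF-8 row above.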