To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?C?Cf?C?C^}Y?C?Cf?C?C^}bE 00111111010000110011111101000011011001100011111101000011001111110100001101011110011111010101100100111111010000110011111101000011011001100011111101000011001111110100001101011110011111010110001001000101 3f433f43663f433f435e7d593f433f43663f433f435e7d6245
SJIS-WIN 償C償Cf償C償C^}Y償C償Cf償C償C^}bE 100011111001111001000011100011111001111001000011011001101000111110011110010000111000111110011110010000110101111001111101010110011000111110011110010000111000111110011110010000110110011010001111100111100100001110001111100111100100001101011110011111010110001001000101 8f9e438f9e43668f9e438f9e435e7d598f9e438f9e43668f9e438f9e435e7d6245
EUC-JP 償C償Cf償C償C^}Y償C償Cf償C償C^}bE 101111011111111001000011101111011111111001000011011001101011110111111110010000111011110111111110010000110101111001111101010110011011110111111110010000111011110111111110010000110110011010111101111111100100001110111101111111100100001101011110011111010110001001000101 bdfe43bdfe4366bdfe43bdfe435e7d59bdfe43bdfe4366bdfe43bdfe435e7d6245
UTF-8 償C償Cf償C償C^}Y償C償Cf償C償C^}bE 1110010110000100100111110100001111100101100001001001111101000011011001101110010110000100100111110100001111100101100001001001111101000011010111100111110101011001111001011000010010011111010000111110010110000100100111110100001101100110111001011000010010011111010000111110010110000100100111110100001101011110011111010110001001000101 e5849f43e5849f4366e5849f43e5849f435e7d59e5849f43e5849f4366e5849f43e5849f435e7d6245
UHC 償C償Cf償C償C^}Y償C償Cf償C償C^}bE 110111111100000101000011110111111100000101000011011001101101111111000001010000111101111111000001010000110101111001111101010110011101111111000001010000111101111111000001010000110110011011011111110000010100001111011111110000010100001101011110011111010110001001000101 dfc143dfc14366dfc143dfc1435e7d59dfc143dfc14366dfc143dfc1435e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)