What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?C?Cf?C?C^}Y?C?Cf?C?C^}bE 00111111010000110011111101000011011001100011111101000011001111110100001101011110011111010101100100111111010000110011111101000011011001100011111101000011001111110100001101011110011111010110001001000101 3f433f43663f433f435e7d593f433f43663f433f435e7d6245
SJIS-WIN 癌C癌Cf癌C癌C^}Y癌C癌Cf癌C癌C^}bE 100010101110000001000011100010101110000001000011011001101000101011100000010000111000101011100000010000110101111001111101010110011000101011100000010000111000101011100000010000110110011010001010111000000100001110001010111000000100001101011110011111010110001001000101 8ae0438ae043668ae0438ae0435e7d598ae0438ae043668ae0438ae0435e7d6245
EUC-JP 癌C癌Cf癌C癌C^}Y癌C癌Cf癌C癌C^}bE 101101001110001001000011101101001110001001000011011001101011010011100010010000111011010011100010010000110101111001111101010110011011010011100010010000111011010011100010010000110110011010110100111000100100001110110100111000100100001101011110011111010110001001000101 b4e243b4e24366b4e243b4e2435e7d59b4e243b4e24366b4e243b4e2435e7d6245
UTF-8 癌C癌Cf癌C癌C^}Y癌C癌Cf癌C癌C^}bE 1110011110011001100011000100001111100111100110011000110001000011011001101110011110011001100011000100001111100111100110011000110001000011010111100111110101011001111001111001100110001100010000111110011110011001100011000100001101100110111001111001100110001100010000111110011110011001100011000100001101011110011111010110001001000101 e7998c43e7998c4366e7998c43e7998c435e7d59e7998c43e7998c4366e7998c43e7998c435e7d6245
UHC 癌C癌Cf癌C癌C^}Y癌C癌Cf癌C癌C^}bE 111001001101111101000011111001001101111101000011011001101110010011011111010000111110010011011111010000110101111001111101010110011110010011011111010000111110010011011111010000110110011011100100110111110100001111100100110111110100001101011110011111010110001001000101 e4df43e4df4366e4df43e4df435e7d59e4df43e4df4366e4df43e4df435e7d6245

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
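
The rows in the table above can be approximated offline with a short Python sketch. This is not the converter's own code. The mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption, as is the helper name `encode_row`. Characters a charset cannot represent are replaced with `?`, which appears to be what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "癌C"  # first two characters of the string in the table
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

Each byte is formatted as eight binary digits so that the binary column is always exactly four times as long as the hexadecimal column, as in the table.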