What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?@?@f?@?@^}Y?@?@f?@?@^}bE 00111111010000000011111101000000011001100011111101000000001111110100000001011110011111010101100100111111010000000011111101000000011001100011111101000000001111110100000001011110011111010110001001000101 3f403f40663f403f405e7d593f403f40663f403f405e7d6245
SJIS-WIN 癌@癌@f癌@癌@^}Y癌@癌@f癌@癌@^}bE 100010101110000001000000100010101110000001000000011001101000101011100000010000001000101011100000010000000101111001111101010110011000101011100000010000001000101011100000010000000110011010001010111000000100000010001010111000000100000001011110011111010110001001000101 8ae0408ae040668ae0408ae0405e7d598ae0408ae040668ae0408ae0405e7d6245
EUC-JP 癌@癌@f癌@癌@^}Y癌@癌@f癌@癌@^}bE 101101001110001001000000101101001110001001000000011001101011010011100010010000001011010011100010010000000101111001111101010110011011010011100010010000001011010011100010010000000110011010110100111000100100000010110100111000100100000001011110011111010110001001000101 b4e240b4e24066b4e240b4e2405e7d59b4e240b4e24066b4e240b4e2405e7d6245
UTF-8 癌@癌@f癌@癌@^}Y癌@癌@f癌@癌@^}bE 1110011110011001100011000100000011100111100110011000110001000000011001101110011110011001100011000100000011100111100110011000110001000000010111100111110101011001111001111001100110001100010000001110011110011001100011000100000001100110111001111001100110001100010000001110011110011001100011000100000001011110011111010110001001000101 e7998c40e7998c4066e7998c40e7998c405e7d59e7998c40e7998c4066e7998c40e7998c405e7d6245
UHC 癌@癌@f癌@癌@^}Y癌@癌@f癌@癌@^}bE 111001001101111101000000111001001101111101000000011001101110010011011111010000001110010011011111010000000101111001111101010110011110010011011111010000001110010011011111010000000110011011100100110111110100000011100100110111110100000001011110011111010110001001000101 e4df40e4df4066e4df40e4df405e7d59e4df40e4df4066e4df40e4df405e7d6245

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
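
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed equivalents of the charset labels above, and the helper name `to_bit_strings` is hypothetical. Characters that a charset cannot represent are replaced with "?", which matches the ISO-8859-1 row.

```python
# Sketch: encode a string in several charsets and show the resulting bytes
# as a binary bit string and a hexadecimal string.
# The Python codec names are assumed equivalents of the table's charset labels.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset cannot encode
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("癌@", codec)
        print(label, binary, hexadecimal)
```

Each output line corresponds to one row of the table for the same input; multi-byte charsets produce two or three bytes for 癌, while "@" is always the single byte 40.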