To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 午??踰??????蹂??烏??源??節??^ 1000110011011111001111110011111111100110111110100011111100111111001111110011111100111111001111111110011011111000001111110011111110001001010001110011111100111111100011001011100100111111001111111001000011011111001111110011111101011110 8cdf3f3fe6fa3f3f3f3f3f3fe6f83f3f89473f3f8cb93f3f90df3f3f5e
EUC-JP 午??踰??????蹂??烏??源??節??^ 1011100011100001001111110011111111101100111111000011111100111111001111110011111100111111001111111110110011111010001111110011111110110001101010000011111100111111101110001011101100111111001111111100000011100001001111110011111101011110 b8e13f3fecfc3f3f3f3f3f3fecfa3f3fb1a83f3fb8bb3f3fc0e13f3f5e
UTF-8 午닿퓥踰곮린栒맡곈씘蹂잕텫烏겹룗源썲콏節띾눒^ 11100101100011011000100011101011100010111011111111101101100100111010010111101000101110001011000011101010101100111010111011101011101001101011000011100110101000001001001011101011101001111010000111101010101100111000100011101100100101001001100011101000101110011000001011101100100111101001010111101101100001011010101111100111100000111000111111101010101100101011100111101011101000111001011111100110101110101001000011101100100011011011001011101100101111011000111111100111101011111000000011101011100111011011111011101011100010001001001001011110 e58d88eb8bbfed93a5e8b8b0eab3aeeba6b0e6a092eba7a1eab388ec9498e8b982ec9e95ed85abe7838feab2b9eba397e6ba90ec8db2ecbd8fe7af80eb9dbeeb88925e
UHC 午닿퓥踰곮린栒맡곈씘蹂잕텫烏겹룗源썲콏節띾눒^ 111001111110110110110100111010101011111110001110111010111011001010000001111010001011100010110000111000101110001110111000110000111011000011101001100111011010110111101011101100111001111111101010101101101001111111101000101000011011000011100011100011111001001111101010101110011011110111100101101100011000101111101111101111011000110111101011100001111010111001011110 e7edb4eabf8eebb281e8b8b0e2e3b8c3b0e99dadebb39feab69fe8a1b0e38f93eab9bde5b18befbd8deb87ae5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)