Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN タ愠・チ・タ愠・チ・B 11000000111110101100000110100101110000011010010111000000111110101100000110100101110000011010010101000010 c0fac1a5c1a5c0fac1a5c1a542
EUC-JP タ?・チ・タ?・チ・B 10001110110000000011111110001110101001011000111011000001100011101010010110001110110000000011111110001110101001011000111011000001100011101010010101000010 8ec03f8ea58ec18ea58ec03f8ea58ec18ea542
UTF-8 タ愠・チ・タ愠・チ・B 11101111101111101000000011100110100001001010000011101111101111011010010111101111101111101000000111101111101111011010010111101111101111101000000011100110100001001010000011101111101111011010010111101111101111101000000111101111101111011010010101000010 efbe80e684a0efbda5efbe81efbda5efbe80e684a0efbda5efbe81efbda542
UHC ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
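
The conversion shown in the table can be approximated offline with a minimal Python sketch. The function name `encode_table` and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions, not something this page specifies. Characters that a charset cannot represent are replaced with `?` (0x3f), which is what the ISO-8859-1 and UHC rows above show. Because mapping tables differ between implementations, the CP932, EUC-JP, and UHC results may not match the table byte for byte.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = (
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
)


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    # The sample input from the table, written with escapes to avoid ambiguity.
    sample = "\uff80\u6120\uff65\uff81\uff65" * 2 + "B"
    for label, bits, hexa in encode_table(sample):
        print(label, bits, hexa)
```

Each row holds the same information as the "Bit string (binary)" and "Bit string (hexadecimal)" columns: the hexadecimal form is simply the binary form grouped four bits at a time.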