To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN タ螯ョ社ヤ齊アク謗シシ齊・ヒセ 11110000101110101100000011100101101001101010111010001110110100001101010011101010100011101011000110111000111001101000111010111100101111001110101010001110101001011100101110111110 f0bac0e5a6ae8ed0d4ea8eb1b8e68ebcbcea8ea5cbbe
EUC-JP ?タ螯ョ社ヤ齊アク謗シシ齊・ヒセ 00111111100011101100000011101010101010001000111010101110101111001101001010001110110101001111001111101110100011101011000110001110101110001110101111101110100011101011110010001110101111001111001111101110100011101010010110001110110010111000111010111110 3f8ec0eaa88eaebcd28ed4f3ee8eb18eb8ebee8ebc8ebcf3ee8ea58ecb8ebe
UTF-8 タ螯ョ社ヤ齊アク謗シシ齊・ヒセ 111011101000000110111001111011111011111010000000111010001001111010101111111011111011110110101110111001111010010010111110111011111011111010010100111010011011110110001010111011111011110110110001111011111011110110111000111010001010110010010111111011111011110110111100111011111011110110111100111010011011110110001010111011111011110110100101111011111011111010001011111011111011110110111110 ee81b9efbe80e89eafefbdaee7a4beefbe94e9bd8aefbdb1efbdb8e8ac97efbdbcefbdbce9bd8aefbda5efbe8befbdbe
UHC ????社?齊??謗??齊??? 0011111100111111001111110011111111011110111001000011111111110000101110100011111100111111110110111011111100111111001111111111000010111010001111110011111100111111 3f3f3f3fdee43ff0ba3f3fdbbf3f3ff0ba3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)