To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN シス痔シス蒔 11110000101100111011110011110001100011101011110110001110101001001111000011111011101111001111000110001110101111011000111010101010 f0b3bcf18ebd8ea4f0fbbcf18ebd8eaa
EUC-JP ?シ?ス痔?シ?ス蒔 00111111100011101011110000111111100011101011110110111100101001100011111110001110101111000011111110001110101111011011110010101100 3f8ebc3f8ebdbca63f8ebc3f8ebdbcac
UTF-8 シス痔シス蒔 111011101000000110110010111011111011110110111100111011101000010010001001111011111011110110111101111001111001011110010100111011101000001010111010111011111011110110111100111011101000010010001001111011111011110110111101111010001001001010010100 ee81b2efbdbcee8489efbdbde79794ee82baefbdbcee8489efbdbde89294
UHC ????痔????蒔 001111110011111100111111001111111111011011000000001111110011111100111111001111111110001111001000 3f3f3f3ff6c03f3f3f3fe3c8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)