To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???維??蟻??域??溢??揄щ?荏? 001111110011111100111111100010001101101100111111001111111000101101100001001111110011111110001000111001100011111100111111100010001110110000111111001111111001110110001001100001001000101100111111100010010110000000111111 3f3f3f88db3f3f8b613f3f88e63f3f88ec3f3f9d89848b3f89603f
EUC-JP ???維??蟻??域??溢??揄щ?荏? 001111110011111100111111101100001101110100111111001111111011010111000010001111110011111110110000111010000011111100111111101100001110111000111111001111111101100111101001101001111110101100111111101100011100000100111111 3f3f3fb0dd3f3fb5c23f3fb0e83f3fb0ee3f3fd9e9a7eb3fb1c13f
UTF-8 嶺뚢꽓維곭춱蟻뚣뀋域㏃슦溢꿰춯揄щ늉荏웘 1110111110100110101010111110101110011010101000101110101010111101100100111110011110110110101011011110101010110011101011011110110010110110101100011110100010011111101110111110101110011010101000111110101110000000100010111110010110011111100111111110001110001111100000111110110010001010101001101110011010111010101000101110101010111111101100001110110010110110101011111110011010001111100001001101000110001001111010111000101010001001111010001000110110001111111011001001101110011000 efa6abeb9aa2eabd93e7b6adeab3adecb6b1e89fbbeb9aa3eb808be59f9fe38f83ec8aa6e6baa2eabfb0ecb6afe68f84d189eb8a89e88d8fec9b98
UHC 嶺뚢꽓維곭춱蟻뚣뀋域㏃슦溢꿰춯揄щ늉荏웘 11100111101011011000110011100010100001001010001011101011101010111000000111100111101011011000110111101011111111001000110011100011100001011000011111100110101101001010011111101100100110101011000011101100111011101011001011100111101011011000110011101010111100011010110011101011101101001011111111101100111110111001111101101000 e7ad8ce284a2ebab81e7ad8debfc8ce38587e6b4a7ec9ab0eceeb2e7ad8ceaf1acebb4bfecfb9f68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)