To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 貉ソ雉ェ貉ソ譚捺ケソ隰晄ケソ螳滓ケソ逎 1110011010111001101111111110100010110011101010101110011010111001101111111110011010011101100100111110011010111001101111111110100010101100100111011110011010111001101111111110010110101110100111111110011010111001101111111110011110100011 e6b9bfe8b3aae6b9bfe69d93e6b9bfe8ac9de6b9bfe5ae9fe6b9bfe7a3
EUC-JP 貉ソ雉ェ貉ソ譚捺ケソ隰晄ケソ螳滓ケソ逎 1110110010111011100011101011111111110000101101011000111010101010111011001011101110001110101111111110101111111101110001101110100010001110101110011000111010111111111100001010111011011010111010001000111010111001100011101011111111101010101100001101111011101000100011101011100110001110101111111110111010100101 ecbb8ebff0b58eaaecbb8ebfebfdc6e88eb98ebff0aedae88eb98ebfeab0dee88eb98ebfeea5
UTF-8 貉ソ雉ェ貉ソ譚捺ケソ隰晄ケソ螳滓ケソ逎 111010001011001010001001111011111011110110111111111010011001101110001001111011111011110110101010111010001011001010001001111011111011110110111111111010001010110110011010111001101000110110111010111011111011110110111001111011111011110110111111111010011001101010110000111001101001100110000100111011111011110110111001111011111011110110111111111010001001111010110011111001101011101110010011111011111011110110111001111011111011110110111111111010011000000010001110 e8b289efbdbfe99b89efbdaae8b289efbdbfe8ad9ae68dbaefbdb9efbdbfe99ab0e69984efbdb9efbdbfe89eb3e6bb93efbdb9efbdbfe9808e
UHC ??雉???譚捺???晄??螳滓??? 00111111001111111111011011001011001111110011111100111111110100111100100111010001111101000011111100111111001111111111110011001101001111110011111111010011110110011110111010101011001111110011111100111111 3f3ff6cb3f3f3fd3c9d1f43f3f3ffccd3f3fd3d9eeab3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)