What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 貉ソ闃晄ケソ逞疲ケソ闃晄ケソ螳滓ケソ闃 1110011010111001101111111110100010001010100111011110011010111001101111111110011110010111100101001110011010111001101111111110100010001010100111011110011010111001101111111110010110101110100111111110011010111001101111111110100010001010 e6b9bfe88a9de6b9bfe79794e6b9bfe88a9de6b9bfe5ae9fe6b9bfe88a
EUC-JP 貉ソ闃晄ケソ逞疲ケソ闃晄ケソ螳滓ケソ闃 1110110010111011100011101011111111101111111010101101101011101000100011101011100110001110101111111110110111110111110010001110100010001110101110011000111010111111111011111110101011011010111010001000111010111001100011101011111111101010101100001101111011101000100011101011100110001110101111111110111111101010 ecbb8ebfefeadae88eb98ebfedf7c8e88eb98ebfefeadae88eb98ebfeab0dee88eb98ebfefea
UTF-8 貉ソ闃晄ケソ逞疲ケソ闃晄ケソ螳滓ケソ闃 111010001011001010001001111011111011110110111111111010011001011110000011111001101001100110000100111011111011110110111001111011111011110110111111111010011000000010011110111001111001011010110010111011111011110110111001111011111011110110111111111010011001011110000011111001101001100110000100111011111011110110111001111011111011110110111111111010001001111010110011111001101011101110010011111011111011110110111001111011111011110110111111111010011001011110000011 e8b289efbdbfe99783e69984efbdb9efbdbfe9809ee796b2efbdb9efbdbfe99783e69984efbdb9efbdbfe89eb3e6bb93efbdb9efbdbfe99783
UHC ???晄??逞疲???晄??螳滓??? 00111111001111110011111111111100110011010011111100111111110101101100000111111001101010100011111100111111001111111111110011001101001111110011111111010011110110011110111010101011001111110011111100111111 3f3f3ffccd3f3fd6c1f9aa3f3f3ffccd3f3fd3d9eeab3f3f3f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
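
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: the function name `encode_table` and the mapping of the table's charset labels to Python codec names (SJIS-WIN as `cp932`, UHC as `cp949`) are assumptions, and characters that a charset cannot represent are replaced with "?" (hex 3f), which matches what the ISO-8859-1 and UHC rows suggest. Exact byte mappings may differ slightly from the tool's for some characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-win is treated as CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC is treated as CP949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

For a plain ASCII input such as "A", every row comes out as 01000001 / 41, because all five charsets are ASCII-compatible in that range. The rows diverge only for characters outside ASCII.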