What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 螳肴ケソ髴、蜍コ螳毆螳肴ケソ髴、蜍コ螳毆^ 111001011010111010001101111001101011100110111111111010011001110010100100111001011000101110111010111001011010111010011111011101111110010110101110100011011110011010111001101111111110100110011100101001001110010110001011101110101110010110101110100111110111011101011110 e5ae8de6b9bfe99ca4e58bbae5ae9f77e5ae8de6b9bfe99ca4e58bbae5ae9f775e
EUC-JP 螳肴ケソ髴、蜍コ螳毆螳肴ケソ髴、蜍コ螳毆^ 1110101010110000101110101110100010001110101110011000111010111111111100011111110010001110101001001110100111101011100011101011101011101010101100001101110111011000111010101011000010111010111010001000111010111001100011101011111111110001111111001000111010100100111010011110101110001110101110101110101010110000110111011101100001011110 eab0bae88eb98ebff1fc8ea4e9eb8ebaeab0ddd8eab0bae88eb98ebff1fc8ea4e9eb8ebaeab0ddd85e
UTF-8 螳肴ケソ髴、蜍コ螳毆螳肴ケソ髴、蜍コ螳毆^ 11101000100111101011001111101000100000101011010011101111101111011011100111101111101111011011111111101001101010111011010011101111101111011010010011101000100111001000110111101111101111011011101011101000100111101011001111100110101011111000011011101000100111101011001111101000100000101011010011101111101111011011100111101111101111011011111111101001101010111011010011101111101111011010010011101000100111001000110111101111101111011011101011101000100111101011001111100110101011111000011001011110 e89eb3e882b4efbdb9efbdbfe9abb4efbda4e89c8defbdbae89eb3e6af86e89eb3e882b4efbdb9efbdbfe9abb4efbda4e89c8defbdbae89eb3e6af865e
UHC 螳肴??????螳毆螳肴??????螳毆^ 1101001111011001111111011010001000111111001111110011111100111111001111110011111111010011110110011100111110110010110100111101100111111101101000100011111100111111001111110011111100111111001111111101001111011001110011111011001001011110 d3d9fda23f3f3f3f3f3fd3d9cfb2d3d9fda23f3f3f3f3f3fd3d9cfb25e

SJIS-Win, EUC-JP: Legacy character encodings used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
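
The conversion shown in the table above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The `CODECS` mapping from the table's charset labels to Python codec names is an assumption (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and `encode_table` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what the table does as well, but Python's codecs may not agree byte for byte with this page for every character.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("^"):
        print(label, bits, hex_string)
```

For the input `^`, every charset yields the single byte `5e` (`01011110`), which matches the final byte of each row in the table.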