What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 治爨捨セュ蛇蕉ヤ辞篠セュ璽セャ質セャ鴆辞蒔セャ紗 10001110101000011110000010100100100011101100110010111110101011011000111011010110100011111101010011010100100011101010101110001110110000101011111010101101100011101010001110111110101011001000111010111111101111101010110011101001111011111000111010101011100011101010101010111110101011001000111011010001 8ea1e0a48eccbead8ed68fd4d48eab8ec2bead8ea3beac8ebfbeace9ef8eab8eaabeac8ed1
EUC-JP 治爨捨セュ蛇蕉ヤ辞篠セュ璽セャ質セャ鴆辞蒔セャ紗 101111001010001111100000101001101011110011001110100011101011111010001110101011011011110011011000101111101101011010001110110101001011110010101101101111001100010010001110101111101000111010101101101111001010010110001110101111101000111010101100101111001100000110001110101111101000111010101100111100101111000110111100101011011011110010101100100011101011111010001110101011001011110011010011 bca3e0a6bcce8ebe8eadbcd8bed68ed4bcadbcc48ebe8eadbca58ebe8eacbcc18ebe8eacf2f1bcadbcac8ebe8eacbcd3
UTF-8 治爨捨セュ蛇蕉ヤ辞篠セュ璽セャ質セャ鴆辞蒔セャ紗 111001101011001010111011111001111000100010101000111001101000110110101000111011111011110110111110111011111011110110101101111010001001101110000111111010001001010110001001111011111011111010010100111010001011111010011110111001111010111110100000111011111011110110111110111011111011110110101101111001111001001010111101111011111011110110111110111011111011110110101100111010001011001110101010111011111011110110111110111011111011110110101100111010011011010010000110111010001011111010011110111010001001001010010100111011111011110110111110111011111011110110101100111001111011010010010111 e6b2bbe788a8e68da8efbdbeefbdade89b87e89589efbe94e8be9ee7afa0efbdbeefbdade792bdefbdbeefbdace8b3aaefbdbeefbdace9b486e8be9ee89294efbdbeefbdace7b497
UHC 治?捨??蛇蕉??篠??璽??質????蒔??紗 111101101011110100111111110111101101011100111111001111111101111011101111111101011010111100111111001111111110000111000110001111110011111111011111110111100011111100111111111100101111010100111111001111110011111100111111111000111100100000111111001111111101111011101001 f6bd3fded73f3fdeeff5af3f3fe1c63f3fdfde3f3ff2f53f3f3f3fe3c83f3fdee9

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
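
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The helper name `encode_row` and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, `latin-1` for ISO-8859-1) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (hex string, bit string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return data.hex(), bits


if __name__ == "__main__":
    sample = "\u6cbb"  # the first character of the table's input
    for label, codec in CODECS.items():
        hex_str, bit_str = encode_row(sample, codec)
        print(label, bit_str, hex_str)
```

Each byte is rendered as exactly eight binary digits, so the bit string is always four times as long as the hexadecimal string.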