To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 灼璽治上シ縞酌而韆灼璽治上シ縞酌而靼^ 1000111011011100100011101010001110001110101000011000111111100011101111001000111011001000100011101101111010001110101001111110100011100110100011101101110010001110101000111000111010100001100011111110001110111100100011101100100010001110110111101000111010100111111010001101100001011110 8edc8ea38ea18fe3bc8ec88ede8ea7e8e68edc8ea38ea18fe3bc8ec88ede8ea7e8d85e
EUC-JP 灼璽治上シ縞酌而韆灼璽治上シ縞酌而靼^ 10111100110111101011110010100101101111001010001110111110111001011000111010111100101111001100101010111100111000001011110010101001111100001110100010111100110111101011110010100101101111001010001110111110111001011000111010111100101111001100101010111100111000001011110010101001111100001101101001011110 bcdebca5bca3bee58ebcbccabce0bca9f0e8bcdebca5bca3bee58ebcbccabce0bca9f0da5e
UTF-8 灼璽治上シ縞酌而韆灼璽治上シ縞酌而靼^ 11100111100000011011110011100111100100101011110111100110101100101011101111100100101110001000101011101111101111011011110011100111101110001001111011101001100001011000110011101000100000001000110011101001100111111000011011100111100000011011110011100111100100101011110111100110101100101011101111100100101110001000101011101111101111011011110011100111101110001001111011101001100001011000110011101000100000001000110011101001100111011011110001011110 e781bce792bde6b2bbe4b88aefbdbce7b89ee9858ce8808ce99f86e781bce792bde6b2bbe4b88aefbdbce7b89ee9858ce8808ce99dbc5e
UHC 灼璽治上?縞酌而韆灼璽治上?縞酌而?^ 11101101110001111101111111011110111101101011110111011111101111100011111111111011110101101110110111001100111011001011101111110100110001111110110111000111110111111101111011110110101111011101111110111110001111111111101111010110111011011100110011101100101110110011111101011110 edc7dfdef6bddfbe3ffbd6edccecbbf4c7edc7dfdef6bddfbe3ffbd6edccecbb3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)