To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弔?瀞??縡?虞?除???畯孟∧梯?畯貊? 1001001010100010001111111001001111010010001111110011111111100011011100010011111110001011111100010011111110001111100111000011111100111111001111111111101101101111100101101101000010000001110010001001001011110010001111111111101101101111111001101011101100111111 92a23f93d23f3fe3713f8bf13f8f9c3f3f3ffb6f96d081c892f23ffb6fe6bb3f
EUC-JP 弔?瀞?汶縡?虞?除???畯孟∧梯?畯貊? 110001001010010000111111110001101101010000111111100011111100011011100101111001011101001000111111101101101111001100111111101111011111110000111111001111110011111110001111110011011011101111001100110100101010001011001010110001001111010000111111100011111100110110111011111011001011110100111111 c4a43fc6d43f8fc6e5e5d23fb6f33fbdfc3f3f3f8fcdbbccd2a2cac4f43f8fcdbbecbd3f
UTF-8 弔렲瀞펨汶縡렕虞렧除곌렕렟畯孟∧梯렟畯貊렎 111001011011110010010100111010111010000010110010111001111000000010011110111011011000111010101000111001101011000110110110111001111011100010100001111010111010000010010101111010001001100110011110111010111010000010100111111010011001100110100100111010101011001110001100111010111010000010010101111010111010000010011111111001111001010110101111111001011010110110011111111000101000100010100111111001101010001010101111111010111010000010011111111001111001010110101111111010001011001010001010111010111010000010001110 e5bc94eba0b2e7809eed8ea8e6b1b6e7b8a1eba095e8999eeba0a7e999a4eab38ceba095eba09fe795afe5ad9fe288a7e6a2afeba09fe795afe8b28aeba08e
UHC 弔렲瀞펨汶縡렕虞렧除곌렕렟畯孟∧梯렟畯貊렎 111100001100000010001110101111111110111111100111110001101110100011011010101000011110111010101101100011101010101011101001111001011000111010110110111100001011011010110000111010101000111010101010100011101011000011110001111000011101100011101011101000011111110011110000101011001000111010110000111100011110000111011000111001111000111010100100 f0c08ebfefe7c6e8daa1eead8eaae9e58eb6f0b6b0ea8eaa8eb0f1e1d8eba1fcf0ac8eb0f1e1d8e78ea4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)