What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??Z^h??Z^fN}??Z^h??Z^fN{^ 00111111001111110101101001011110011010000011111100111111010110100101111001100110010011100111110100111111001111110101101001011110011010000011111100111111010110100101111001100110010011100111101101011110 3f3f5a5e683f3f5a5e664e7d3f3f5a5e683f3f5a5e664e7b5e
SJIS-WIN 叩賊Z^h叩賊Z^fN}叩賊Z^h叩賊Z^fN{^ 100100100100000010010001101011110101101001011110011010001001001001000000100100011010111101011010010111100110011001001110011111011001001001000000100100011010111101011010010111100110100010010010010000001001000110101111010110100101111001100110010011100111101101011110 924091af5a5e68924091af5a5e664e7d924091af5a5e68924091af5a5e664e7b5e
EUC-JP 叩賊Z^h叩賊Z^fN}叩賊Z^h叩賊Z^fN{^ 110000111010000111000010101100010101101001011110011010001100001110100001110000101011000101011010010111100110011001001110011111011100001110100001110000101011000101011010010111100110100011000011101000011100001010110001010110100101111001100110010011100111101101011110 c3a1c2b15a5e68c3a1c2b15a5e664e7dc3a1c2b15a5e68c3a1c2b15a5e664e7b5e
UTF-8 叩賊Z^h叩賊Z^fN}叩賊Z^h叩賊Z^fN{^ 1110010110001111101010011110100010110011100010100101101001011110011010001110010110001111101010011110100010110011100010100101101001011110011001100100111001111101111001011000111110101001111010001011001110001010010110100101111001101000111001011000111110101001111010001011001110001010010110100101111001100110010011100111101101011110 e58fa9e8b38a5a5e68e58fa9e8b38a5a5e664e7de58fa9e8b38a5a5e68e58fa9e8b38a5a5e664e7b5e
UHC 叩賊Z^h叩賊Z^fN}叩賊Z^h叩賊Z^fN{^ 110011011011000011101110111001000101101001011110011010001100110110110000111011101110010001011010010111100110011001001110011111011100110110110000111011101110010001011010010111100110100011001101101100001110111011100100010110100101111001100110010011100111101101011110 cdb0eee45a5e68cdb0eee45a5e664e7dcdb0eee45a5e68cdb0eee45a5e664e7b5e
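
The rows above can be reproduced offline with a short script. This is a minimal sketch, not the page's actual implementation: the helper name `encode_row` is hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?`, which appears to be what the ISO-8859-1 row shows.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "叩賊Z^h"  # first five characters of the input shown above
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

For the sample `叩賊Z^h`, the UTF-8 hex string comes out as `e58fa9e8b38a5a5e68`, which matches the start of the UTF-8 row in the table.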

SJIS-WIN, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).