To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 踝ス鄒」鋺キ鄒夂弡鄒嘲踝ス鄒」鋺キ鄒夂弡鄒嘴^ 111001101111010010111101111001111011111010100011111001111111101010110111111001111011111010011010111001111111101010110111111001111011111010011010011111011110011011110100101111011110011110111110101000111110011111111010101101111110011110111110100110101110011111111010101101111110011110111110100110100111101101011110 e6f4bde7bea3e7fab7e7be9ae7fab7e7be9a7de6f4bde7bea3e7fab7e7be9ae7fab7e7be9a7b5e
EUC-JP 踝ス鄒」鋺キ鄒夂弡鄒嘲踝ス鄒」鋺キ鄒夂弡鄒嘴^ 1110110011110110100011101011110111101110110000001000111010100011111011101111110010001110101101111110111011000000110101001110100110001111101111001110010011101110110000001101001111011110111011001111011010001110101111011110111011000000100011101010001111101110111111001000111010110111111011101100000011010100111010011000111110111100111001001110111011000000110100111101110001011110 ecf68ebdeec08ea3eefc8eb7eec0d4e98fbce4eec0d3deecf68ebdeec08ea3eefc8eb7eec0d4e98fbce4eec0d3dc5e
UTF-8 踝ス鄒」鋺キ鄒夂弡鄒嘲踝ス鄒」鋺キ鄒夂弡鄒嘴^ 11101000101110001001110111101111101111011011110111101001100001001001001011101111101111011010001111101001100010111011101011101111101111011011011111101001100001001001001011100101101001001000001011100101101111001010000111101001100001001001001011100101100110001011001011101000101110001001110111101111101111011011110111101001100001001001001011101111101111011010001111101001100010111011101011101111101111011011011111101001100001001001001011100101101001001000001011100101101111001010000111101001100001001001001011100101100110001011010001011110 e8b89defbdbde98492efbda3e98bbaefbdb7e98492e5a482e5bca1e98492e598b2e8b89defbdbde98492efbda3e98bbaefbdb7e98492e5a482e5bca1e98492e598b45e
UHC ??鄒???鄒??鄒嘲??鄒???鄒??鄒嘴^ 00111111001111111111010111011011001111110011111100111111111101011101101100111111001111111111010111011011111100001011111100111111001111111111010111011011001111110011111100111111111101011101101100111111001111111111010111011011111101101010010001011110 3f3ff5db3f3f3ff5db3f3ff5dbf0bf3f3ff5db3f3f3ff5db3f3ff5dbf6a45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)