Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???n}???n{^ | 0011111100111111001111110110111001111101001111110011111100111111011011100111101101011110 | 3f3f3f6e7d3f3f3f6e7b5e |
| SJIS-WIN | 惹??n}惹??n{^ | 10001110111001000011111100111111011011100111110110001110111001000011111100111111011011100111101101011110 | 8ee43f3f6e7d8ee43f3f6e7b5e |
| EUC-JP | 惹??n}惹??n{^ | 10111100111001100011111100111111011011100111110110111100111001100011111100111111011011100111101101011110 | bce63f3f6e7dbce63f3f6e7b5e |
| UTF-8 | 惹곩립n}惹곩립n{^ | 1110011010000011101110011110101010110011101010011110101110100110101111010110111001111101111001101000001110111001111010101011001110101001111010111010011010111101011011100111101101011110 | e683b9eab3a9eba6bd6e7de683b9eab3a9eba6bd6e7b5e |
| UHC | 惹곩립n}惹곩립n{^ | 1110010110101001100000011110010110111000101100110110111001111101111001011010100110000001111001011011100010110011011011100111101101011110 | e5a981e5b8b36e7de5a981e5b8b36e7b5e |
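
A conversion like the one in the table above can be approximated locally. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper name `encode_row` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to match the behavior shown in the table.

```python
# Charset labels from the table mapped to Python codec names.
# This correspondence is an assumption, not taken from the page itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


sample = "惹곩립n}惹곩립n{^"
for label, codec in CODECS.items():
    shown, bits, hexstr = encode_row(sample, codec)
    print(label, shown, bits, hexstr)
```

Each printed line has the same four fields as a table row: charset, resulting characters, binary bit string, and hexadecimal bit string.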

SJIS-WIN, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).