What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???n}???n{^ 0011111100111111001111110110111001111101001111110011111100111111011011100111101101011110 3f3f3f6e7d3f3f3f6e7b5e
SJIS-WIN 聿埼皐n}聿埼皐n{^ 1110001111100100100011011110100110001110010010000110111001111101111000111110010010001101111010011000111001001000011011100111101101011110 e3e48de98e486e7de3e48de98e486e7b5e
EUC-JP 聿埼皐n}聿埼皐n{^ 1110011011100110101110101110101110111011101010010110111001111101111001101110011010111010111010111011101110101001011011100111101101011110 e6e6baebbba96e7de6e6baebbba96e7b5e
UTF-8 聿埼皐n}聿埼皐n{^ 1110100010000001101111111110010110011111101111001110011110011010100100000110111001111101111010001000000110111111111001011001111110111100111001111001101010010000011011100111101101011110 e881bfe59fbce79a906e7de881bfe59fbce79a906e7b5e
UHC 聿埼皐n}聿埼皐n{^ 1110101111010011110100001111001011001101110000010110111001111101111010111101001111010000111100101100110111000001011011100111101101011110 ebd3d0f2cdc16e7debd3d0f2cdc16e7b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
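
The rows of the table above can be reproduced with a short Python sketch. This is not the converter's own code. The helper names `encode_row` and `encode_table` are hypothetical, and the mapping of each charset label to a Python codec (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Replacing unencodable characters with "?" is also an assumption, chosen because it matches the ISO-8859-1 row.

```python
# Minimal sketch: encode a string in several character sets and show the
# resulting bytes as a binary string and a hexadecimal string.
# The label-to-codec mapping is an assumption, not taken from the tool.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Python lists "uhc" as an alias of cp949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset
    # cannot represent (assumed behaviour, consistent with the table).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("聿埼皐n}聿埼皐n{^").items():
        print(label, binary, hexadecimal)
```

Each character's byte sequence depends on the charset, while the ASCII characters (n, }, {, ^) encode to the same single bytes (6e, 7d, 7b, 5e) in every row.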