To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????????u^ | 00111111001111110011111100111111001111110011111100111111001111110111010101011110 | 3f3f3f3f3f3f3f3f755e |
| SJIS-WIN | 矚u^ | 111101011011110111110000111011101111010110111101111000011101111111110101101111011111011011100010111101011011110111110101101101010111010101011110 | f5bdf0eef5bde1dff5bdf6e2f5bdf5b5755e |
| EUC-JP | ???矚????u^ | 0011111100111111001111111110001011100001001111110011111100111111001111110111010101011110 | 3f3f3fe2e13f3f3f3f755e |
| UTF-8 | 矚u^ | 1110111010010000101010001110111010000010101011011110111010010000101010001110011110011111100110101110111010010000101010001110111010010100100010011110111010010000101010001110111010010000101000000111010101011110 | ee90a8ee82adee90a8e79f9aee90a8ee9489ee90a8ee90a0755e |
| UHC | ????????u^ | 00111111001111110011111100111111001111110011111100111111001111110111010101011110 | 3f3f3f3f3f3f3f3f755e |

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
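
A conversion of this kind can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function name `bit_strings` and the mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions, and Python's codecs may treat some characters (for example private-use code points) differently from the converter used here. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the table.

```python
def bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Assumed mapping from the charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "latin-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]

if __name__ == "__main__":
    sample = "矚u^"  # the visible characters from the table's input
    for label, codec in CHARSETS:
        binary, hexadecimal = bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

For the UTF-8 row, the bytes for 矚, u, and ^ come out as `e79f9a`, `75`, and `5e`, which are the same byte sequences that appear in the hexadecimal column above.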