To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 軟??軟??軟?? 100100111110111000111111001111111001001111101110001111110011111110010011111011100011111100111111 93ee3f3f93ee3f3f93ee3f3f
EUC-JP 軟??軟??軟?? 110001101111000000111111001111111100011011110000001111110011111111000110111100000011111100111111 c6f03f3fc6f03f3fc6f03f3f
UTF-8 軟쏙쫵軟썼퀪軟쏙쫵 111010001011101110011111111011001000111110011001111011001010101110110101111010001011101110011111111011001000110110111100111011011000000010101010111010001011101110011111111011001000111110011001111011001010101110110101 e8bb9fec8f99ecabb5e8bb9fec8dbced80aae8bb9fec8f99ecabb5
UHC 軟쏙쫵軟썼퀪軟쏙쫵 111001101110001110111101111011111010011010001100111001101110001110111101111010001011001110011110111001101110001110111101111011111010011010001100 e6e3bdefa68ce6e3bde8b39ee6e3bdefa68c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)