Which bit string is a character (or a short string of characters) encoded as in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ????????? | 001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | 巍ロ??????? | 1001101111011001100000111000110100111111001111110011111100111111001111110011111100111111 | 9bd9838d3f3f3f3f3f3f3f |
| EUC-JP | 巍ロ??????? | 1101011011011011101001011110110100111111001111110011111100111111001111110011111100111111 | d6dba5ed3f3f3f3f3f3f3f |
| UTF-8 | 巍ロ룉溜뤿툍溜볥젻 | 111001011011011110001101111000111000001110101101111010111010001110001001111011111010011110001011111010111010010010111111111011011000100010001101111011111010011110001011111010111011001110100101111011001010000010111011 | e5b78de383adeba389efa78beba4bfed888defa78bebb3a5eca0bb |
| UHC | 巍ロ룉溜뤿툍溜볥젻 | 111010001110010010101011111011011000111110001000111010101111111010001111111010111011100010000101111010101111111010010011111010111010000010101110 | e8e4abed8f88eafe8febb885eafe93eba0ae |

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
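
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page. The mapping from charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) and the helper names `CODECS`, `encode_row`, and `convert` are assumptions made for illustration. Python's codecs may also differ from this page's converter for some characters. The sketch assumes that a character a charset cannot represent is replaced with "?" (0x3f), which is what the ISO-8859-1 row suggests.

```python
from __future__ import annotations

# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text: str, codec: str) -> tuple[str, str, str]:
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip in this charset.
    shown = raw.decode(codec, errors="replace")
    # Each byte becomes exactly eight bits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text: str) -> dict[str, tuple[str, str, str]]:
    """Build one table row per charset label for the given input text."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in convert("ロ").items():
        print(label, shown, bits, hexstr)
```

Calling `convert` with a longer string, such as the one in the table, gives one row per charset in the same column order: character, binary, hexadecimal.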