To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????E 0011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f45
SJIS-WIN 而兢??而矜φE 10001110101001111001100101011101001111110011111110001110101001111110000111100000100000111101001101000101 8ea7995d3f3f8ea7e1e083d345
EUC-JP 而兢??而矜φE 10111100101010011101000110111110001111110011111110111100101010011110001011100010101001101101010101000101 bca9d1be3f3fbca9e2e2a6d545
UTF-8 而兢렯렊而矜φE 111010001000000010001100111001011000010110100010111010111010000010101111111010111010000010001010111010001000000010001100111001111001111110011100110011111000011001000101 e8808ce585a2eba0afeba08ae8808ce79f9ccf8645
UHC 而兢렯렊而矜φE 111011001011101111010000111001111000111010111100100011101010000111101100101110111101000011101000101001011111010101000101 ecbbd0e78ebc8ea1ecbbd0e8a5f545

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)