What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????????E | 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 | 3f3f3f3f3f3f3f3f3f3f45 |
| SJIS-WIN | 烏??毓?烏??毓?E | 100010010100011100111111001111111001111101111001001111111000100101000111001111110011111110011111011110010011111101000101 | 89473f3f9f793f89473f3f9f793f45 |
| EUC-JP | 烏??毓?烏??毓?E | 101100011010100000111111001111111101110111011010001111111011000110101000001111110011111111011101110110100011111101000101 | b1a83f3fddda3fb1a83f3fddda3f45 |
| UTF-8 | 烏띾슓毓풷烏띾슓毓풺E | 11100111100000111000111111101011100111011011111011101100100010101001001111100110101011111001001111101101100100101011011111100111100000111000111111101011100111011011111011101100100010101001001111100110101011111001001111101101100100101011101001000101 | e7838feb9dbeec8a93e6af93ed92b7e7838feb9dbeec8a93e6af93ed92ba45 |
| UHC | 烏띾슓毓풷烏띾슓毓풺E | 111010001010000110001101111010111001101010100010111010111011111010111111010110011110100010100001100011011110101110011010101000101110101110111110101111110110001001000101 | e8a18deb9aa2ebbebf59e8a18deb9aa2ebbebf6245 |

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
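
The conversion behind a table like this can be approximated in a few lines of Python. This is only a sketch, not the page's actual implementation. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `convert` are made up for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which would account for the runs of `3f` in the results.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Hypothetical sample input: one CJK character followed by an ASCII letter.
    for label, (bits, hex_string) in convert("烏E").items():
        print(label, bits, hex_string)
```

Using `errors="replace"` keeps the output aligned with the input even when a charset lacks a character. A stricter tool might instead raise an error or flag the affected row.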