To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 烏??魏??諛 10001001010001110011111100111111111010011011000000111111001111111110011010000111 89473f3fe9b03f3fe687
EUC-JP 烏??魏??諛 10110001101010000011111100111111111100101011001000111111001111111110101111100111 b1a83f3ff2b23f3febe7
UTF-8 烏띾슗魏좂뿆諛 111001111000001110001111111010111001110110111110111011001000101010010111111010011010110110001111111011001010001010000010111010111011111110000110111010001010101110011011 e7838feb9dbeec8a97e9ad8feca282ebbf86e8ab9b
UHC 烏띾슗魏좂뿆諛 1110100010100001100011011110101110011010101001101110101011100000101000001110011110010111100011011110101110110000 e8a18deb9aa6eae0a0e7978debb0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)