To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 姚붹떣æ 11100101101001111001101011101011101101101011100111101011100101101010001111100110 e5a79aebb6b9eb96a3e6
SJIS-WIN ?§??¶???£? 00111111100000011001100000111111001111111000000111110111001111110011111100111111100000011001001000111111 3f81983f3f81f73f3f3f81923f
EUC-JP å§?ë¶?ë?£æ 100011111010101110101001101000011111100000111111100011111010101110110011101000101111100100111111100011111010101110110011001111111010000111110010100011111010100111000001 8faba9a1f83f8fabb3a2f93f8fabb33fa1f28fa9c1
UTF-8 姚붹떣æ 1100001110100101110000101010011111000010100110101100001110101011110000101011011011000010101110011100001110101011110000101001011011000010101000111100001110100110 c3a5c2a7c29ac3abc2b6c2b9c3abc296c2a3c3a6
UHC ?§??¶¹???æ 0011111110100001110101110011111100111111101000101101001010101001111101100011111100111111001111111010100110100001 3fa1d73f3fa2d2a9f63f3f3fa9a1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)