To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 秧?????轅?? 1110001001011110001111110011111100111111001111110011111111100111011101100011111100111111 e25e3f3f3f3f3fe7763f3f
EUC-JP 秧??庾??轅?? 11100011101111110011111100111111100011111011110011001110001111110011111111101101110101110011111100111111 e3bf3f3f8fbcce3f3fedd73f3f
UTF-8 秧덈떣庾룟윜轅깅탺 111001111010011110100111111010111000110110001000111010111001011010100011111001011011101010111110111010111010001110011111111011001001110010011100111010001011110110000101111010101011100110000101111011011000001110111010 e7a7a7eb8d88eb96a3e5babeeba39fec9c9ce8bd85eab985ed83ba
UHC 秧덈떣庾룟윜轅깅탺 111001001110101110001000111010111000101110110111111010101110110010110111111001011001111110011111111010101011111110110001111010111011010110010110 e4eb88eb8bb7eaecb7e59f9feabfb1ebb596

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)