To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 猷?┰循?+巍 | 100101110101000100111111100001001011101110001111011110100011111110000001011110111001101111011001 | 97513f84bb8f7a3f817b9bd9
EUC-JP | 猷?┰循?+巍 | 110011011011001000111111101010001011110110111101110110110011111110100001110111001101011011011011 | cdb23fa8bdbddb3fa1dcd6db
UTF-8 | 猷띠┰循용+巍 | 111001111000110010110111111010111001110110100000111000101001010010110000111001011011111010101010111011001001101010101001111011111011110010001011111001011011011110001101 | e78cb7eb9da0e294b0e5beaaec9aa9efbc8be5b78d
UHC | 猷띠┰循용+巍 | 1110101110100011101101101110110010100110101111011110001011100000101111111110101110100011101010111110100011100100 | eba3b6eca6bde2e0bfeba3abe8e4

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
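
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the tool's actual implementation: the function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions, and the output may differ from the tool's where the two handle unencodable characters differently.

```python
def to_bit_strings(text, charset):
    """Return (binary, hexadecimal) strings for `text` encoded in `charset`."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset
    # cannot represent, which mirrors the 3f bytes seen in the table.
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("猷", codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.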