To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????F 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f46
SJIS-WIN 猷??受△?猷??受〓+??F 10010111010100010011111100111111100011101111001110000001101000100011111110010111010100010011111100111111100011101111001110000001101011001000000101111011001111110011111101000110 97513f3f8ef381a23f97513f3f8ef381ac817b3f3f46
EUC-JP 猷??受△?猷??受〓+??F 11001101101100100011111100111111101111001111010110100010101001000011111111001101101100100011111100111111101111001111010110100010101011101010000111011100001111110011111101000110 cdb23f3fbcf5a2a43fcdb23f3fbcf5a2aea1dc3f3f46
UTF-8 猷듸쨬受△뒄猷듸쨬受〓+玲둆F 11100111100011001011011111101011100100111011100011101100101010001010110011100101100011111001011111100010100101101011001111101011100100101000010011100111100011001011011111101011100100111011100011101100101010001010110011100101100011111001011111100011100000001001001111101111101111001000101111101111101001101010110111101011100100011000011001000110 e78cb7eb93b8eca8ace58f97e296b3eb9284e78cb7eb93b8eca8ace58f97e38093efbc8befa6adeb918646
UHC 猷듸쨬受△뒄猷듸쨬受〓+玲둆F 1110101110100011101101011110111110100100100001101110000111110100101000011110001010001010100000101110101110100011101101011110111110100100100001101110000111110100101000011110101110100011101010111110011110111111100010100100001001000110 eba3b5efa486e1f4a1e28a82eba3b5efa486e1f4a1eba3abe7bf8a4246

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)