To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????????B | 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f3f3f3f3f42 |
| SJIS-WIN | 猷??佚?猷??佚?B | 100101110101000100111111001111111001100011000011001111111001011101010001001111110011111110011000110000110011111101000010 | 97513f3f98c33f97513f3f98c33f42 |
| EUC-JP | 猷??佚?猷??佚?B | 110011011011001000111111001111111101000011000101001111111100110110110010001111110011111111010000110001010011111101000010 | cdb23f3fd0c53fcdb23f3fd0c53f42 |
| UTF-8 | 猷드뭬佚촫猷드뭬佚촫B | 11100111100011001011011111101011100100111001110011101011101011011010110011100100101111011001101011101100101101001010101111100111100011001011011111101011100100111001110011101011101011011010110011100100101111011001101011101100101101001010101101000010 | e78cb7eb939cebadace4bd9aecb4abe78cb7eb939cebadace4bd9aecb4ab42 |
| UHC | 猷드뭬佚촫猷드뭬佚촫B | 111010111010001110110101111001011011100110111110111011001110101010101100011010011110101110100011101101011110010110111001101111101110110011101010101011000110100101000010 | eba3b5e5b9beeceaac69eba3b5e5b9beeceaac6942 |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
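
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `encode_table` and the `CHARSETS` list are hypothetical. The codec choices are assumptions: `cp932` for SJIS-WIN (following the note above) and `cp949` for UHC. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which is how the unencodable characters appear in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),   # assumption: SJIS-Win = CP932, per the note above
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),        # assumption: Python's cp949 codec stands in for UHC
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset.

    Characters the charset cannot represent are encoded as '?' (0x3f).
    """
    rows = []
    for label, codec in charsets:
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("B"):
        print(label, bits, hexstr)
```

For the single character `B`, every charset listed yields the same byte, `42`, which matches the final byte of each row in the table. Results for other characters may differ slightly from the page if its converter uses different mapping tables than Python's codecs.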