How is a character (or a short string) encoded as a bit string in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 猷??鎖?猷??鎖?B 100101110101000100111111001111111000110110111101001111111001011101010001001111110011111110001101101111010011111101000010 97513f3f8dbd3f97513f3f8dbd3f42
EUC-JP 猷??鎖?猷??鎖?B 110011011011001000111111001111111011101010111111001111111100110110110010001111110011111110111010101111110011111101000010 cdb23f3fbabf3fcdb23f3fbabf3f42
UTF-8 猷듭쪡鎖쒩猷듭쪡鎖쒩B 11100111100011001011011111101011100100111010110111101100101010101010000111101001100011101001011011101100100100101010100111100111100011001011011111101011100100111010110111101100101010101010000111101001100011101001011011101100100100101010100101000010 e78cb7eb93adecaaa1e98e96ec92a9e78cb7eb93adecaaa1e98e96ec92a942
UHC 猷듭쪡鎖쒩猷듭쪡鎖쒩B 111010111010001110110101111011001010010110011010111000011111000010011100111111101110101110100011101101011110110010100101100110101110000111110000100111001111111001000010 eba3b5eca59ae1f09cfeeba3b5eca59ae1f09cfe42

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
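
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The mapping from the charset labels to Python codec names is an assumption (in particular, SJIS-WIN is taken to be cp932 and UHC to be cp949), and the helper name encode_as_bits is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-WIN corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_as_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Hypothetical helper: unencodable characters become "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    # Render every byte as exactly eight binary digits.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "猷B"  # first and last characters of the table's input
    for label, codec in CODECS.items():
        binary, hexadecimal = encode_as_bits(sample, codec)
        print(label, binary, hexadecimal)
```

For the input "猷B", the UTF-8 row of this sketch yields the hex string e78cb742 and the ISO-8859-1 row yields 3f42, consistent with the corresponding bytes in the table.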