To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??循?6?ν?猷 100101110101000100111111001111111000111101111010001111111000001001010101001111111000001111001011001111111001011101010001 97513f3f8f7a3f82553f83cb3f9751
EUC-JP 猷??循?6?ν?猷 110011011011001000111111001111111011110111011011001111111010001110110110001111111010011011001101001111111100110110110010 cdb23f3fbddb3fa3b63fa6cd3fcdb2
UTF-8 猷듯벀循낅6力ν몖猷 1110011110001100101101111110101110010011101011111110101110110010100000001110010110111110101010101110101110000010100001011110111110111100100101101110111110100110100010101100111010111101111010111010101010010110111001111000110010110111 e78cb7eb93afebb280e5beaaeb8285efbc96efa68acebdebaa96e78cb7
UHC 猷듯벀循낅6力ν몖猷 1110101110100011101101011110110110010011101001101110001011100000100001011110101110100011101101101110011010110011101001011110110110010001100001001110101110100011 eba3b5ed93a6e2e085eba3b6e6b3a5ed9184eba3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)