To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷?Ъ耶??猷レ?徇 10010111010100010011111110000100010110111001011011101011001111110011111110010111010100011000001110001100001111111001110001101101 97513f845b96eb3f3f9751838c3f9c6d
EUC-JP 猷?Ъ耶??猷レ?徇 11001101101100100011111110100111101111001100110011101101001111110011111111001101101100101010010111101100001111111101011111001110 cdb23fa7bccced3f3fcdb2a5ec3fd7ce
UTF-8 猷들Ъ耶섅굜猷レ퐪徇 1110011110001100101101111110101110010011101001001101000010101010111010001000000010110110111011001000010010000101111010101011010110011100111001111000110010110111111000111000001110101100111011011001000010101010111001011011111010000111 e78cb7eb93a4d0aae880b6ec8485eab59ce78cb7e383aced90aae5be87
UHC 猷들Ъ耶섅굜猷レ퐪徇 1110101110100011101101011110100110101100101111001110010110101101100110001110001110000010100001001110101110100011101010111110110010111101100100111110001011011111 eba3b5e9acbce5ad98e38284eba3abecbd93e2df

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)