To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??恂??猷??僥?6韋??猷??嗽?+ 100101110101000100111111001111111001110010010110001111110011111110010111010100010011111100111111100110010100011000111111100000100101010111101000111010000011111100111111100101110101000100111111001111111001101001110101001111111000000101111011 97513f3f9c963f3f97513f3f99463f8255e8e83f3f97513f3f9a753f817b
EUC-JP 猷??恂??猷??僥?6韋??猷??嗽?+ 110011011011001000111111001111111101011111110110001111110011111111001101101100100011111100111111110100011010011100111111101000111011011011110000111010100011111100111111110011011011001000111111001111111101001111010110001111111010000111011100 cdb23f3fd7f63f3fcdb23f3fd1a73fa3b6f0ea3f3fcdb23f3fd3d63fa1dc
UTF-8 猷띠컠恂먥뼦猷띠썿僥뚮6韋앯탞猷댄뼳嗽먮+ 111001111000110010110111111010111001110110100000111011001011101110100000111001101000000110000010111010111010100010100101111010111011110010100110111001111000110010110111111010111001110110100000111011001000110110111111111001011000001110100101111010111001101010101110111011111011110010010110111010011001111110001011111011001001010110101111111011011000001110011110111001111000110010110111111010111000110010000100111010111011110010110011111001011001011110111101111010111010100010101110111011111011110010001011 e78cb7eb9da0ecbba0e68182eba8a5ebbca6e78cb7eb9da0ec8dbfe583a5eb9aaeefbc96e99f8bec95afed839ee78cb7eb8c84ebbcb3e597bdeba8aeefbc8b
UHC 猷띠컠恂먥뼦猷띠썿僥뚮6韋앯탞猷댄뼳嗽먮+ 111010111010001110110110111011001011000010001011111000101110000110010000111000101001011010101001111010111010001110110110111011001001101110101001111010001110100110001100111010111010001110110110111010101101111110011101111001111011010110000010111010111010001110110100111011011001011010110110111000011111010110010000111010111010001110101011 eba3b6ecb08be2e190e296a9eba3b6ec9ba9e8e98ceba3b6eadf9de7b582eba3b4ed96b6e1f590eba3ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)