To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??嗽?+恁??猷ユ?猷??嚥▼?恂 1001011101010001001111110011111110011010011101010011111110000001011110111001110010001100001111110011111110010111010100011000001110000110001111111001011101010001001111110011111110011010100010111000000110100101001111111001110010010110 97513f3f9a753f817b9c8c3f3f975183863f97513f3f9a8b81a53f9c96
EUC-JP 猷??嗽?+恁??猷ユ?猷??嚥▼?恂 1100110110110010001111110011111111010011110101100011111110100001110111001101011111101100001111110011111111001101101100101010010111100110001111111100110110110010001111110011111111010011111010111010001010100111001111111101011111110110 cdb23f3fd3d63fa1dcd7ec3f3fcdb2a5e63fcdb23f3fd3eba2a73fd7f6
UTF-8 猷댄뼳嗽먮+恁㎯궠猷ユ젗猷띕뇾嚥▼띂恂 111001111000110010110111111010111000110010000100111010111011110010110011111001011001011110111101111010111010100010101110111011111011110010001011111001101000000110000001111000111000111010101111111010101011011010100000111001111000110010110111111000111000001110100110111011001010000010010111111001111000110010110111111010111001110110010101111010111000011110111110111001011001101010100101111000101001011010111100111010111001110110000010111001101000000110000010 e78cb7eb8c84ebbcb3e597bdeba8aeefbc8be68181e38eafeab6a0e78cb7e383a6eca097e78cb7eb9d95eb87bee59aa5e296bceb9d82e68182
UHC 猷댄뼳嗽먮+恁㎯궠猷ユ젗猷띕뇾嚥▼띂恂 1110101110100011101101001110110110010110101101101110000111110101100100001110101110100011101010111110110011110110101001111110001110000010101100111110101110100011101010111110011010100000100100111110101110100011101101101110101110000111100111111110011010111111101000011110010110001101101111011110001011100001 eba3b4ed96b6e1f590eba3abecf6a7e382b3eba3abe6a093eba3b6eb879fe6bfa1e58dbde2e1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)