To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 受〓+??6壬??巡??猷ヨ?猷??恂?5 1000111011110011100000011010110010000001011110110011111100111111100000100101010110010000011100000011111100111111100011111000010000111111001111111001011101010001100000111000100000111111100101110101000100111111001111111001110010010110001111111000001001010100 8ef381ac817b3f3f825590703f3f8f843f3f975183883f97513f3f9c963f8254
EUC-JP 受〓+??6壬??巡??猷ヨ?猷??恂?5 1011110011110101101000101010111010100001110111000011111100111111101000111011011010111111110100010011111100111111101111011110010000111111001111111100110110110010101001011110100000111111110011011011001000111111001111111101011111110110001111111010001110110101 bcf5a2aea1dc3f3fa3b6bfd13f3fbde43f3fcdb2a5e83fcdb23f3fd7f63fa3b5
UTF-8 受〓+娛뤿6壬듽깛巡볛뼞猷ヨ뼅猷띤븡恂볥5 111001011000111110010111111000111000000010010011111011111011110010001011111001011010100010011011111010111010010010111111111011111011110010010110111001011010001110101100111010111001001110111101111010101011100110011011111001011011011110100001111010111011001110011011111010111011110010011110111001111000110010110111111000111000001110101000111010111011110010000101111001111000110010110111111010111001110110100100111010111011100010100001111001101000000110000010111010111011001110100101111011111011110010010101 e58f97e38093efbc8be5a89beba4bfefbc96e5a3aceb93bdeab99be5b7a1ebb39bebbc9ee78cb7e383a8ebbc85e78cb7eb9da4ebb8a1e68182ebb3a5efbc95
UHC 受〓+娛뤿6壬듽깛巡볛뼞猷ヨ뼅猷띤븡恂볥5 111000011111010010100001111010111010001110101011111001111111010010001111111010111010001110110110111011001111001110001010111000111000001110010011111000101101111010010011111000101001011010100001111010111010001110101011111010001001011010001111111010111010001110110110111011011001010110001010111000101110000110010011111010111010001110110101 e1f4a1eba3abe7f48feba3b6ecf38ae38393e2de93e296a1eba3abe8968feba3b6ed958ae2e193eba3b5

SJIS-Win, EUC-JP: legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
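
The conversion shown in the table above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_table` and the mapping from the page's charset labels to Python codec names are assumptions (in particular, `cp932` stands in for SJIS-Win and `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the ISO-8859-1 row.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: CP932 as the SJIS-Win equivalent
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 as the UHC equivalent
}


def encode_table(text):
    """Return (charset, binary string, hex string) rows for the given text."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("受"):
        print(label, bits, hexstr)
```

For the single character 受, this sketch should give `3f` for ISO-8859-1, `8ef3` for SJIS-Win, `bcf5` for EUC-JP, and `e58f97` for UTF-8, which agrees with the leading bytes of the corresponding rows in the table.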