To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猷??恂??猷??松?6壬??訟?6椅?+ 10010111010100010011111100111111100111001001011000111111001111111001011101010001001111110011111110001111101111000011111110000010010101011001000001110000001111110011111110001111110101110011111110000010010101011000100011010110001111111000000101111011 97513f3f9c963f3f97513f3f8fbc3f825590703f3f8fd73f825588d63f817b
EUC-JP 猷??恂??猷??松?6壬??訟?6椅?+ 11001101101100100011111100111111110101111111011000111111001111111100110110110010001111110011111110111110101111100011111110100011101101101011111111010001001111110011111110111110110110010011111110100011101101101011000011011000001111111010000111011100 cdb23f3fd7f63f3fcdb23f3fbebe3fa3b6bfd13f3fbed93fa3b6b0d83fa1dc
UTF-8 猷띤븡恂볛뒄猷띠뿄松듬6壬듽닗訟귣6椅뚮+ 111001111000110010110111111010111001110110100100111010111011100010100001111001101000000110000010111010111011001110011011111010111001001010000100111001111000110010110111111010111001110110100000111010111011111110000100111001101001110110111110111010111001001110101100111011111011110010010110111001011010001110101100111010111001001110111101111010111000101110010111111010001010100010011111111010101011011110100011111011111011110010010110111001101010010010000101111010111001101010101110111011111011110010001011 e78cb7eb9da4ebb8a1e68182ebb39beb9284e78cb7eb9da0ebbf84e69dbeeb93acefbc96e5a3aceb93bdeb8b97e8a89feab7a3efbc96e6a485eb9aaeefbc8b
UHC 猷띤븡恂볛뒄猷띠뿄松듬6壬듽닗訟귣6椅뚮+ 111010111010001110110110111011011001010110001010111000101110000110010011111000101000101010000010111010111010001110110110111011001001011110001100111000011110011010110101111010111010001110110110111011001111001110001010111000111000100010011011111000011110100010000010111010111010001110110110111010111111010110001100111010111010001110101011 eba3b6ed958ae2e193e28a82eba3b6ec978ce1e6b5eba3b6ecf38ae3889be1e882eba3b6ebf58ceba3ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)