To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????q?????????qB 001111110011111100111111001111110011111100111111001111110011111100111111011100010011111100111111001111110011111100111111001111110011111100111111001111110111000101000010 3f3f3f3f3f3f3f3f3f713f3f3f3f3f3f3f3f3f7142
SJIS-WIN 猷??意?5壬??q猷??意?5壬??qB 1001011101010001001111110011111110001000110100110011111110000010010101001001000001110000001111110011111101110001100101110101000100111111001111111000100011010011001111111000001001010100100100000111000000111111001111110111000101000010 97513f3f88d33f825490703f3f7197513f3f88d33f825490703f3f7142
EUC-JP 猷??意?5壬??q猷??意?5壬??qB 1100110110110010001111110011111110110000110101010011111110100011101101011011111111010001001111110011111101110001110011011011001000111111001111111011000011010101001111111010001110110101101111111101000100111111001111110111000101000010 cdb23f3fb0d53fa3b5bfd13f3f71cdb23f3fb0d53fa3b5bfd13f3f7142
UTF-8 猷뜻톷意밸5壬듽굜q猷뜻톷意밸5壬듽굜qB 111001111000110010110111111010111001110010111011111011011000011010110111111001101000010010001111111010111011000010111000111011111011110010010101111001011010001110101100111010111001001110111101111010101011010110011100011100011110011110001100101101111110101110011100101110111110110110000110101101111110011010000100100011111110101110110000101110001110111110111100100101011110010110100011101011001110101110010011101111011110101010110101100111000111000101000010 e78cb7eb9cbbed86b7e6848febb0b8efbc95e5a3aceb93bdeab59c71e78cb7eb9cbbed86b7e6848febb0b8efbc95e5a3aceb93bdeab59c7142
UHC 猷뜻톷意밸5壬듽굜q猷뜻톷意밸5壬듽굜qB 111010111010001110110110111001101011011110001011111010111111001010111001111010111010001110110101111011001111001110001010111000111000001010000100011100011110101110100011101101101110011010110111100010111110101111110010101110011110101110100011101101011110110011110011100010101110001110000010100001000111000101000010 eba3b6e6b78bebf2b9eba3b5ecf38ae3828471eba3b6e6b78bebf2b9eba3b5ecf38ae382847142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)