To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 猷??窈?5徇??B 1001011101010001001111110011111111100010011101110011111110000010010101001001110001101101001111110011111101000010 97513f3fe2773f82549c6d3f3f42
EUC-JP 猷??窈?5徇??B 1100110110110010001111110011111111100011110110000011111110100011101101011101011111001110001111110011111101000010 cdb23f3fe3d83fa3b5d7ce3f3f42
UTF-8 猷듐걖窈뚮5徇곲죱B 11100111100011001011011111101011100100111001000011101010101100011001011011100111101010101000100011101011100110101010111011101111101111001001010111100101101111101000011111101010101100111011001011101100101000111011000101000010 e78cb7eb9390eab196e7aa88eb9aaeefbc95e5be87eab3b2eca3b142
UHC 猷듐걖窈뚮5徇곲죱B 11101011101000111011010111100011100000011000000111101001101000011000110011101011101000111011010111100010110111111000000111101001101000011000110001000010 eba3b5e38181e9a18ceba3b5e2df81e9a18c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)