To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????Æ???????????Æ^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111111100011000111111001111110011111100111111001111110011111100111111001111110011111100111111001111111100011001011110 3f3f3f3f3f3f3f3f3f3f3fc63f3f3f3f3f3f3f3f3f3f3fc65e
SJIS-WIN ???嚴ф????嚴ф????嚴ф????嚴ф?^ 001111110011111100111111100110101000111010000100100001100011111100111111001111110011111110011010100011101000010010000110001111110011111100111111001111111001101010001110100001001000011000111111001111110011111100111111100110101000111010000100100001100011111101011110 3f3f3f9a8e84863f3f3f3f9a8e84863f3f3f3f9a8e84863f3f3f3f9a8e84863f5e
EUC-JP ???嚴ф????嚴фÆ???嚴ф????嚴фÆ^ 00111111001111110011111111010011111011101010011111100110001111110011111100111111001111111101001111101110101001111110011010001111101010011010000100111111001111110011111111010011111011101010011111100110001111110011111100111111001111111101001111101110101001111110011010001111101010011010000101011110 3f3f3fd3eea7e63f3f3f3fd3eea7e68fa9a13f3f3fd3eea7e63f3f3f3fd3eea7e68fa9a15e
UTF-8 廬쎈젙嚴ф쭇溜묐젙嚴фÆ廬쎈젙嚴ф쭇溜묐젙嚴фÆ^ 11101111101001101000001011101100100011101000100011101100101000001001100111100101100110101011010011010001100001001110110010101101100001111110111110100111100010111110101110101100100100001110110010100000100110011110010110011010101101001101000110000100110000111000011011101111101001101000001011101100100011101000100011101100101000001001100111100101100110101011010011010001100001001110110010101101100001111110111110100111100010111110101110101100100100001110110010100000100110011110010110011010101101001101000110000100110000111000011001011110 efa682ec8e88eca099e59ab4d184ecad87efa78bebac90eca099e59ab4d184c386efa682ec8e88eca099e59ab4d184ecad87efa78bebac90eca099e59ab4d184c3865e
UHC 廬쎈젙嚴ф쭇溜묐젙嚴фÆ廬쎈젙嚴ф쭇溜묐젙嚴фÆ^ 11100101111111101011110111101011101000001001010111100101111100011010110011100110101001111000001111101010111111101001000111101011101000001001010111100101111100011010110011100110101010001010000111100101111111101011110111101011101000001001010111100101111100011010110011100110101001111000001111101010111111101001000111101011101000001001010111100101111100011010110011100110101010001010000101011110 e5febdeba095e5f1ace6a783eafe91eba095e5f1ace6a8a1e5febdeba095e5f1ace6a783eafe91eba095e5f1ace6a8a15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)