Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 悟??韋??碎??D悟??韋??碎??D^ 100011001110010100111111001111111110100011101000001111110011111111100001111010100011111100111111010001001000110011100101001111110011111111101000111010000011111100111111111000011110101000111111001111110100010001011110 8ce53f3fe8e83f3fe1ea3f3f448ce53f3fe8e83f3fe1ea3f3f445e
EUC-JP 悟??韋??碎??D悟??韋??碎??D^ 101110001110011100111111001111111111000011101010001111110011111111100010111011000011111100111111010001001011100011100111001111110011111111110000111010100011111100111111111000101110110000111111001111110100010001011110 b8e73f3ff0ea3f3fe2ec3f3f44b8e73f3ff0ea3f3fe2ec3f3f445e
UTF-8 悟귘뫃韋곮쁻碎㏐텞D悟귘뫃韋곮쁻碎㏐텞D^ 111001101000001010011111111010101011011110011000111010111010101110000011111010011001111110001011111010101011001110101110111011001000000110111011111001111010001010001110111000111000111110010000111011011000010110011110010001001110011010000010100111111110101010110111100110001110101110101011100000111110100110011111100010111110101010110011101011101110110010000001101110111110011110100010100011101110001110001111100100001110110110000101100111100100010001011110 e6829feab798ebab83e99f8beab3aeec81bbe7a28ee38f90ed859e44e6829feab798ebab83e99f8beab3aeec81bbe7a28ee38f90ed859e445e
UHC 悟귘뫃韋곮쁻碎㏐텞D悟귘뫃韋곮쁻碎㏐텞D^ 111001111111011010000010111000101001000110100111111010101101111110000001111010001001100010000010111000011110111110100111111010101011011010010101010001001110011111110110100000101110001010010001101001111110101011011111100000011110100010011000100000101110000111101111101001111110101010110110100101010100010001011110 e7f682e291a7eadf81e89882e1efa7eab69544e7f682e291a7eadf81e89882e1efa7eab695445e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hex 3f) in the table above.
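
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code this page runs: the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, and `encode_table` is a hypothetical helper name. Unencodable characters are replaced with "?" to mimic the 3f bytes seen above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note says
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 codec stands in for UHC
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("悟D^"):
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.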