To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 罌?????應??D罌?????應??D^ 11100011101000000011111100111111001111110011111100111111100111001110010000111111001111110100010011100011101000000011111100111111001111110011111100111111100111001110010000111111001111110100010001011110 e3a03f3f3f3f3f9ce43f3f44e3a03f3f3f3f3f9ce43f3f445e
EUC-JP 罌?????應??D罌?????應??D^ 11100110101000100011111100111111001111110011111100111111110110001110011000111111001111110100010011100110101000100011111100111111001111110011111100111111110110001110011000111111001111110100010001011110 e6a23f3f3f3f3fd8e63f3f44e6a23f3f3f3f3fd8e63f3f445e
UTF-8 罌산퀎六쀥♤應꿔뀅D罌산퀎六쀥♤應꿔뀅D^ 111001111011110110001100111011001000001010110000111011011000000010001110111011111010011110010001111011001000000010100101111000101001100110100100111001101000011110001001111010101011111110010100111010111000000010000101010001001110011110111101100011001110110010000010101100001110110110000000100011101110111110100111100100011110110010000000101001011110001010011001101001001110011010000111100010011110101010111111100101001110101110000000100001010100010001011110 e7bd8cec82b0ed808eefa791ec80a5e299a4e68789eabf94eb808544e7bd8cec82b0ed808eefa791ec80a5e299a4e68789eabf94eb8085445e
UHC 罌산퀎六쀥♤應꿔뀅D罌산퀎六쀥♤應꿔뀅D^ 111001011010001010111011111010101011001110000100111010111011101110010111111001011010001010111011111010111110101110110010111000111000010110000001010001001110010110100010101110111110101010110011100001001110101110111011100101111110010110100010101110111110101111101011101100101110001110000101100000010100010001011110 e5a2bbeab384ebbb97e5a2bbebebb2e3858144e5a2bbeab384ebbb97e5a2bbebebb2e38581445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)