To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 疾痔シウ疾辞シイシ」シ燻セ汐シー 10001110101111101000111010100100101111001011001110001110101111101000111010101011101111001011001010111100101000111011110011100000100011101011111010001110101011001011110010110000 8ebe8ea4bcb38ebe8eabbcb2bca3bce08ebe8eacbcb0
EUC-JP 疾痔シウ疾辞シイシ」シ燻セ汐シー 1011110011000000101111001010011010001110101111001000111010110011101111001100000010111100101011011000111010111100100011101011001010001110101111001000111010100011100011101011110011011111111011101000111010111110101111001010111010001110101111001000111010110000 bcc0bca68ebc8eb3bcc0bcad8ebc8eb28ebc8ea38ebcdfee8ebebcae8ebc8eb0
UTF-8 疾痔シウ疾辞シイシ」シ燻セ汐シー 111001111001011010111110111001111001011110010100111011111011110110111100111011111011110110110011111001111001011010111110111010001011111010011110111011111011110110111100111011111011110110110010111011111011110110111100111011111011110110100011111011111011110110111100111001111000011110111011111011111011110110111110111001101011000110010000111011111011110110111100111011111011110110110000 e796bee79794efbdbcefbdb3e796bee8be9eefbdbcefbdb2efbdbcefbda3efbdbce787bbefbdbee6b190efbdbcefbdb0
UHC 疾痔??疾??????燻?汐?? 111100101111000011110110110000000011111100111111111100101111000000111111001111110011111100111111001111110011111111111101101110000011111111100000101100010011111100111111 f2f0f6c03f3ff2f03f3f3f3f3f3ffdb83fe0b13f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)