To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 疾鹿シムセチ耳シ」疾汐シー疾汐シエ疾辞シナ 100011101011111010001110101011011011110011010001101111101100000110001110101010001011110010100011100011101011111010001110101011001011110010110000100011101011111010001110101011001011110010110100100011101011111010001110101010111011110011000101 8ebe8eadbcd1bec18ea8bca38ebe8eacbcb08ebe8eacbcb48ebe8eabbcc5
EUC-JP 疾鹿シムセチ耳シ」疾汐シー疾汐シエ疾辞シナ 101111001100000010111100101011111000111010111100100011101101000110001110101111101000111011000001101111001010101010001110101111001000111010100011101111001100000010111100101011101000111010111100100011101011000010111100110000001011110010101110100011101011110010001110101101001011110011000000101111001010110110001110101111001000111011000101 bcc0bcaf8ebc8ed18ebe8ec1bcaa8ebc8ea3bcc0bcae8ebc8eb0bcc0bcae8ebc8eb4bcc0bcad8ebc8ec5
UTF-8 疾鹿シムセチ耳シ」疾汐シー疾汐シエ疾辞シナ 111001111001011010111110111010011011100110111111111011111011110110111100111011111011111010010001111011111011110110111110111011111011111010000001111010001000000010110011111011111011110110111100111011111011110110100011111001111001011010111110111001101011000110010000111011111011110110111100111011111011110110110000111001111001011010111110111001101011000110010000111011111011110110111100111011111011110110110100111001111001011010111110111010001011111010011110111011111011110110111100111011111011111010000101 e796bee9b9bfefbdbcefbe91efbdbeefbe81e880b3efbdbcefbda3e796bee6b190efbdbcefbdb0e796bee6b190efbdbcefbdb4e796bee8be9eefbdbcefbe85
UHC 疾鹿????耳??疾汐??疾汐??疾??? 1111001011110000110101101110001100111111001111110011111100111111111011001011110000111111001111111111001011110000111000001011000100111111001111111111001011110000111000001011000100111111001111111111001011110000001111110011111100111111 f2f0d6e33f3f3f3fecbc3f3ff2f0e0b13f3ff2f0e0b13f3ff2f03f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)