Modeling a 2200 residue protein

This is a copy of the original post that was made by ITASSERteam at Mon, 06/22/2015 - 12:53. If this can be a problem, I will delete it.

----------

This is a typical problem of modeling multiple-domain proteins. I recommend the following procedure:

1. split sequence into domains and submit the sequence of individual domains to I-TASSER, like what you have done.
2. submit the whole-chain sequence to I-TASSER to get the whole-chain I-TASSER model. In case that the whole-chain sequence is beyond 1500, you can delete the N- and C-terminal sequence to make it below 1500 AA. But you should have the sequence include at least part of the sequence from each domain.
3. superpose the individual domain models to the whole-chain I-TASSER model to construct a new whole-chain structural model. You need to reorder the residue number of the single-domain models so that you can superpose the models appropriately to whole-chain model using TM-score program.
4. submit the new whole-chain model to FG-MD server to refine the model and remove steric clashes etc. Thus, you will a final full-length model that has all domain modeled appropriately.

This procedure can be quite complicated and tedious to beginning users. But this is probably the best way to model multi-domain proteins. We plan to make this procedure automated in the I-TASSER server in future.

The Zhang Lab Message Boards

Navigation

User login

Forum Search

a solution