About DEMO2

Online Services

●I-TASSER ●QUARK ●LOMETS ●COACH ●COFACTOR ●MetaGO ●MUSTER ●CEthreader ●SEGMER ●FG-MD ●ModRefiner ●REMO ●DEMO ●SPRING ●COTH ●BSpred ●ANGLOR ●EDock ●BSP-SLIM ●SAXSTER ●FUpred ●ThreaDom ●ThreaDomEx ●EvoDesign ●GPCR-I-TASSER ●MAGELLAN ●BindProf ●BindProfX ●SSIPe ●ResQ ●IonCom ●STRUM ●DAMpred

●TM-score ●TM-align ●MM-align ●RNA-align ●NW-align ●LS-align ●EDTSurf ●MVP ●MVP-Fit ●SPICKER ●HAAD ●PSSpred ●3DRobot ●MR-REX ●I-TASSER-MR ●SVMSEQ ●NeBcon ●ResPRE ●WDL-RF ●ATPbind ●DockRMSD ●DeepMSA ●FASPR ●EM-Refiner

●BioLiP ●E. coli ●GLASS ●GPCR-HGmod ●GPCR-RD ●GPCR-EXP ●Tara-3D ●TM-fold ●DECOYS ●POTENTIAL ●RW/RWplus ●EvoEF ●HPSF ●THE-DB ●ADDRESS ●Alpaca-Antibody ●CASP7 ●CASP8 ●CASP9 ●CASP10 ●CASP11 ●CASP12 ●CASP13 ●CASP14

[Back to DEMO2 homepage]

About DEMO2 Pipeline

What is DEMO2?

How does DEMO2 assemble multidomain protein structures?

TM-align

DeepPotential

In the second step, L-BFGS simulation is used to assemble the domain structruces under the guidence of structurally analogous templates, the inter-domain spatial restraints predicted by DeepPotential, and the knowledge-based inter-domain potentials.

In the last step, the model with lowest energy is selected for the linker reconstruction and further refined with fragment-guided molecule dynamics (FG-MD) simulations.

Figure 1. Pipeline of DEMO2 for multidomain protein structruce assembly.

What are the performances of DEMO2 server compared with other methods?

We further compared the full-length model assembled by DEMO2 using independently generated domain models by D-I-TASSER with the full-length models directly created by the trRosetta. The DEMO2 models have an average TM-score of 0.70 and the global fold is correct, with 83% cases with a TM-score >0.5. This compares favorably with the full-length models built directly by trRosetta which has an average TM-score of 0.64 but with only 70% cases with a TM-score >0.5 (Fig. 2b).

CASP (or Critical Assessment of Techniques for Protein Structure Prediction) is a community-wide experiment for testing the state-of-the-art of protein structure predictions which takes place every two years since 1994. The experiment (often referred as a competition) is strictly blind because the structures of testing proteins are unknown to the predictors. We have used DEMO2 (as ‘Zhang-Server’) to assemble all multidomain targets in the latest CASP14. Fig. 2c shows the comparisons between DEMO2 and other top 4 servers for multidomain protein structure prediction in CASP14, in which we sorted the servers according to the average GDT-score of the full-length models for all multidomain proteins with ≥ 1 template-free modeling (FM) or template-free modeling/template-based modeling (FM/TBM) domain. As shown in the figure, the performance of DEMO2 on the full-length model of multidomain proteins is better than other servers.

Figure 2. Performance of DEMO2 on the 356 benchmark proteins and CASP14 targets. (a) Comparion of DEMO2 with DEMO and AIDA on the performance of full-length models assembled using D-I-TASSER predicted domain models. (b) TM-scores of models assembled by DEMO2 vs. models directly generated by whole-chain trRosetta prediction. (c) Comparison between DEMO2 (Zhang-Server) with the other top 4 servers in CASP14 on the full-length multidomain models in terms of the global distance test (GDT) score, where the servers were sorted according to the GDT score of the full-length models for multidomain proteins with ≥ 1 FM or FM/TBM domain.

What are the input of the DEMO2 server?

Mandatory:

At least 2 domain models in PDB format
Users can click the button "Add domain" to add text boxes for input more domain models. The server currently can assemble up to 5 domains. Users can interactively assemble the model or download the standalone package to run DEMO2 locally if they have >5 domain models.

Optional:

Full-chain sequence in the standard FASTA format
Email address for receiving the results
Name of the query protein
Templates in PDB format to guide the domain assembly
Selection for removing templates sharing >30% sequence identity with target
Experimental data including cross-linking and cryo-EM density map to guide the assembly

What are the output of the DEMO2 server?

The output of the DEMO2 server include:

Up to five full-length atomic models (ranked based on the energy)
Estimated accuracy of the predicted models (including a confidence score of all models, and predicted TM-score and RMSD for the first model)
User provided domain models
Top 10 full-length templates for domain assembly
Predicted distance and inferface maps for domain assembly
An illustrative example of the DEMO2 output can be seen from here.

How to interpret the output data generated by the DEMO2 server?

an example of DEMO2 output

What is the 'top 5 models assembled by DEMO2'?
For each target, DEMO2 reports up to five full-length models ranked by the total energy. It is possible that the lower-rank models have a higher C-score. Although the first model has a higher C-score and a better quality in most cases, it is not unusual that the lower-rank models have a better quality than the higher-rank models.
What are the "top 10 full-length templates for domain assembly"?
DEMO2 identifies the analogous full-length templates from a non-redundant multidomain protein library using TM-align structural alignments. All domain models are aligned to each template of the library by TM-align, and the harmonic mean TM-score of all domains is defined as the score (TplScore) of a template. The top 10 templates with the highest score are selected to generate the initial full-length model and deduce the inter-domain distance restraints to guide the domain assembly.
What is C-score?
C-score is a confidence score for estimating the quality of predicted models by DEMO2. It is calculated based on the convergence parameters of the domain assembly simulations, the quality of the full-length templates for domain assembly, the satisfaction degree of the inter-domain distances, and the estimated accuracy of the individual domain model. C-score is typically in the range of [-5,2], where a C-score of higher value signifies a model with a high confidence and vice-versa.
What is TM-score?
TM-score is a metric for measuring the structural similarity between two structures (see Zhang and Skolnick, Scoring function for automated assessment of protein structure template quality, Proteins, 2004 57: 702-710). The purpose of proposing TM-score is to solve the problem of RMSD which is sensitive to the local error. Because RMSD is an average distance of all residue pairs in two structures, a local error (e.g. a misorientation of the tail) will arise a big RMSD value although the global topology is correct. In TM-score, however, the small distance is weighted stronger than the big distance which makes the score insensitive to the local modeling error. A TM-score >0.5 indicates a model of correct topology and a TM-score <0.17 means a random similarity. These cutoff does not depends on the protein length.
Here the 'Estimated TM-score' is an estimated value of TM-score over the correlation between TM-score and C-score which is observed by a nonredundant training set.
What are distance and interface maps?
Distance map shows the the probability that inter-residue distances fall within 36 equal-width bins from [2, 20] Å, as well as two additional bins with distances <2 Å and >20 Å. The domain-domain interface map is extracted from the predicted distances by the summation of the cumulative probability of distances <18 Å. In the distance map, the first and second columns are the residue indexes which start from 1. Starting from the third column, the value is the probability that the distance located in the bin [0, 2], [2, 2.5], [2.5, 3],..., [20, ∞], respectively. Similar to the distance map, the first and second columns in the interface map are the residue indexes, and the third column is the probability of the distance <18 Å.

How to use known information (e.g. full-length templates, experimental data) to improve DEMO2 assembly?

The DEMO2 server currently accepts the following information:

Up to 20 full-length templates in PDB format
Experimental cross-linking data
Inter-domain contact/inteface restraints
Cryo-EM density map

How long does it take for DEMO2 to generate the final models for your protein?

How to cite DEMO2

Xiaogen Zhou, Chunxiang Peng, Wei Zheng,Yang Li, Guijun Zhang, Yang Zhang. DEMO2: Multi-domain protein structures assembly by coupling quaternary structural alignment with deep-learning inter-domain restraint prediction, to be submitted.
Xiaogen Zhou, Jun Hu, Chengxin Zhang, Guijun Zhang, Yang Zhang. Assembling multidomain protein structures through analogous global structural alignments. Proceedings of the National Academy of Sciences, 116: 15930-15938 (2019).

Funding support

Contact information

zhanglab

zhanggroup.org | +65-6601-1241 | Computing 1, 13 Computing Drive, Singapore 117417