| Literature DB >> 25348407 |
Tony E Lewis1, Ian Sillitoe2, Antonina Andreeva3, Tom L Blundell4, Daniel W A Buchan5, Cyrus Chothia3, Domenico Cozzetto5, José M Dana6, Ioannis Filippis7, Julian Gough8, David T Jones9, Lawrence A Kelley7, Gerard J Kleywegt6, Federico Minneci5, Jaina Mistry6, Alexey G Murzin3, Bernardo Ochoa-Montaño4, Matt E Oates8, Marco Punta6, Owen J L Rackham10, Jonathan Stahlhacke8, Michael J E Sternberg7, Sameer Velankar6, Christine Orengo1.
Abstract
Genome3D (http://www.genome3d.eu) is a collaborative resource that provides predicted domain annotations and structural models for key sequences. Since introducing Genome3D in a previous NAR paper, we have substantially extended and improved the resource. We have annotated representatives from Pfam families to improve coverage of diverse sequences and added a fast sequence search to the website to allow users to find Genome3D-annotated sequences similar to their own. We have improved and extended the Genome3D data, enlarging the source data set from three model organisms to 10, and adding VIVACE, a resource new to Genome3D. We have analysed and updated Genome3D's SCOP/CATH mapping. Finally, we have improved the superposition tools, which now give users a more powerful interface for investigating similarities and differences between structural models.Entities:
Mesh:
Year: 2014 PMID: 25348407 PMCID: PMC4384030 DOI: 10.1093/nar/gku973
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Summary of the Genome3D prediction resources and the structural domain classifications, on which they are based
| Resource | Principal investigator | Affiliation | Type | Classification source |
|---|---|---|---|---|
| DomSerf ( | Jones | UCL | Prediction: Models | CATH |
| FUGUE ( | Blundell | Cambridge | Prediction: Annotations | CATH/SCOP |
| Gene3D ( | Orengo | UCL | Prediction: Annotations | CATH |
| pDomTHREADER ( | Jones | UCL | Prediction: Annotations | CATH |
| PHYRE2 ( | Sternberg/Kelley | Imperial | Prediction: Both | SCOP/PDB |
| SUPERFAMILY ( | Gough | Bristol | Prediction: Both | SCOP |
| VIVACE | Blundell | Cambridge | Prediction: Models | CATH/SCOP |
| CATH ( | Orengo | UCL | Classification | N/A |
| SCOP ( | Murzin | MRC-LMB | Classification | N/A |
Figure 1.Coverage of UniProt sequences in each genome by the number of resources providing at least one domain annotation for the sequence. ‘Pfam (single domain)’ denotes the new data set of sequences representing Pfam families and ‘Pfam (architecture)’ denotes the extra data set of sequences representing the Pfam multi-domain architectures, as discussed in the main text.
Figure 2.Coverage of UniProt sequences in each genome by the number of resources providing at least one structural model for the sequence. ‘Pfam (single domain)’ denotes the new data set of sequences representing Pfam families, as discussed in the main text.
Figure 3.PyMOL downloads of Genome3D superpositions now include black alignment lines that connect the locations in which the different structural models place the same residue from the source sequence. These images show a superposition of four resources’ structural models of OR5AL1 (UniProt accession: P0C617) with the alignment lines displayed. In (b), the models predict two structurally similar helices but differ about where to locate the source sequence's residues on the yellow helix at the bottom.