| Literature DB >> 25348408 |
Ian Sillitoe1, Tony E Lewis2, Alison Cuff2, Sayoni Das2, Paul Ashford2, Natalie L Dawson2, Nicholas Furnham3, Roman A Laskowski4, David Lee2, Jonathan G Lees2, Sonja Lehtinen2, Romain A Studer4, Janet Thornton4, Christine A Orengo2.
Abstract
The latest version of the CATH-Gene3D protein structure classification database (4.0, http://www.cathdb.info) provides annotations for over 235,000 protein domain structures and includes 25 million domain predictions. This article provides an update on the major developments in the 2 years since the last publication in this journal including: significant improvements to the predictive power of our functional families (FunFams); the release of our 'current' putative domain assignments (CATH-B); a new, strictly non-redundant data set of CATH domains suitable for homology benchmarking experiments (CATH-40) and a number of improvements to the web pages.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25348408 PMCID: PMC4384018 DOI: 10.1093/nar/gku947
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1.Comparison of EC diversity against the number of FunFams for superfamilies in CATH v3.5 and v4.0.
Figure 2.Selected TPP-dependent enzyme superfamily (CATH 3.40.50.970) domains classified into different FunFams in CATH v4.0 using the improved FunFHMMer algorithm. The previous version of CATH (v3.5) clustered all members of this superfamily into one, large FunFam. The catalytic site residues derived from the CSA (Catalytic Site Atlas) are highlighted in red and shown as sticks.
Figure 3.Superposition of 15 representative domains from the Homing endonucleases superfamily (CATH: 3.10.28.10).
Figure 4.New tools allow users to browse the CATH structural hierarchy using dynamically generated views: ‘Tree’ and ‘Sunburst’.