| Literature DB >> 29112716 |
Tony E Lewis1, Ian Sillitoe1, Natalie Dawson1, Su Datt Lam1,2, Tristan Clarke1, David Lee3, Christine Orengo1, Jonathan Lees1,4.
Abstract
Gene3D (http://gene3d.biochem.ucl.ac.uk) is a database of globular domain annotations for millions of available protein sequences. Gene3D has previously featured in the Database issue of NAR and here we report a significant update to the Gene3D database. The current release, Gene3D v16, has significantly expanded its domain coverage over the previous version and now contains over 95 million domain assignments. We also report a new method for dealing with complex domain architectures that exist in Gene3D, arising from discontinuous domains. Amongst other updates, we have added visualization tools for exploring domain annotations in the context of other sequence features and in gene families. We also provide web-pages to visualize other domain families that co-occur with a given query domain family.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29112716 PMCID: PMC5753370 DOI: 10.1093/nar/gkx1069
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Gene3D domains are now displayed along with other sequence annotations using the ProtVista tool.
Figure 2.The Gene3D website displaying protein domain architectures in a gene tree, using the PhyD3 JavaScript library. (A) A query with an elephant P53 gene, zoomed in on the area of recurrent P53 duplication, shows multiple paralogs with similar architectures. P63 genes with altered domain architectures can be seen in the lower half of the figure as part of the same gene tree. (B) A Gene3D search with mouse Trpt1, shows it has a non-significant hit to domain superfamily 3.20.170.30 (light blue rectangles) whilst other members of the gene family have a significant hit to this same domain family (darker blue–green rectangles).
Figure 3.(A) The most frequent Domain Architectures that the SH2 domain family is found in amongst the Ensembl Genomes. (B) Gene3D domain family co-occurrence page, showing domain families that co-occur with the SH2 domain family in selected Eukaryotic and Metazoan genomes. The first two columns (surrounded by a blue box) correspond to two Choanoflagellate genomes. Darker red entries indicate a greater number of proteins in which the SH2 domain family co-occurs with a specific genome/domain family pair. The number of domain families displayed has been limited to the most populous 100 domain families for the selected genomes. Mousing over a point in the matrix gives popups with further detailed information.