| Literature DB >> 27924015 |
Julia Varga1, László Dobson2, István Reményi2, Gábor E Tusnády3.
Abstract
The TSTMP database is designed to help the target selection of human transmembrane proteins for structural genomics projects and structure modeling studies. Currently, there are only 60 known 3D structures among the polytopic human transmembrane proteins and about a further 600 could be modeled using existing structures. Although there are a great number of human transmembrane protein structures left to be determined, surprisingly only a small fraction of these proteins have 'selected' (or above) status according to the current version the TargetDB/TargetTrack database. This figure is even worse regarding those transmembrane proteins that would contribute the most to the structural coverage of the human transmembrane proteome. The database was built by sorting out proteins from the human transmembrane proteome with known structure and searching for suitable model structures for the remaining proteins by combining the results of a state-of-the-art transmembrane specific fold recognition algorithm and a sequence similarity search algorithm. Proteins were searched for homologues among the human transmembrane proteins in order to select targets whose successful structure determination would lead to the best structural coverage of the human transmembrane proteome. The pipeline constructed for creating the TSTMP database guarantees to keep the database up-to-date. The database is available at http://tstmp.enzim.ttk.mta.hu.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27924015 PMCID: PMC5210638 DOI: 10.1093/nar/gkw939
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The growth of polytopic proteins having solved structure (red) or those that could be modeled using existing structures (orange) until 2016. The former curve was extrapolated until 2020 (red, dotted). The number of modelable proteins was estimated for cases when the target structures were chosen randomly (orange dotted) or according to the order of ‘The Most Wanted’ set of TSTMP database (orange continuous).
Figure 2.Comparison of the results of various target selection protocols. Proteins having ‘3D’/’modelable’ evidence are inside the Venn diagram (38). Outside the circles, the number of proteins marked as targets by all three methods is shown.