| Literature DB >> 21059677 |
Zexian Liu1, Jun Cao, Xinjiao Gao, Yanhong Zhou, Longping Wen, Xiangjiao Yang, Xuebiao Yao, Jian Ren, Yu Xue.
Abstract
As a reversible post-translational modification (PTM) discovered decades ago, protein lysine acetylation was known for its regulation of transcription through the modification of histones. Recent studies discovered that lysine acetylation targets broad substrates and especially plays an essential role in cellular metabolic regulation. Although acetylation is comparable with other major PTMs such as phosphorylation, an integrated resource still remains to be developed. In this work, we presented the compendium of protein lysine acetylation (CPLA) database for lysine acetylated substrates with their sites. From the scientific literature, we manually collected 7151 experimentally identified acetylation sites in 3311 targets. We statistically studied the regulatory roles of lysine acetylation by analyzing the Gene Ontology (GO) and InterPro annotations. Combined with protein-protein interaction information, we systematically discovered a potential human lysine acetylation network (HLAN) among histone acetyltransferases (HATs), substrates and histone deacetylases (HDACs). In particular, there are 1862 triplet relationships of HAT-substrate-HDAC retrieved from the HLAN, at least 13 of which were previously experimentally verified. The online services of CPLA database was implemented in PHP + MySQL + JavaScript, while the local packages were developed in JAVA 1.5 (J2SE 5.0). The CPLA database is freely available for all users at: http://cpla.biocuckoo.org.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21059677 PMCID: PMC3013790 DOI: 10.1093/nar/gkq939
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
The data statistics of lysine acetylated proteins in CPLA database
| Organism | Sub. | Siteb | Per.c (%) |
|---|---|---|---|
| 2585 | 5409 | 75.64 | |
| 190 | 260 | 3.64 | |
| 187 | 552 | 7.72 | |
| 149 | 259 | 3.62 | |
| 44 | 85 | 1.19 | |
| Others | 156 | 586 | 8.20 |
| Total | 3311 | 7151 | 100 |
aSub., number of substrates; bSite, number of acetylation sites; cPer., percentiles of acetylation sites.
Figure 1.The search option of CPLA 1.0 database. (A) Users could simply input ‘STAT3’ for querying. (B) The results will be shown in a tabular format. Users could click on the CPLA ID (CPLA-000025) to visualize the detailed information. (C) The detailed information of human STAT3. The PPI information was also presented.
The top five most enriched GO terms of biological processes, molecular functions and cellular components in human acetylome
| Description of GO term | Acetylome | Proteome | ||
|---|---|---|---|---|
| The top five most enriched biological processes | ||||
| Translational elongation (GO:0006414) | 69 (2.80) | 93 (0.52) | 5.44 | 3.35E-40 |
| Nucleosome assembly (GO:0006334) | 43 (1.75) | 79 (0.44) | 3.99 | 1.17E-17 |
| RNA splicing (GO:0008380) | 70 (2.84) | 188 (1.04) | 2.73 | 3.50E-16 |
| Interspecies interaction between organisms (GO:0044419) | 100 (4.06) | 333 (1.85) | 2.20 | 3.97E-15 |
| mRNA processing (GO:0006397) | 65 (2.64) | 193 (1.07) | 2.47 | 9.06E-13 |
| The top five most enriched molecular functions | ||||
| ATP binding (GO:0005524) | 398 (16.17) | 1473 (8.17) | 1.98 | 4.67E-46 |
| Protein binding (GO:0005515) | 798 (32.06) | 3996 (22.16) | 1.45 | 9.00E-35 |
| RNA binding (GO:0003723) | 186 (7.56) | 557 (3.09) | 2.45 | 9.49E-34 |
| Structural constituent of ribosome (GO:0003735) | 71 (2.89) | 155 (0.86) | 3.36 | 1.99E-22 |
| Nucleotide binding (GO:0000166) | 85 (3.45) | 255 (1.41) | 2.44 | 5.75E-16 |
| The top five most enriched cellular components | ||||
| Cytosol (GO:0005829) | 298 (12.11) | 1009 (5.60) | 2.16 | 4.71E-42 |
| Mitochondrial matrix (GO:0005759) | 83 (3.37) | 139 (0.77) | 4.38 | 6.33E-37 |
| Nucleoplasm (GO:0005654) | 124 (5.04) | 333 (1.85) | 2.73 | 1.12E-27 |
| Nucleolus (GO:0005730) | 133 (5.40) | 425 (2.36) | 2.29 | 1.84E-21 |
| Cytosolic small ribosomal subunit (GO:0022627) | 29 (1.18) | 36 (0.20) | 5.90 | 2.24E-19 |
aNum., number of proteins annotated; bPer. percentiles of proteins annotated; cE-ratio, enrichment ratio of acetylated proteins.
Figure 2.A core HLAN identified from experimentally identified PPI data. The HATs, substrates, and HDACs form a dense network. Green node: HAT; Blue node: HDAC; yellow node: HAT that is acetylated; purple node: HDAC that is acetylated; pink node: substrate.