| Literature DB >> 28025345 |
Marc Feuermann1, Pascale Gaudet2, Huaiyu Mi3, Suzanna E Lewis4, Paul D Thomas3.
Abstract
We previously reported a paradigm for large-scale phylogenomic analysis of gene families that takes advantage of the large corpus of experimentally supported Gene Ontology (GO) annotations. This 'GO Phylogenetic Annotation' approach integrates GO annotations from evolutionarily related genes across ∼100 different organisms in the context of a gene family tree, in which curators build an explicit model of the evolution of gene functions. GO Phylogenetic Annotation models the gain and loss of functions in a gene family tree, which is used to infer the functions of uncharacterized (or incompletely characterized) gene products, even for human proteins that are relatively well studied. Here, we report our results from applying this paradigm to two well-characterized cellular processes, apoptosis and autophagy. This revealed several important observations with respect to GO annotations and how they can be used for function inference. Notably, we applied only a small fraction of the experimentally supported GO annotations to infer function in other family members. The majority of other annotations describe indirect effects, phenotypes or results from high throughput experiments. In addition, we show here how feedback from phylogenetic annotation leads to significant improvements in the PANTHER trees, the GO annotations and GO itself. Thus GO phylogenetic annotation both increases the quantity and improves the accuracy of the GO annotations provided to the research community. We expect these phylogenetically based annotations to be of broad use in gene enrichment analysis as well as other applications of GO annotations.Database URL: http://amigo.geneontology.org/amigo.Entities:
Mesh:
Year: 2016 PMID: 28025345 PMCID: PMC5199145 DOI: 10.1093/database/baw155
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Representation of apoptosis in the Gene Ontology. The main steps (or GO processes) are indicated in each box, and the proteins mediating these processes are shown: Initiator caspases (pale blue), executor caspases (red), topoisomerases (turquoise), protein kinases (grey), phospholipid scramblases (yellow), and nucleases (purple).
Annotation statistics for proteins with members participating in the execution of apoptosis
| Protein families | Experimental annotations | Tree annotations | Functions | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| PTHR family ID | Representative human proteins | Protein count | Proteins | EXP | Terms | Annotated Proteins | IBD | IBA | Terms | |
| PTHR13067 | DFFB | 22 | 3 | 16 | 10 | 19 | 3 | 53 | 3 | Catalytic subunit of DNA fragmentation factor |
| PTHR11371 | DNASE1 | 93 | 8 | 40 | 17 | 92 | 4 | 268 | 4 | Deoxyribonuclease |
| PTHR10858 | DNASE2 | 95 | 7 | 25 | 11 | 94 | 3 | 276 | 3 | Deoxyribonuclease |
| PTHR13966 | ENDOG | 116 | 9 | 53 | 27 | 112 | 5 | 332 | 4 | Endonuclease-related protein |
| PTHR23248 | PLSCR1 | 195 | 13 | 135 | 45 | 193 | 5 | 576 | 5 | Phospholipid scramblase |
| PTHR10169 | TOP2A, TOP2B | 212 | 24 | 267 | 108 | 211 | 12 | 1,658 | 12 | DNA topoisomerase/gyrase; participates in chromosome condensation |
| PTHR10454 | Caspases | 418 | 50 | 1135 | 255 | 390 | 23 | 1576 | 22 | Cysteinyl-aspartate-cleaving protease |
| PTHR31773 | Metacaspases (MCA1 in fungi, MC1 in plants) | 106 | 5 | 19 | 11 | 67 | 5 | 328 | 5 | Cysteinyl proteases |
| PTHR31810 | Metacaspases (MC4-MC9 in plants) | 44 | 4 | 28 | 17 | 16 | 3 | 49 | 3 | Cysteinyl proteases |
| PTHR24361 | STK24, TAOK1, PAK2 | 1441 | 163 | 2 227 | 539 | 1441 | 44 | 11 407 | 34 | Ser/Thr kinases (PAKs and MAPKs) STE family (yeast STE20-related) |
| PTHR32129 | XKR8 | 112 | 2 | 7 | 3 | 16 | 2 | 32 | 2 | Phospholipid scramblase |
| PTHR16024 | Ced-8 ( | 43 | 1 | 12 | 7 | 42 | 2 | 84 | 2 | Phospholipid scramblase |
| Total | 2897 | 289 | 3964 | 1050 | 2693 | 111 | 16 639 | 99 | ||
The Protein families section defines the protein families annotated for apoptosis, including: (i) the PANTHER family ID; (ii) a representative protein, human whenever available or a well-characterized member of the family, with the species or phyla in parenthesis; (iii) the number of proteins in the family.
The Experimental annotations section summarizes the experimental annotations available for PAINT inference. For each family: (i) the number of proteins (having at least one experimental annotation; (ii) the total number of different experimental annotations; (iii) the number of distinct GO terms associated with family members.
The Tree annotations section summarizes the annotations inferred using PAINT. For each family: (i) the total number of proteins having at least one inferred annotation; (ii) the number of distinct IBD annotations (Inferred from Biological Descendant), representing the point in evolution in which the inferred function first evolved; (iii) the number of IBA annotations (Inferred from Biological Ancestor); inherited from the IBD annotations of the tree nodes from which the sequence has evolved; (iv) the number of distinct GO terms used for inference. Differences with the second tree annotation column are due to using more specific terms.
The Function section gives the major function of the family.
Figure 2.Representation of macroautophagy in the Gene Ontology. Induction of autophagy occurs in responses to stresses such as by starvation, and can be mediated by pathways such as the TOR pathway. The autophagosome begins from an extension of the endoplasmic reticulum membrane, and eventually fuses with the vacuole, leading to the degradation of its contents.
Annotation statistics for proteins with members participating in autophagosome assembly
| Protein families | Experimental annotations | Tree annotations | Function | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| PTHR family ID | Representative human proteins | Protein count | Proteins | EXP | Terms | Annotated Proteins | IBD | IBA | Terms | |
| PTHR13430 | ATG13 | 70 | 4 | 20 | 10 | 29 | 16 | 166 | 6 | ULK1 kinase activator |
| PTHR23160 | ATG11 ( | 23 | 7 | 23 | 10 | 20 | 2 | 39 | 2 | Scaffold protein |
| PTHR13664 | ATG14 | 24 | 3 | 16 | 10 | 23 | 3 | 69 | 3 | Membrane recruitment component of the PI3-kinase complex |
| PTHR14957 | ATG10 | 30 | 2 | 10 | 8 | 30 | 5 | 145 | 5 | E2 for ATG12-ATG5 conjugation |
| PTHR31671 | TP53INP2 | 37 | 4 | 35 | 21 | 36 | 4 | 140 | 4 | Scaffold protein that recruits ATG8 family proteins |
| PTHR13222 | RB1CC1 | 44 | 5 | 30 | 23 | 44 | 6 | 257 | 6 | Autophagosome formation |
| PTHR13292 | ATG101 | 49 | 4 | 16 | 13 | 47 | 2 | 95 | 2 | Protects ATG13 from proteasomal degradation |
| PTHR13385 | ATG12 | 49 | 9 | 40 | 23 | 49 | 8 | 383 | 8 | Ubiquitin-like protein |
| PTHR13040 | ATG5 | 55 | 10 | 75 | 44 | 55 | 8 | 430 | 8 | Component of E3 ubiquitin ligase for ATG8-PI conjugation |
| PTHR13038 | ATG9 | 70 | 12 | 41 | 39 | 70 | 8 | 549 | 8 | Bridging protein |
| PTHR19878 | ATG16L1, ATG16L2 | 70 | 7 | 22 | 14 | 70 | 3 | 330 | 3 | Stabilizes the ATG5-ATG12 conjugate |
| PTHR12768 | BECN1 | 74 | 9 | 134 | 59 | 74 | 11 | 744 | 11 | Regulatory component of the PI3-kinase complex |
| PTHR12866 | ATG3 | 74 | 8 | 41 | 22 | 74 | 6 | 431 | 6 | E2 for ATG8-PI conjugation |
| PTHR13190 | ATG2A, ATG2B | 81 | 6 | 27 | 18 | 81 | 6 | 479 | 6 | Localizes ATG18 to omegasome and PAS |
| PTHR24348 | ULK1, ULK2, ULK3 | 127 | 14 | 203 | 91 | 127 | 12 | 1052 | 12 | Kinase for ATG9 |
| PTHR22624 | ATG4A | 137 | 9 | 52 | 27 | 137 | 10 | 1356 | 10 | ATG8 protease |
| PTHR11227 | WIPI1, WIPI2 | 199 | 14 | 135 | 65 | 199 | 12 | 2174 | 12 | PI(3,5)P2 regulatory complex |
| PTHR10969 | MAP1LC3, GARABAP, … | 213 | 26 | 239 | 70 | 213 | 14 | 2148 | 12 | Membrane fusion, autophagosome assembly |
| PTHR10281 | VMP1 | 223 | 22 | 91 | 49 | 222 | 12 | 853 | 10 | Early step in autophagosome assembly |
| PTHR10048 | PIK3C3 | 420 | 63 | 758 | 288 | 420 | 42 | 4488 | 36 | Catalytic component of the PI3-kinase complex |
| PTHR10555 | SNX4 (Sorting nexins) | 521 | 57 | 546 | 129 | 520 | 14 | 2820 | 14 | Sorting nexin involved in autophagosome assembly |
| PTHR19957 | STX17 (syntaxins) | 528 | 73 | 681 | 233 | 525 | 18 | 4681 | 18 | SNARE of the autophagosome |
| PTHR10953 | ATG7 | 631 | 64 | 423 | 159 | 627 | 51 | 3309 | 43 | E1-like activating enzyme |
| PTHR24073 | RAB1A, RAB23 | 2753 | 129 | 1321 | 307 | 2748 | 117 | 25 459 | 68 | Rab GTPase involved in membrane recognition and fusion |
| Total | 6453 | 559 | 4982 | 1740 | 6432 | 385 | 52 650 | 310 | ||
The Protein families section defines the protein families annotated for autophagy, including: (i) the PANTHER family ID; (ii) a representative protein, human whenever available, or a well-characterized member of the family, with the species or phyla in parenthesis; (iii) the number of proteins in the family.
The Experimental annotations section summarizes the experimental annotations available for PAINT inference. For each family: (i) the number of proteins having at least one experimental annotation; (ii) the number of different experimental annotations; (iii) the number of distinct GO terms associated with family members.
The Tree annotations section summarizes the annotations inferred using PAINT. For each family: (i) the number of proteins having at least one inferred annotation; (ii) the number of IBD annotations (Inferred from Biological Descendant), representing the point in evolution in which the inferred function first evolved; (iii) the number of IBA annotations (Inferred from Biological Ancestor); inherited from the IBD annotations of the tree nodes from which the sequence has evolved; (iv) the number of distinct GO terms used for inference.
The Function section gives the major function of the family.
GO-terms describing cellular components involved in autophagosome assembly
| GO ID | Cellular component and corresponding topology |
|---|---|
| GO:0005776 | Autophagosome |
| GO:0000421 | Autophagosome membrane |
| GO:0034423 | Autophagosome lumen |
| GO:0097635 | Extrinsic component of autophagosome membrane |
| GO:0097636 | Intrinsic component of autophagosome membrane |
| 2GO:0097637 | Integral component of autophagosome membrane |
| GO:0000407 | Pre-autophagosomal structure |
| GO:0034045 | Pre-autophagosomal structure membrane |
| GO:0097632 | Extrinsic component of pre-autophagosomal structure membrane |
| GO:0097633 | Intrinsic component of pre-autophagosomal structure membrane |
| GO:0097634 | Integral component of pre-autophagosomal structure membrane |
| GO:1990462 | Omegasome |
| GO:1903349 | Omegasome membrane |
| GO:0097629 | Extrinsic component of omegasome membrane |
| GO:0097630 | Intrinsic component of omegasome membrane |
| GO:0097631 | Integral component of omegasome membrane |
Terms for which we have replaced ‘autophagic vacuole’ by ‘autophagosome’.
Newly created terms.