| Literature DB >> 24507166 |
Heather C Wick1, Harold Drabkin, Huy Ngu, Michael Sackman, Craig Fournier, Jessica Haggett, Judith A Blake, Diana W Bianchi, Donna K Slonim.
Abstract
BACKGROUND: Recent increases in genomic studies of the developing human fetus and neonate have led to a need for widespread characterization of the functional roles of genes at different developmental stages. The Gene Ontology (GO), a valuable and widely-used resource for characterizing gene function, offers perhaps the most suitable functional annotation system for this purpose. However, due in part to the difficulty of studying molecular genetic effects in humans, even the current collection of comprehensive GO annotations for human genes and gene products often lacks adequate developmental context for scientists wishing to study gene function in the human fetus. DESCRIPTION: The Developmental FunctionaL Annotation at Tufts (DFLAT) project aims to improve the quality of analyses of fetal gene expression and regulation by curating human fetal gene functions using both manual and semi-automated GO procedures. Eligible annotations are then contributed to the GO database and included in GO releases of human data. DFLAT has produced a considerable body of functional annotation that we demonstrate provides valuable information about developmental genomics. A collection of gene sets (genes implicated in the same function or biological process), made by combining existing GO annotations with the 13,344 new DFLAT annotations, is available for use in novel analyses. Gene set analyses of expression in several data sets, including amniotic fluid RNA from fetuses with trisomies 21 and 18, umbilical cord blood, and blood from newborns with bronchopulmonary dysplasia, were conducted both with and without the DFLAT annotation.Entities:
Mesh:
Year: 2014 PMID: 24507166 PMCID: PMC3928322 DOI: 10.1186/1471-2105-15-45
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Overview of sources of DFLAT annotation. On the left-hand side, neonatologists suggest keywords and developmental processes for manual curation of the literature, which proceeds according to the methodology of the Gene Ontology Consortium. Eligible annotations are submitted to the Gene Ontology and included in subsequent data releases. Others, valuable for our purposes, become part of the “GONE” collection. The right-hand side of the image depicts our procedure for deriving annotation from mouse orthologs. Mouse genes of interest are identified by having GO annotations in the “Developmental Process” subtree. For those genes for which MGI has identified unique human orthologs, all mouse annotations with the required evidence codes are mapped to the corresponding human gene.
Figure 2Propagation of Gene Ontology annotation. Genes or gene products (letters A-P) are propagated up the GO graph from the original GO annotation term to all of its parents via is_a (solid arrows) and part_of (dashed arrows) relationships. Bold text letters indicate genes or gene products directly annotated to the GO term; the remaining letters indicate propagated annotations.
Numbers of unique gene sets
| | ||||||
|---|---|---|---|---|---|---|
| Trisomy 21 | 406 | 64 | 14 | 2 | 3.45 | 3.13 |
| Trisomy 18 | 406 | 64 | 22 | 3 | 5.42 | 4.69 |
| Maternal/fetal blood | 376 | 58 | 14* | 2* | 3.72* | 3.45* |
| BPD | 406 | 64 | 23 | 0 | 5.67 | 0.00 |
| Leukemia | 372 | 55 | 44 | 16 | 11.83 | 29.10 |
Unique gene sets are gene sets that meet the GSEA size filter (7–150 genes) in only one of the two gene set collections, DFLAT or GOA.
*Because of the design of this study, these numbers report significant changes in only one direction – gene sets that might be characterizing fetal gene expression in maternal blood. If we count significant changes in both directions, we find 38 and 6 with DFLAT and GOA, respectively, but these include functions that likely characterize maternal response to Cesarean delivery, rather than fetal expression.
Total number newly significant/insignificant (p < 0.05) gene sets with DFLAT
| Trisomy 21 | 47 | 29 |
| Trisomy 18 | 37 | 16 |
| Maternal/fetal blood* | 22 | 6 |
| BPD | 50 | 27 |
| Leukemia | 108 | 49 |
Total number of gene sets (either “unique” or “common”) showing enrichment (GSEA p < 0.05) with only the DFLAT collection (“Newly significant”) or the GOA collection (“Newly insignificant”).
*Because of the design of this study, these numbers report the significant changes only in the meaningful direction: gene sets that might characterize fetal gene expression in maternal blood. Changes in the other direction likely characterize maternal response to Cesarean delivery.
Verified gene set significance (p < 0.05) in Trisomy 21
| DFLAT | 85.1% (40) | 14.9% (7) |
| GOA | 65.5% (19) | 34.5% (10) |
Verified gene set significance (p < 0.05) in Trisomy 18
| DFLAT | 37.8% (14) | 62.2% (23) |
| GOA | 31.3% (5) | 68.8% (11) |
Percent (number) of GO terms from the first two rows of Table 2 that were found to have independent corroborating evidence in the literature.