| Literature DB >> 31032818 |
Caralina Marín de Evsikova1,2, Isaac D Raplee3, John Lockhart4, Gilberto Jaimes5, Alexei V Evsikov6.
Abstract
As one of the most widespread metabolic diseases, atherosclerosis affects nearly everyone as they age; arteries gradually narrow from plaque accumulation over time reducing oxygenated blood flow to central and periphery causing heart disease, stroke, kidney problems, and even pulmonary disease. Personalized medicine promises to bring treatments based on individual genome sequencing that precisely target the molecular pathways underlying atherosclerosis and its symptoms, but to date only a few genotypes have been identified. A promising alternative to this genetic approach is the identification of pathways altered in atherosclerosis by transcriptome analysis of atherosclerotic tissues to target specific aspects of disease. Transcriptomics is a potentially useful tool for both diagnostics and discovery science, exposing novel cellular and molecular mechanisms in clinical and translational models, and depending on experimental design to identify and test novel therapeutics. The cost and time required for transcriptome analysis has been greatly reduced by the development of next generation sequencing. The goal of this resource article is to provide background and a guide to appropriate technologies and downstream analyses in transcriptomics experiments generating ever-increasing amounts of gene expression data.Entities:
Keywords: RNA-seq analysis; atherosclerosis; coronary aortic disease; gene set enrichment analysis; heart disease; metabolic disease; pathway enrichment analysis; secondary gene expression analysis; transcriptomics
Year: 2019 PMID: 31032818 PMCID: PMC6617151 DOI: 10.3390/jpm9020021
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Figure 1Transcriptomics workflow diagram highlighting the steps to process tissue, cell, or biopsy sample for RNA and choosing gene expression technology platform depending upon the specific application as an investigative tool for discovery science, disease diagnosis, or molecular mechanism. EST: expressed sequence tag; SAGE: serial analysis of gene expression; NGS: next-generation sequencing.
Figure 2Timeline of the introduction of prominent technologies for gene expression measurement and bioinformatics analysis since the discovery of reverse transcriptase, an enzyme indispensable for any RNA sequencing study. Some of the seminal papers in atherosclerosis research discussed in the text are shown in this timeline as well. Timeline is not to scale. SMRT: single molecule real time.
Figure 3Older technologies used in gene expression studies. (A) In microarray experiments, labeled cRNA are used to measure the gene expression level by hybridization to cDNAs on glass slides representing known genes. The intensities are measured, normalized, and analyzed by computer software to compare experimental treatments or conditions. (B) Sanger sequencing was the original method of measuring DNA nucleotide sequence based on chain-dye termination and the first technology for sequencing of expressed genes. (C) Steps in producing concatenated short tags for subsequent sequencing in SAGE method.
Figure 4Principle of pyrosequencing.
Figure 5Generalized pipeline for a high-throughput microarray or RNA-seq transcriptomics study.
Figure 6(A) Gene annotations in Gene Ontology (GO) across species based on type of evidence supporting gene annotation. (B) Breakdown of gene annotations based on most frequently used evidence categories (Biological Process, Cellular Component, and Molecular Function categories combined). (C) Number of genes annotated with at least one GO term in the species.
Figure 7Examples of ontology structure and the power of ontological analysis. Gene Ontology (GO) (A), Mammalian Phenotype Ontology (MP) (B), and Human Disease Ontology (C) terms related to atherosclerosis. (D) Bioinformatics analysis result for VisuaL Annotated Display (VLAD) illustrating the statistically significant GO categories overrepresented among the 100 highest-expressed genes in atherosclerotic aortas of mouse and rabbit translational models. The width of the color bar represents the relative “strength” of a particular GO pathway representation among the highest-expressed genes and exemplifies similarities and differences between models, green bar = mouse, red bar = rabbit.
Biomedical Ontology and Pathway Databases and Ontology/Pathway Enrichment Tools.
| Resource | Description | URL | Ref |
|---|---|---|---|
|
| |||
| Gene Ontology | Central repository of terms describing gene functions across multiple biological systems |
| [ |
| Mammalian Phenotype Ontology | Biomedical curators’ and community database of ontological terms for annotating phenotypic data |
| [ |
| Human Disease Ontology | Ontology for human disease cross-mapped to MeSH, ICD, NCI’s thesaurus, SNOMED and OMIM. |
| [ |
| Protein Ontology | Ontology of protein-related entities, their explicit definitions, and relationships between them. |
| [ |
| Open Biological Ontologies | Collaborative effort to specify and implement best principles and practices in ontology development. Contains links to all Ontologies. |
| [ |
| MSigDB | A collection of annotated gene sets, such as canonical pathways gene sets, for use with GSEA. |
| [ |
| MetaCyc | A curated database of experimentally elucidated metabolic pathways for many organisms. |
| [ |
| KEGG | A collection of maps representing metabolism, pathways, and associated genes. |
| [ |
| Reactome | A free, open-source, curated and peer-reviewed pathway database. |
| [ |
|
| |||
| VLAD | Tool for identification of statistically significant over- or under-represented ontology terms in lists of genes. GO gene – function annotations for human and mouse, and MP gene – phenotype annotations for mouse are pre-loaded. Allows uploading user-specified ontologies and gene – ontology mappings. Updated weekly. |
| [ |
| AmiGO | Allows users to query, browse and visualize ontologies and gene annotation data for many species. Updated weekly. |
| [ |
| GOrilla | A tool to identify and visualize enriched GO terms in gene lists. Can either search for GO terms at the top of a ranked gene list, or compare a target gene list to a background gene list. |
| [ |
| DAVID | A set of tools to identify overrepresented features in large lists of genes. |
| [ |
| BinGO | Cytoscape tool to visualize statistically overrepresented GO terms in a list of genes. |
| [ |
| GSEA | A tool to determine if a gene set has significant differences between two biological states. |
| [ |
1 Download link for a stand-alone tool.