| Literature DB >> 30911735 |
Nicholas Owen1, Mariya Moosajee1.
Abstract
High-throughput, massively parallel sequence analysis has revolutionized the way that researchers design and execute scientific investigations. Vast amounts of sequence data can be generated in short periods of time. Regarding ophthalmology and vision research, extensive interrogation of patient samples for underlying causative DNA mutations has resulted in the discovery of many new genes relevant to eye disease. However, such analysis remains functionally limited. RNA-sequencing accurately snapshots thousands of genes, capturing many subtypes of RNA molecules, and has become the gold standard for transcriptome gene expression quantification. RNA-sequencing has the potential to advance our understanding of eye development and disease; it can reveal new candidates to improve our molecular diagnosis rates and highlight therapeutic targets for intervention. But with a wide range of applications, the design of such experiments can be problematic, no single optimal pipeline exists, and therefore, several considerations must be undertaken for optimal study design. We review the key steps involved in RNA-sequencing experimental design and the downstream bioinformatic pipelines used for differential gene expression. We provide guidance on the application of RNA-sequencing to ophthalmology and sources of open-access eye-related data sets.Entities:
Keywords: RNA-sequencing; bioinformatics; differential gene expression; false discovery rate; gene ontology; next-generation sequencing; ophthalmology; power; replicates; transcriptomics
Year: 2019 PMID: 30911735 PMCID: PMC6421592 DOI: 10.1177/2515841419835460
Source DB: PubMed Journal: Ther Adv Ophthalmol ISSN: 2515-8414
Figure 1.A diagrammatic overview of the considerations for designing a successful RNA-seq experiment for differential gene expression analysis. Branches of the outline are numbered to indicate the general order for the considerations. Within each branch, subbranches denote options to consider within the design.
Summary of a subset of NCBI-submitted RNA-seq experimental data sets related to eye development and disease, highlighting utilized methods and software. The NCBI Gene Expression Omnibus (GEO) was searched for terms ‘retina disease; retina development; eye disease; eye development’, subsetting on ‘Study type’ – ‘expression profiling by high throughput sequencing’ (December 2018) (available details of the software used for analysis are noted; *unpublished data sets).
| Keywords | Data set description | Species | NCBI GEO | Software | Reference |
|---|---|---|---|---|---|
| Cornea | Molecular Effects of Doxycycline Treatment on Pterygium as Revealed by Massive Transcriptome Sequencing |
| GSE34736 | Tuxedo: TopHat2, Cufflinks2 | Larrayoz and colleagues[ |
| Cornea | RNA-seq analysis in Cornea epithelial cells (CECs), skin epithelial cells (SECs), LSCs after knocking down PAX6 (3-D shPAX6 LSCs) and SESCs transduced with PAX6 (3-D PAX6+SESCs) upon 3-D differentiation |
| GSE54322 | Not reported | Ouyang and colleagues[ |
| Cornea | Molecular Effects of Doxycycline Treatment on Pterygium from Caucasian Patients as Revealed by Massive Transcriptome Sequencing |
| GSE58441 | Tuxedo: TopHat2, Cufflinks2 | Larrayoz and colleagues[ |
| Cornea | RNA-seq analysis and comparison of corneal epithelium in keratoconus and myopia patients |
| GSE112155 | TopHat2, edgeR, DESeq2, limma | You and colleagues[ |
| Cornea | RNA Mis-splicing in Fuchs Endothelial Corneal Dystrophy II |
| GSE112201 | TopHat2, edgeR | Wieben and colleagues[ |
| Cornea | RNA Mis-splicing in Fuchs Endothelial Corneal Dystrophy |
| GSE101872 | TopHat2, edgeR | * |
| Cornea | Transcriptome profiling of human keratoconus corneas through RNA-sequencing identifies collagen synthesis disruption and downregulation of core elements of TGF-β, Hippo, and Wnt pathways |
| GSE77938 | Bowtie2, StringTie, Cufflinks2, Kallisto, DESeq2, edgeR | Kabza and colleagues[ |
| Diabetic retinopathy | Transcriptomic Analysis of Endothelial Cells from Fibrovascular Membranes in Proliferative Diabetic Retinopathy |
| GSE94019 | Partek | * |
| Muller glia | Rapid, dynamic activation of Müller glial stem cell responses in zebrafish |
| GSE86872 | RSEM, edgeR, limma | Sifuentes and colleagues[ |
| Retina | Id2a knockdown in zebrafish retina |
| GSE38786 | Bowtie, DESeq, DAVID | Uribe and colleagues[ |
| Retina | Molecular anatomy of the developing human retina |
| GSE104827 | STAR, RSEM, limma, | Hoshino and colleagues[ |
| Retina | The Dynamic Epigenetic Landscape of the Retina During Development, Reprogramming, and Tumorigenesis |
| GSE87042 | TopHat2, Cufflins2 | Aldiri and colleagues[ |
| Retina | Unprecedented alternative splicing and 3 Mb of novel transcribed sequence leads to significant transcript diversity in the transcriptome of the human retina |
| GSE40524 | RUM pipeline | Farkas and colleagues[ |
| Retina | Comparative Systems Pharmacology of HIF Stabilization in the Prevention of Retinopathy of Prematurity |
| GSE74170 | TopHat, Cufflinks | Hoppe and colleagues[ |
| Retina/CRX | Graded Expression Changes Determine Phenotype Severity In Mouse Models of CRX-Associated Retinopathy |
| GSE65506 | TopHat, edgeR | Ruzycki and colleagues[ |
| Retina/Macula | Comprehensive analysis of gene expression in human retina and supporting tissues |
| GSE94437 | GSNAP, Cufflinks2 | * |
| Retina/RP | rd10 transcriptome analysis |
| GSE56473 | RMap, edgeR | Uren and colleagues[ |
| Retina/RPE | Comprehensive analysis of gene expression in human retina and supporting tissues |
| GSE94437 | GSNAP, Cufflinks2 | * |
| Retina/RPE | Region-specific Transcriptome Analysis of the Human Retina and RPE/Choroid |
| PRJNA336370 | TopHat2, Cufflink2, cummeRbund | Whitmore and colleagues[ |
| Retina/RPE/ES | Comparative transcriptomic analysis of self-organized, in vitro generated optic tissues |
| GSE62432 | TopHat2, Cufflinks2, edgeR | Andrabi and colleagues[ |
| Retinoblastoma | A three-dimensional organoid model recapitulates tumorigenic aspects and drug responses of advanced human retinoblastoma |
| GSE120710 | Kallisto | Saengwimol and colleagues[ |
| Retinal Culture/iPSC | Treatment Paradigms for Retinal and Macular Diseases Using 3-D Retina Cultures Derived From Human Reporter Pluripotent Stem Cell Lines |
| GSE103826 | Not reported | Kaewkhaw and colleagues[ |
| Retinal Culture/iPSC | Transcriptome dynamics of developing photoreceptors in 3-D retina cultures recapitulates temporal sequence of human cone and rod differentiation revealing cell surface markers and gene networks |
| GSE67645 | Bowtie2, eXpress, edgeR, limma | Kaewkhaw and colleagues[ |
| Retinal Culture/iPSC/ESC | Accelerated and Improved Differentiation of Retinal Organoids from Mouse Pluripotent Stem Cells in Rotating-Wall Bioreactors |
| GSE102727 | edgeR, limma | DiStefano and colleagues[ |
| RGC/ESC | Enriched retinal ganglion cells derived from human embryonic stem cells (RNA-seq) |
| GSE84639 | ExAtlas | Gill and colleagues[ |
| RPE | Aneuploidy-induced cellular stresses limit autophagic degradation. |
| GSE60570 | RSEM, Bowtie, DESeq, ssGSEA | Santaguida and colleagues[ |
| RPE | Regulation of protein translation during mitosis |
| GSE67902 | Bowtie, DAVID | Tanenbaum and colleagues[ |
| RPE | RNA-Seq analysis of 4N and 2N RPE1 cells following polyploid induction via cytokinesis failure or Aurora kinase inhibition [tpo3] |
| GSE86101 | TopHat2, edgeR | Potapova and colleagues[ |
| RPE | RNA-Seq analysis of proliferating 4N and 2N RPE1 cells derived from single cell clones following inhibition of Aurora B to induce polyploidization [tpo10] |
| GSE86103 | TopHat2, edgeR | Potapova and colleagues[ |
| RPE | RNA-Seq analysis RPE1 cells following exposure to Nutlin-3 to identify target genes of p53 [tpo12] |
| GSE86104 | TopHat2, edgeR | Potapova and colleagues[ |
| RPE | Appropriately Differentiated ARPE-19 Cells Regain a Native Phenotype and Similar Gene Expression Profile |
| GSE88848 | CLC Genomics Workbench, DESeq2 | Samuel and colleagues[ |
| RPE/AMD | Reversal of persistent wound-induced retinal pigmented epithelial-to-mesenchymal transition by the TGFb pathway inhibitor, A-83-01 |
| GSE67898 | Partek, edgeR | Radeke and colleagues[ |
| RPE/AMD | A widespread decrease of chromatin accessibility in age-related macular degeneration |
| GSE99287 | TopHat2, Cufflinks2 | Wang and colleagues[ |
| RPE/iPSC | Expression data for hiPSC-derived RPE treated with 10mM Nicotinamide or vehicle |
| GSE90889 | STAR, bedtools, samtools, DESeq2 | Saini and colleagues[ |
| RPE/iPSC | Comparison of stem-cell derived retinal pigment epithelia (RPE) with human fetal retina pigment epithelium |
| GSE36695 | Galaxy - TopHat2, Cufflinks2 | * |
AMD, age-related macular degeneration; DAVID, Database for Annotation, Visualization, and Integrated Discovery; iPSC, induced pluripotent stem cell; NCBI, National Center for Biotechnology Information; RGS, retinal ganglion cell; RNA-seq, RNA-sequencing; RP, retinal pigment; RPE, retinal pigment epithelia.
Figure 2.Schematic representation of typical bioinformatic processing of high-throughput sequence data for RNA-seq experiments. The sequencing platform generated raw reads (FASTQ) are subjected to quality assessment. Where a reference genome and a high-quality annotation are available, resulting high-quality cleaned reads can be used in alignment- or pseudo-alignment-based processes. For alignment-based process, reads are mapped to the genome and transcriptome in a splice-aware manner. Resulting alignments (SAM/BAM/CRAM) are assessed for mapping qualities and counts of features (genes/transcripts/exons) generated. Counts are modeled for quantification and differential analysis computed using various methods, resulting in differential feature lists. With pseudo-alignment-based methods, clean reads are modeled to the transcriptome, allowing direct quantification of appropriate feature(s) for differential analysis. The output of both approaches can provide further insight through gene ontology analysis (GSEA/GO ORA), pathway analysis (Panther, KEGG, DAVID), and visualized (IGV, GenomeBrowse, Bioconductor) for report production. Software examples listed are non-exhaustive.
DAVID, Database for Annotation, Visualization, and Integrated Discovery; GO, gene ontology; GSEA, gene set enrichment analysis; IGV, Integrative Genomics Viewer; KEGG, Kyoto Encyclopedia of Genes and Genomes; ORA, over-representation analysis; Panther, Protein Analysis Through Evolutionary Relationships; SAM/BAM, sequence/binary alignment map.
Figure 3.Heat map of differentially expressed genes (DEGs) in zebrafish between isolated optic fissure tissue and dorsal retina at 56 hours post fertilization (hpf),[159] generated by R for Statistics package NMF. DEGs were identified using DESeq2, whose output was filtered for biologically significant results using criteria of a false discovery rate of less than 0.01 and fold change greater than 2. Resulting DESeq2 analysis was rlog transformed and hierarchical clustering performed on differential gene list. The z-score scale bar represents relative expression ±2SD from the mean. Top enriched gene ontology for biological process (BP) is highlighted for each cluster.