| Literature DB >> 31200636 |
Ashok Kumar Sharma1, Barbora Pafčo2,3, Klára Vlčková2,3, Barbora Červená2,3, Jakub Kreisinger2,4, Samuel Davison1, Karen Beeri5, Terence Fuh6, Steven R Leigh7, Michael B Burns8, Ran Blekhman9,10, Klára J Petrželková11,12,13, Andres Gomez14.
Abstract
BACKGROUND: Limited accessibility to intestinal epithelial tissue in wild animals and humans makes it challenging to study patterns of intestinal gene regulation, and hence to monitor physiological status and health in field conditions. To explore solutions to this limitation, we have used a noninvasive approach via fecal RNA-seq, for the quantification of gene expression markers in gastrointestinal cells of free-range primates and a forager human population. Thus, a combination of poly(A) mRNA enrichment and rRNA depletion methods was used in tandem with RNA-seq to quantify and compare gastrointestinal gene expression patterns in fecal samples of wild Gorilla gorilla gorilla (n = 9) and BaAka hunter-gatherers (n = 10) from The Dzanga Sangha Protected Areas, Central African Republic.Entities:
Keywords: Gene expression; Nonhuman primate; Noninvasive method; RNA-seq
Mesh:
Substances:
Year: 2019 PMID: 31200636 PMCID: PMC6567582 DOI: 10.1186/s12864-019-5813-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Evaluation of mRNA enrichment methods using RNA-seq in four sample set. Read statistics along with the % of alignment to G. g. gorilla and H. sapiens genomes
| Samples | Groups | Number of paired-end reads | Remaining reads after quality filtering | Singletons | Low quality reads | Low quality (%) | Total reads aligned using Kallisto | Ratio OLIGO:RIBO | Alignment (%) |
|---|---|---|---|---|---|---|---|---|---|
| KOTO-OLIGO | Haman | 13,796,106 | 13,340,097 | 0 | 456,009 | 3.31 | 59,616 | 9.13 | 0.45 |
| KOTO-RIBO | Human | 14,066,624 | 13,695,055 | 0 | 371,569 | 2.64 | 6532 | 0.05 | |
| MBUSA-OLIGO | Human | 16,559,894 | 14,492,534 | 1,269,002 | 798,358 | 4.82 | 103,239 | 11.85 | 0.71 |
| MBUSA-RIBO | Human | 16,775,019 | 15,644,483 | 845,112 | 285,424 | 1.70 | 8714 | 0.06 | |
| IYIKI-OLIGO | Human | 4,992,286 | 4,390,932 | 437,374 | 163,980 | 3.28 | 57,579 | 7.41 | 1.31 |
| IYIKI-RIBO | Human | 5,677,816 | 5,159,806 | 388,422 | 129,588 | 2.28 | 7772 | 0.15 | |
| MALUI-OLIGO | Gorilla | 4,541,457 | 4,086,161 | 334,319 | 120,977 | 2.66 | 57,810 | 5.02 | 1.41 |
| MALUI-RIBO | Gorilla | 4,749,986 | 4,193,129 | 418,094 | 138,763 | 2.92 | 11,506 | 0.27 |
Signletons: Number of high quailty reads for which the correspoding pairs were discarded during quality filtering
Ratio OLIGO:RIBO: Ratio of aligned reads using poly(A) mRNA enrichment plus rRNA depletion vs rRNA depletion alone
Alignment rate: Number of Kallisto aligned reads/number of high quality reads
Fig. 1Taxonomic profiling of non-host mRNA reads, using BLAST against the NR database. BLAST on 10,000 randomly selected unmapped reads from each sample: a) Percentage relative proportions of major taxonomic groups, b) Percentage relative proportions of other eukaryotic hits (excluding gorillas and humans), c) Percentage relative proportions of all taxonomic groups identified from gorilla and human samples, d) Percentage relative proportion of Arthropods and Nematodes
Fig. 2Comparison of human and gorilla transcriptomic data from fecal samples using expression profile of total 15,602 common or orthologous genes: a) Principal coordinate analysis based on Bray-Curtis distances, b) Hierarchical cluster analysis using hclust function based on the distances calculated using binary distance measure, c) Comparison of human-BaAka, gorilla and human-infants (downloaded) transcriptomic data using principal coordinate analysis based on Bray-Curtis distances, d) Bray-Curtis distances between these groups
Fig. 3Identification of differentially expressed genes from IECs of gorillas and humans in fecal samples: a) Volcano plot of differentially expressed genes. A total 212 genes showed significant differential expression: 184 up-regulated, and 28 down-regulated, b) Heatmap representation of top 50 differentially expressed genes based on q-value <= 3.09E-12 and log2 fold change> = 13, between human and gorilla samples. Each row in the heatmap represents gene symbols, whereas each column in the heatmap represents sample names. Color gradient scaling represents the normalized z-scores
List of selected top 50 significantly differentially expressed genes from gorilla and human fecal RNA-seq
| Ensembl Id | baseMean | log2FC | lfcSE | pvalue | padj | Gene symbol | entrez ID | Gene name |
|---|---|---|---|---|---|---|---|---|
| ENSG00000167996 | 11,610.8 | −16.95 | 1.69 | 1.30E-23 | 3.81E-20 | FTH1 | 2495 | ferritin heavy chain 1 |
| ENSG00000159251 | 10,736.4 | 16.84 | 1.76 | 1.20E-21 | 1.75E-18 | ACTC1 | 70 | actin, alpha, cardiac muscle 1 |
| ENSG00000183304 | 61.38 | 25.02 | 2.68 | 9.69E-21 | 9.46E-18 | FAM9A | 171,482 | family with sequence similarity 9 member A |
| ENSG00000185201 | 183.02 | 26.54 | 2.96 | 3.09E-19 | 2.26E-16 | IFITM2 | 10,581 | interferon induced transmembrane protein 2 |
| ENSG00000158470 | 18,496.9 | 15.59 | 1.77 | 1.18E-18 | 5.77E-16 | B4GALT5 | 9334 | beta-1,4-galactosyltransferase 5 |
| ENSG00000152977 | 704.66 | −26.19 | 2.97 | 1.10E-18 | 5.77E-16 | ZIC1 | 7545 | Zic family member 1 |
| ENSG00000235750 | 3610.92 | 15.27 | 1.74 | 1.76E-18 | 7.37E-16 | KIAA0040 | 9674 | KIAA0040 |
| ENSG00000227877 | 91.61 | 25.53 | 2.96 | 6.52E-18 | 2.32E-15 | MRLN | 100,507,027 | myoregulin |
| ENSG00000089356 | 85.74 | 25.50 | 2.96 | 7.12E-18 | 2.32E-15 | FXYD3 | 5349 | FXYD domain containing ion transport regulator 3 |
| ENSG00000280831 | 72.47 | 25.28 | 2.96 | 1.38E-17 | 4.04E-15 | RPS25 | 6230 | ribosomal protein S25 |
| ENSG00000169605 | 64.76 | 24.87 | 2.96 | 4.44E-17 | 1.18E-14 | GKN1 | 56,287 | gastrokine 1 |
| ENSG00000164405 | 52.34 | 24.83 | 2.96 | 5.02E-17 | 1.23E-14 | UQCRQ | 27,089 | ubiquinol-cytochrome c reductase complex III subunit VII |
| ENSG00000126709 | 45.85 | 24.65 | 2.96 | 8.44E-17 | 1.72E-14 | IFI6 | 2537 | interferon alpha inducible protein 6 |
| ENSG00000233927 | 45.61 | 24.64 | 2.96 | 8.67E-17 | 1.72E-14 | RPS28 | 6234 | ribosomal protein S28 |
| ENSG00000107223 | 53.30 | 24.63 | 2.96 | 8.81E-17 | 1.72E-14 | EDF1 | 8721 | endothelial differentiation related factor 1 |
| ENSG00000144233 | 39.76 | 24.44 | 2.96 | 1.53E-16 | 2.68E-14 | AMMECR1L | 83,607 | AMMECR1 like |
| ENSG00000173467 | 39.14 | 24.43 | 2.96 | 1.56E-16 | 2.68E-14 | AGR3 | 155,465 | anterior gradient 3, protein disulphide isomerase family member |
| ENSG00000129559 | 38.07 | 24.39 | 2.96 | 1.77E-16 | 2.77E-14 | NEDD8 | 4738 | neural precursor cell expressed, developmentally down-regulated 8 |
| ENSG00000106153 | 40.39 | 24.38 | 2.96 | 1.80E-16 | 2.77E-14 | CHCHD2 | 51,142 | coiled-coil-helix-coiled-coil-helix domain containing 2 |
| ENSG00000169189 | 36.70 | 24.35 | 2.96 | 2.00E-16 | 2.93E-14 | NSMCE1 | 197,370 | NSE1 homolog, SMC5-SMC6 complex component |
| ENSG00000163541 | 32.88 | 24.20 | 2.96 | 3.04E-16 | 4.24E-14 | SUCLG1 | 8802 | succinate-CoA ligase alpha subunit |
| ENSG00000198643 | 32.08 | 24.14 | 2.96 | 3.59E-16 | 4.78E-14 | FAM3D | 131,177 | family with sequence similarity 3 member D |
| ENSG00000275903 | 29.48 | 24.03 | 2.96 | 4.88E-16 | 6.22E-14 | PSMB3 | 5691 | proteasome subunit beta 3 |
| ENSG00000177697 | 36.53 | 23.99 | 2.96 | 5.34E-16 | 6.26E-14 | CD151 | 977 | CD151 molecule (Raph blood group) |
| ENSG00000163959 | 26.61 | 23.91 | 2.96 | 6.79E-16 | 7.65E-14 | SLC51A | 200,931 | solute carrier family 51 alpha subunit |
| ENSG00000196914 | 49.35 | −23.95 | 2.97 | 7.09E-16 | 7.69E-14 | ARHGEF12 | 23,365 | Rho guanine nucleotide exchange factor 12 |
| ENSG00000136167 | 26.11 | 23.87 | 2.96 | 7.58E-16 | 7.86E-14 | LCP1 | 3936 | lymphocyte cytosolic protein 1 |
| ENSG00000241468 | 25.70 | 23.86 | 2.96 | 7.78E-16 | 7.86E-14 | ATP5MF | 9551 | ATP synthase membrane subunit f |
| ENSG00000182718 | 24.11 | 23.77 | 2.96 | 1.00E-15 | 9.77E-14 | ANXA2 | 302 | annexin A2 |
| ENSG00000164266 | 38.51 | 23.73 | 2.96 | 1.12E-15 | 1.06E-13 | SPINK1 | 6690 | serine peptidase inhibitor, Kazal type 1 |
| ENSG00000156467 | 22.80 | 23.69 | 2.96 | 1.24E-15 | 1.14E-13 | UQCRB | 7381 | ubiquinol-cytochrome c reductase binding protein |
| ENSG00000140319 | 21.51 | 23.60 | 2.96 | 1.58E-15 | 1.40E-13 | SRP14 | 6727 | signal recognition particle 14 |
| ENSG00000118972 | 1630.18 | −14.12 | 1.77 | 1.62E-15 | 1.40E-13 | FGF23 | 8074 | fibroblast growth factor 23 |
| ENSG00000110700 | 20.79 | 23.57 | 2.96 | 1.72E-15 | 1.44E-13 | RPS13 | 6207 | ribosomal protein S13 |
| ENSG00000124172 | 18.93 | 23.45 | 2.96 | 2.43E-15 | 1.97E-13 | ATP5F1E | 514 | ATP synthase F1 subunit epsilon |
| ENSG00000105404 | 18.29 | 23.39 | 2.96 | 2.81E-15 | 2.22E-13 | RABAC1 | 10,567 | Rab acceptor 1 |
| ENSG00000143226 | 17.72 | 23.36 | 2.96 | 3.10E-15 | 2.33E-13 | FCGR2A | 2212 | Fc fragment of IgG receptor IIa |
| ENSG00000197956 | 886.17 | 13.24 | 1.68 | 3.08E-15 | 2.33E-13 | S100A6 | 6277 | S100 calcium binding protein A6 |
| ENSG00000116663 | 16.94 | 23.29 | 2.96 | 3.68E-15 | 2.70E-13 | FBXO6 | 26,270 | F-box protein 6 |
| ENSG00000105379 | 16.11 | 23.22 | 2.96 | 4.51E-15 | 3.22E-13 | ETFB | 2109 | electron transfer flavoprotein beta subunit |
| ENSG00000246705 | 15.06 | 23.13 | 2.96 | 5.68E-15 | 3.96E-13 | H2AFJ | 55,766 | H2A histone family member J |
| ENSG00000206291 | 15.77 | 22.97 | 2.96 | 8.90E-15 | 5.79E-13 | HLA-DPA1 | 3113 | major histocompatibility complex, class II, DP alpha 1 |
| ENSG00000103187 | 12.75 | 22.91 | 2.96 | 1.04E-14 | 6.51E-13 | COTL1 | 23,406 | coactosin like F-actin binding protein 1 |
| ENSG00000117616 | 13.04 | 22.91 | 2.96 | 1.04E-14 | 6.51E-13 | RSRP1 | 57,035 | arginine and serine rich protein 1 |
| ENSG00000162909 | 12.50 | 22.79 | 2.96 | 1.41E-14 | 8.59E-13 | CAPN2 | 824 | calpain 2 |
| ENSG00000148180 | 19.65 | −22.67 | 2.97 | 2.24E-14 | 1.34E-12 | GSN | 2934 | gelsolin |
| ENSG00000165030 | 17.54 | −22.51 | 2.97 | 3.41E-14 | 1.96E-12 | NFIL3 | 4783 | nuclear factor, interleukin 3 regulated |
| ENSG00000165704 | 16.69 | −22.44 | 2.97 | 4.10E-14 | 2.31E-12 | HPRT1 | 3251 | hypoxanthine phosphoribosyltransferase 1 |
| ENSG00000119431 | 15.72 | −22.36 | 2.97 | 5.06E-14 | 2.80E-12 | HDHD3 | 81,932 | haloacid dehalogenase like hydrolase domain containing 3 |
| ENSG00000116176 | 13.05 | 22.26 | 2.96 | 5.70E-14 | 3.09E-12 | TPSG1 | 25,823 | tryptase gamma 1 |
log2FC: log2FoldChange (Gorilla/human)
lfcSE: logfoldchangeStandard Error
padj: Adjusted p value
Fig. 4Pathway analysis (Ingenuity IPA software) was performed using differentially expressed genes. Top 5 up-regulated and top-5 down regulated pathways selected based on the IPA provided z-scores
Differentially regulated pathways between gorilla and humans identified using Ingenuity Pathway Analysis
| Ingenuity Canonical Pathways | -log( | Ratio | z-score | Molecules |
|---|---|---|---|---|
| Oxidative Phosphorylation | 6.02E+ 00 | 8.26E-02 | 3 | COX6B1,SDHB,ATP5PD,COX5B,ATP5MF,ATP5MC3,UQCRQ,ATP5F1E,UQCRB |
| Fcγ Receptor-mediated Phagocytosis in Macrophages and Monocytes | 2.72E+ 00 | 5.38E-02 | 2.236 | FCGR2A,ACTB,ARPC3,ACTC1,ACTA1 |
| Dendritic Cell Maturation | 1.42E+ 00 | 2.58E-02 | 2.236 | PLCD3,NFKBIA,LEP,FCGR2A,IL32 |
| Interferon Signaling | 3.44E+ 00 | 1.11E-01 | 2 | IFNGR2,IFI6,IFITM2,IFITM1 |
| PI3K Signaling in B Lymphocytes | 1.39E+ 00 | 2.94E-02 | 2 | PLCD3,ATF3,NFKBIA,CHP1 |
| EIF2 Signaling | 9.25E+ 00 | 7.05E-02 | 1.89 | ATF3,ACTB,RPS23,RPLP2,RPL23,RPL10A,RPS28,EIF1,RPS13,RPS25,RPS15A,RPL36,RPL31,ACTC1,ACTA1,RPLP0 |
| RhoA Signaling | 3.74E+ 00 | 5.65E-02 | 1.633 | ARHGEF12,NRP2,CFL1,ACTB,ARPC3,ACTC1,ACTA1 |
| Regulation of Actin-based Motility by Rho | 4.63E+ 00 | 7.78E-02 | 1.342 | RHOB,CFL1,ACTB,ARPC3,GSN,ACTC1,ACTA1 |
| HMGB1 Signaling | 1.98E+ 00 | 3.60E-02 | 1.342 | HMGB1,CXCL8,KAT6B,RHOB,IFNGR2 |
| Integrin Signaling | 3.60E+ 00 | 4.11E-02 | 1.134 | ARF5,RHOB,ACTB,ARPC3,ZYX,CAPN2,GSN,ACTC1,ACTA1 |
| Signaling by Rho Family GTPases | 3.16E+ 00 | 3.57E-02 | 1.134 | GNB1,CDH1,ARHGEF12,RHOB,CFL1,ACTB,ARPC3,ACTC1,ACTA1 |
| ILK Signaling | 2.56E+ 00 | 3.55E-02 | 1.134 | PPP2CB,CDH1,RHOB,CFL1,ACTB,ACTC1,ACTA1 |
| Actin Cytoskeleton Signaling | 4.10E+ 00 | 4.29E-02 | 1 | ARHGEF12,WASF2,CFL1,ACTB,FGF23,ARPC3,GSN,ACTC1,ACTA1,IQGAP3 |
| Dopamine-DARPP32 Feedback in cAMP Signaling | 3.01E+ 00 | 4.27E-02 | 1 | PLCD3,PPP2CB,KCNJ2,KCNJ14,CHP1,PPP1R11,KCNJ3 |
| Death Receptor Signaling | 1.93E+ 00 | 4.30E-02 | 1 | NFKBIA,ACTB,ACTC1,ACTA1 |
| Tec Kinase Signaling | 1.64E+ 00 | 2.94E-02 | 1 | GNB1,RHOB,ACTB,ACTC1,ACTA1 |
| Sirtuin Signaling Pathway | 2.16E+ 00 | 2.74E-02 | 0.816 | SCNN1A,CXCL8,CDH1,SDHB,SLC2A1,TUBA4B,HMGCS2,ATP5F1E |
| Neuroinflammation Signaling Pathway | 1.12E+ 00 | 1.93E-02 | 0.447 | HMGB1,CXCL8,CHP1,IFNGR2,GLUL,KCNJ3 |
| Production of Nitric Oxide and Reactive Oxygen Species in Macrophages | 1.42E+ 00 | 2.58E-02 | −0.447 | PPP2CB,NFKBIA,RHOB,IFNGR2,PPP1R11 |
| Phospholipase C Signaling | 1.54E+ 00 | 2.46E-02 | −0.816 | GNB1,PLCD3,ARHGEF12,RHOB,FCGR2A,CHP1 |
| RhoGDI Signaling | 5.12E+ 00 | 5.65E-02 | −1 | GNB1,CDH1,ARHGEF12,WASF2,RHOB,CFL1,ACTB,ARPC3,ACTC1,ACTA1 |
| Gαq Signaling | 1.73E+ 00 | 3.11E-02 | −1 | GNB1,RGS2,NFKBIA,RHOB,CHP1 |
| IL-8 Signaling | 1.35E+ 00 | 2.46E-02 | −1 | GNB1,CXCL8,CDH1,RHOB,LASP1 |
z-score(+): upregulation in gorilla, whereas z-score(−): upregulation in humans