| Literature DB >> 28829433 |
Zhigang Lu1,2, Florian Sessler2, Nancy Holroyd2, Steffen Hahnel1, Thomas Quack1, Matthew Berriman2, Christoph G Grevelding1.
Abstract
RNA-Seq has proven excellence in providing information about the regulation and transcript levels of genes. We used this method for profiling genes in the flatworm Schistosoma mansoni. This parasite causes schistosomiasis, an infectious disease of global importance for human and animals. The pathology of schistosomiasis is associated with the eggs, which are synthesized as a final consequence of male and female adults pairing. The male induces processes in the female that lead to the full development of its gonads as a prerequisite for egg production. Unpaired females remain sexually immature. Based on an organ-isolation method we obtained gonad tissue for RNA extraction from paired and unpaired schistosomes, with whole adults included as controls. From a total of 23 samples, we used high-throughput cDNA sequencing (RNA-Seq) on the Illumina platform to profile gene expression between genders and tissues, with and without pairing influence. The data obtained provide a wealth of information on the reproduction biology of schistosomes and a rich resource for exploitation through basic and applied research activities.Entities:
Mesh:
Year: 2017 PMID: 28829433 PMCID: PMC5566097 DOI: 10.1038/sdata.2017.118
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Schematic overview of the experimental design.
All biological replicates (n=2 for sO and n=3 for all other samples) were obtained from the species Schistosoma mansoni, which was maintained in hamsters as final hosts. The separation of couples and the isolation of ovaries and testes were always performed at the same day to avoid any in vitro-culture effects. Total RNA was extracted by Trizol and its quality assessed by electropherogram analyses using an Agilent 2100 Bioanalyzer and Pico chips (Agilent Technologies). For each sample 100 ng of total RNA was used for synthesizing cDNA and generating libraries. Pair-end 100 bp sequencing was performed on Illumina HiSeq 2500 running two technical replicates for each sample. Raw reads were assessed using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) and mapped to Schistosoma mansoni reference genome (v5.0) using TopHat. Reads for each transcript were counted using HTSeq and imported to edgeR for normalization across all samples and differential expression analysis. Mean RPKM and s.e.m. values based on normalized reads were calculated and used for barplots. A transcript profile was generated for each gene, including the barplot, as well as the log2 fold-change (log2FC) and adjusted P (PAdjust) values based on various comparisons (see Smp_051920.2 as representative example, a nanos-ortholog that is abundantly transcribed in ovaries of unpaired females=sO).
List of primers used for reverse transcription and amplification of representative gene transcripts in real-time quantitative RT-PCR (qPCR) analyses.
| psm | Smp_073410 | GGTCTGGTGGTTTCTCGTTC | GTACCTTCTGTTGCCCGTG | 160 |
| rad50 | Smp_181450 | TACCCAAAGATCTGGATGCAGA | TGAGTGGTCTAACGCATACGG | 152 |
| melk | Smp_166150 | TCTCCAAGGGCTGTACCTGT | ATCAGACCCGAGCTTCCTCT | 133 |
| hmgcs | Smp_198690 | GATCCTGGACTCATGTTCGC | GTACATAGCTGCTGCCATTCC | 159 |
| meg13 | Smp_127990 | CAAATGGATATAACTTATAGTTGGTG | TCGTTTGTGCTTGTGGAAGTAC | 155 |
| vwa | Smp_127480 | TCATTATCGCTTCCATCTACCC | CAGCTGGATTATTGGTGACAGT | 118 |
| twik | Smp_147550 | GGATTTGGTGATATTGTACCAGG | TTTGTGACACCTATACGACGTC | 179 |
| sox | Smp_076600 | CCTTATACCACAGTTGTTGGTTC | TGTACGTTAGGTGCATCGGG | 137 |
| syt14 | Smp_150350 | CTGGTGGACCTTCAGCTTATC | CGACGATACGACAAATAAGCGT | 168 |
| cpeb1 | Smp_070360 | AGGTGGTGTTCCAATGCGTG | CTGCATAACAATAGCGTCCATC | 152 |
| dia | Smp_146810 | CCGAAGACTTGAAATACGTGAG | TGAAGATGGTGATGGCTCGC | 157 |
| EIF4G | Smp_008900 | GCAGGATTGAGTGACAGACG | AATCGTAAGCTTGGGACGAGA | 158 |
| letem1 | Smp_065110 | GAAGGTGATCAAGCTCCATTGT | TTGTACTGCATGGATAGGTGGT | 162 |
| lin9 | Smp_133660 | CCCAAGTTCTGTGAATGGCTG | ACGTGGTCTCCCTATGATCC | 177 |
| rgs7 | Smp_210800 | CAAGTACGTCAAGAGCAAATGAG | CGATGTGAAGGCAACAGTGG | 119 |
| elav | Smp_194950 | GCCATCATCACCATCGAGTC | TGCTTCCAACTGATGTATCACTG | 149 |
| cdc25 | Smp_152200 | TACGAATCTCCTAGATCAAGGC | ACGGAGGAAGGGAACTGTG | 137 |
Figure 2Representative results of sample RNA assessed by electropherogram analysis using an Agilent 2100 Bioanalyzer.
(a) RNA obtained from testes of sM; (b) RNA obtained from ovaries of sF; (c) RNA obtained from sM; (d) RNA obtained from sF.
Figure 3Density distribution RPKM values.
Densities were calculated using log2(RPKM+0.001) values based on non-normalized reads. Each color (blue/red/black) represents one biological replicate.
Figure 4Principal Component Analysis (PCA) and representative genes for each sample cluster.
(a) Reads for all genes were taken for PCA. Five clusters were obtained for all samples: 1) bM-sM-sF; 2) bF; 3) bT-sT; 4) sO; 5) bO. (b) Collection of exemplary genes found to be preferentially or specifically and pairing-dependently or -independently transcribed in either one of two adult clusters (e.g. cluster 1: Smp_124000, MEG-14; cluster 2: Smp_131110, Sm_p14), or within one of the three gonad clusters as preferentially transcribed (e.g. cluster 3: Smp_194950, ELAV2) or as specifically transcribed in one gonad type (e.g. cluster 4: Smp_051920.2, NANOS2; cluster 5: Smp_070360, CPEB1). Dark and light grey indicates high and low transcript abundance, respectively. Full names of gene symbols are as follows: MEG-14 (micro exon gene 14), VAL7 (venom allergen-like (VAL) 7 protein), ACHE (acetylcholinesterase), WNT (wnt family member), BMP (bone morphogenetic protein), Sm_p14 (eggshell protein Sm_p14), fs800 (female specific protein), TSPAN (tetraspanin CD63 receptor), RPA (Replication protein A), ESP (eggshell protein), ELAV2 (Embryonic Lethal, Abnormal Vision 2), CDC25 (M phase inducer phosphatase 3), SYT1 (synaptotagmin), SIX6 (homeobox protein SIX6), NEK7 (Serine/threonine kinase NEK7), NANOS (nanos 2), MYT1 (myelin transcription factor 1 protein), ERCC2 (DNA excision repair protein ERCC-2), FGFRb (fibroblast growth factor receptor), TJP1 (tight junction protein 1), CPEB (cytoplasmic polyadenylation element binding), SYT14 (synaptotagmin), Znf (zinc finger), CDH11 (protocadherin 11). bF, paired females; bM, paired males; bO, ovary of bF; bT, testis of bM; sF, unpaired females; sM, testis of sM; sO, ovary of sF; sT, unpaired males.
Figure 5Correlations between RNA-Seq and qPCR results.
Each dot indicates a single gene whose corresponding Smp-number can be found in Table 1. The selected genes include those with high (large RPKM and small Ct-values) and low transcript abundance (small RPKM and large Ct) in the gonad samples. Pearson´s correlation values are given in each case. Full names of gene symbols: psm (proteasome), rad50 (DNA double-strand break repair rad50 ATPase), melk (maternal embryonic leucine zipper kinase), hmgcs (Hydroxymethylglutaryl-CoA synthase), meg13 (micro exon gene 13), vwa (von Willebrand factor A), twik (twik family of potassium channels), sox (transcription factor SOX), syt14 (synaptotagmin XIV), cpeb1 (cytoplasmic polyadenylation element binding), dia (diaphanous), EIF4G (eukaryotic translation initiation factor 4 gamma), letem1 (LETM1 and EF hand domain containing protein 1), lin9 (protein lin9), rgs7 (regulator of G protein signaling), elav (embryonic lethal abnormal visual system), and cdc25 (M phase inducer phosphatase 3 cdc25).