| Literature DB >> 31641148 |
Hao Xu1,2, Xiwen Xu1, Xihong Li1, Lei Wang1, Jiayu Cheng3, Qian Zhou4, Songlin Chen5.
Abstract
Vibrio harveyi is a major bacterial pathogen that causes fatal vibriosis in Chinese tongue sole (Cynoglossus semilaevis), resulting in massive mortality in the farming industry. However, the molecular mechanisms of C. semilaevis response to V. harveyi infection are poorly understood. Here, we performed transcriptomic analysis of C. semilaevis, comparing resistant and susceptible families in response to V. harveyi challenge (CsRC and CsSC) and control conditions (CsRU and CsSU). RNA libraries were constructed using 12 RNA samples isolated from three biological replicates of the four groups. We performed transcriptome sequencing on an Illumina HiSeq platform, and generating a total of 1,095 million paired-end reads, with the number of clean reads per library ranging from 75.27 M to 99.97 M. Through pairwise comparisons among the four groups, we identified 713 genes exhibiting significant differences at the transcript level. Furthermore, the expression levels were validated by real-time qPCR. Our results provide a valuable resource and new insights into the immune response to V. harveyi infection.Entities:
Mesh:
Year: 2019 PMID: 31641148 PMCID: PMC6805913 DOI: 10.1038/s41597-019-0231-2
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Accession numbers for each biological sample.
| Organism | analysis type | Sample name | Replicate | Group | Accession number (Sample) |
|---|---|---|---|---|---|
|
| RNA-sequencing | SU1 | Biological Replicate 1 | CsSU | GSM3619558 |
|
| RNA-sequencing | SU2 | Biological Replicate 2 | CsSU | GSM3619559 |
|
| RNA-sequencing | SU3 | Biological Replicate 3 | CsSU | GSM3619560 |
|
| RNA-sequencing | RU1 | Biological Replicate 1 | CsRU | GSM3619561 |
|
| RNA-sequencing | RU2 | Biological Replicate 2 | CsRU | GSM3619562 |
|
| RNA-sequencing | RU3 | Biological Replicate 3 | CsRU | GSM3619563 |
|
| RNA-sequencing | SC1 | Biological Replicate 1 | CsSC | GSM3619564 |
|
| RNA-sequencing | SC2 | Biological Replicate 2 | CsSC | GSM3619565 |
|
| RNA-sequencing | SC3 | Biological Replicate 3 | CsSC | GSM3619566 |
|
| RNA-sequencing | RC1 | Biological Replicate 1 | CsRC | GSM3619567 |
|
| RNA-sequencing | RC2 | Biological Replicate 2 | CsRC | GSM3619568 |
|
| RNA-sequencing | RC3 | Biological Replicate 3 | CsRC | GSM3619569 |
Fig. 1Overview of the experimental design. The flowchart represents RNA-Seq workflow and bioinformatics analysis workflow.
Fig. 2Visualization of qualities of C. semilaevis sequencing data. (a) Per base sequence quality. (b) Per sequence quality scores. (c) Per sequence GC content. (d) Per base N content.
Summary statistics for the sequencing data of the twelve samples.
| Sample name | Number of raw reads | Number of clean reads | clean bases | Error rate(%) | Q20(%) | Q30(%) | GC content(%) |
|---|---|---|---|---|---|---|---|
| SU1 | 93,383,974 | 87,594,930 | 13.14 G | 0.02 | 94.3 | 88.31 | 48.74 |
| SU2 | 104,672,142 | 98,188,152 | 14.73 G | 0.03 | 94.34 | 88.28 | 49.11 |
| SU3 | 80,095,718 | 75,276,260 | 11.29 G | 0.02 | 94.37 | 88.36 | 48.61 |
| RU1 | 85,660,884 | 80,441,096 | 12.07 G | 0.02 | 94.25 | 88.16 | 48.7 |
| RU2 | 91,134,342 | 85,816,620 | 12.87 G | 0.02 | 94.36 | 88.31 | 48.98 |
| RU3 | 91,226,452 | 85,555,254 | 12.83 G | 0.03 | 93.73 | 87.06 | 48.7 |
| SC1 | 101,900,430 | 95,811,584 | 14.37 G | 0.02 | 94.36 | 88.39 | 48.22 |
| SC2 | 104,216,082 | 97,740,946 | 14.66 G | 0.03 | 94.18 | 88.03 | 48.19 |
| SC3 | 100,320,038 | 93,866,088 | 14.08 G | 0.03 | 94.02 | 87.83 | 47.66 |
| RC1 | 106,581,728 | 99,971,142 | 15 G | 0.02 | 94.3 | 88.31 | 48.6 |
| RC2 | 105,878,506 | 99,234,478 | 14.89 G | 0.03 | 94.27 | 88.26 | 48.31 |
| RC3 | 100,828,908 | 95,019,216 | 14.25 G | 0.02 | 94.52 | 88.63 | 48.24 |
Statistics analysis of clean reads mapping onto reference genome.
| Sample name | Number of uniquely mapped reads | Percentage of uniquely mapped reads % | Number of multiply mapped reads | Percentage of multiply mapped reads % |
|---|---|---|---|---|
| SU1 | 60,754,188 | 69.36 | 1,605,970 | 1.83 |
| SU2 | 68,745,699 | 70.01 | 2,265,076 | 2.31 |
| SU3 | 52,489,583 | 69.73 | 1,378,661 | 1.83 |
| RU1 | 56,281,304 | 69.97 | 1,259,706 | 1.57 |
| RU2 | 60,241,555 | 70.2 | 1,469,368 | 1.71 |
| RU3 | 59,485,111 | 69.53 | 1,256,973 | 1.47 |
| SC1 | 66,388,773 | 69.29 | 1,753,415 | 1.83 |
| SC2 | 67,065,736 | 68.62 | 2,025,623 | 2.07 |
| SC3 | 64,134,171 | 68.33 | 1,305,320 | 1.39 |
| RC1 | 69,465,962 | 69.49 | 1,793,586 | 1.79 |
| RC2 | 68,779,401 | 69.31 | 1,908,332 | 1.92 |
| RC3 | 67,008,296 | 70.52 | 1,791,000 | 1.88 |
Fig. 3Quality assessment and comparisons of transcriptome data among the C. semilaevis groups. (a) Correlation matrix of the transcriptome data of all the samples. (b) Boxplot of FPKM distribution among the four groups. (c) Volcanoplot of differentially expressed genes (DEGs) distribution in the four pairwise comparisons. (d) Hierarchical cluster analysis of gene expression profiles of the four groups. (e) Venn diagram of the number of shared DEGs between contrasts. (f) Validation of differential expression of 24 genes from qPCR and RNA-Seq.
Primers of selected genes for qPCR validation.
| Gene | Forward Primer | Reverse Primer |
|---|---|---|
|
| TTCAAACTGGACTCGGTGGTTCT | CAGTTGTTGGTGGTGCTGCTAAT |
|
| TCGACGATGAGGCAAAGAGGATT | TTTCTTTGGTTTGCCACCCTGTC |
|
| TAAGCTGTATGTGCGCAGAGTCT | TTGCGGATGACCTTCAGGATCTT |
|
| TGCCAGAGGTGAATGGAATAGCA | AGTAGTCTCCCCCTGTCGTGTAT |
|
| ATCTCCCTGATCCTGACAACAGC | AATTGATCCTGCAGACCCTCGAA |
|
| TTCTGAGTGTGACAGGGGAACAG | GGCTGTATGAAGACACCCTCCAT |
|
| TGGCAAATTGACCAGAAATGCGT | TGTGGGGTGGGTATGTATCCAAC |
|
| GATCCAAATGCAGCCTTACGGAC | CTAGGATGAGGACACTGCCGTAC |
|
| GGGAGGCTTACTGCAAATTCCAC | CAAACGATAACTCCTGTGGTGGC |
|
| CCTCATCTCTCAGAGCACCAAGG | AGTTCTGACATCATCTCCTCGGC |
|
| AATGCACAAAGATGGCTTCCCAG | GGGAGACGAACAGAGGAATCACA |
|
| ACGCAGCCTACAGGAAGGTTATT | GTACGCTCTTGATGGTCCAGAGT |
|
| TGGCCATATACAACAGCACCAGA | GATGGGTAAAAGGGCTGCATGTC |
|
| GGGGAATGGTCTGATCAAGGTGA | CCTGACGGATGGCATAGATCTGT |
|
| CTGGCTGAGGCTAAGACTGAACT | CCAAACATGCACCTGAGACCATC |
|
| GGAGATGGAGATGATACCGGAGC | ACCCTGCATTCGTAGAGACCTTC |
|
| AGGTTTCGTAAGGAGTCCATCGG | AATCTTCCCACAGAGAACACCGT |
|
| TCCGTATCACCATTCTGCCTCAG | CCAGTCTCTTCTGCACTCGATCA |
|
| GACGCAGACACTCCCTCAAAATG | CCAGACAACAGGAGATGACCAGT |
|
| ACCAAAACAAGCTGCACCATCAA | ATCCACAGTTCCCTCCCAGATTG |
|
| GATCCAGGTTGTGTTGCCATCAG | TTTGTCGGAGGTGTAGACGTTGT |
|
| CACACGCCAAGTTTGAGGAGATC | CCATCAGCTTGACACCATCAACC |
|
| AGGATATGGCGATGGAGGGAATG | AAGATCTCAGTGCCTTGCTCGAT |
|
| ATGTTGTCAAGGTTTGCCAGTGG | GTCCTCTCCTCCATCACTTTCCC |
| Measurement(s) | messenger RNA |
| Technology Type(s) | RNA sequencing |
| Factor Type(s) | experimental condition • Phenotypic variability |
| Sample Characteristic - Organism | Cynoglossus semilaevis |
| Sample Characteristic - Environment | aquatic natural environment |