| Literature DB >> 28446772 |
Zhiwei Liao1, Quanyuan Wan1, Xueying Shang1, Jianguo Su2.
Abstract
Grass carp (Ctenopharyngodon idella) is an important economic species in freshwater aquaculture and its industry has been confined due to variety degeneration and frequent diseases. Marker-assisted selection is a feasible method for selective breeding of new varieties. Transcriptome data have greatly facilitated high-throughput single nucleotide polymorphism (SNP) marker discovery and phenotype association study. In this study, we gained a total of 25,981 and 5,775 high quality SNPs in two transcriptomes from individuals and cell lines, respectively. Comparative transcriptome analysis identified 413 and 832 grass carp reovirus (GCRV)-resistant-association SNPs as well as 1,381 and 1,606 GCRV-susceptible-association SNPs in individuals and cell lines, respectively. Integrated analysis indicated 22 genes with single SNP share common resistant/susceptible traits in two transcriptomes. Furthermore, we infected grass carp with GCRV, genotyping and association analyses were performed, and 9 in 22 SNPs were confirmed by PCR-RFLP. Meanwhile, mRNA expression profiles of 6 genes containing confirmed SNPs were examined by qRT-PCR. The results demonstrated that mRNA expressions were significant differences in resistant/susceptible individuals and cell lines. The present study develops an important strategy for high throughput screening of phenotype association genetic markers and the results will serve in grass carp breeding for GCRV resistance.Entities:
Mesh:
Year: 2017 PMID: 28446772 PMCID: PMC5430748 DOI: 10.1038/s41598-017-01338-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of the transcriptomes of C. idella in individuals and cell lines.
| Type | Raw base pairs (bp) | Clean base pairs (bp) | ≥Q20 | Raw reads | Clean reads | ≥Q20 | |
|---|---|---|---|---|---|---|---|
|
| Spleen (SS1) | 7095838055 | 6627953846 | 93.41 | 32859452 | 32535864 | 99.02 |
| Spleen (SR2) | 4215899019 | 3374188672 | 80.03 | 24450204 | 22841432 | 93.42 | |
| Kidney (KS3) | 5219216274 | 4406813074 | 84.43 | 19492284 | 17678754 | 90.70 | |
| Kidney (KR4) | 7631346510 | 7112896707 | 93.21 | 35183620 | 34903598 | 99.20 | |
|
| Control (C1) | 6139459750 | 5930031750 | 96.59 | 49115678 | 47440254 | 96.59 |
| Resistant (R2) | 6556782500 | 6341051750 | 96.71 | 52454260 | 50728414 | 96.71 | |
| Susceptible (S3) | 6427464500 | 6234584000 | 97.00 | 51419716 | 49876672 | 97.00 |
Note: Paired-end reads were generated in lengths of 2 × 250 bp and 2 × 125 bp in individuals and CIK cells by Illumina MiSeq and HiSeq2500, respectively. SS1 and SR2 represent susceptible and resistant spleen tissues, respectively. KS3 and KR4 represent susceptible and resistant head-kidney tissues, respectively. C1, R2 and S3 represent control, resistant and susceptible groups in cells, respectively. Q20: percentage is the proportion of nucleotides with a quality value ≥20 in reads.
Figure 1Statistical analyses and venn diagrams of SNPs, InDels and genes based on transcriptome datasets. (A) and (B) represent the substitutions and InDels in individuals, respectively. (C) and (D) represent the substitutions and InDels in cell lines, respectively. (E) Venn diagram describes the overlapped resistance/susceptibility-associated SNPs among SS1, SR2, KS3 and KR4 in individuals. (F) C1 represents SNPs from the transcriptome of control CIK cells, and R2 and S3 stands for SNPs in resistant and susceptible cell lines, respectively. (G) IR and IS represent genes containing resistance/susceptibility-associated SNPs in individual transcriptomes, respectively, CR and CS stand for genes containing resistance/susceptibility-associated SNPs in cell transcriptomes, respectively.
Figure 2Read depth and distribution of SNPs. (A) and (D) represent read depths at SNP positions in transcriptomes in individuals and cell lines, respectively. (B) and (C) stand for SNP distribution in resistant and susceptible individuals, respectively. (E) and (F) represent SNP distribution in resistant and susceptible cell lines, respectively.
Figure 3SNPs location in C. idella genome. 24 C. idella chromosomes are marked on the horizontal axis and the number of SNPs associated with resistance/susceptibility to GCRV is plotted on the vertical axis in individuals (A) and cell lines (B), respectively.
Figure 4GO analysis of the annotated genes with resistance/susceptibility-associated SNPs in individuals. Red columns represent GO terms of genes containing resistance-associated SNPs and blue columns stand for GO terms of genes containing susceptibility-associated SNPs. And vertical axes show the number and percentage of corresponding genes respectively.
Figure 5Top 20 KEGG pathways of genes containing SNPs associated with resistance/susceptibility to GCRV in individuals. Venn diagram describes the overlapped pathways between resistant and susceptible groups. The number of SNPs is shown behind the corresponding pathways.
Primers used in the experiment.
| Gene | Primer name | Forward primer (5′-3′) | Primer name | Reverse primer (5′-3′) | Temperature (°C) |
|---|---|---|---|---|---|
| CI01000000_08975466_08977875 | HNOF658 | GGTATGTAATCAACCTGTCTTCA | HNOR659 | GTTGATGTGGGCAGAGTCC | 54.5 |
| HNIF660 | ATTTAAAGGTTTTTTTTTTTTTGT | HNIR661 | GTTAAAGTTAAGAAAAAAAGAAAGAA | 54.5 | |
| CI01000012_12132327_12133572 | GSF582 | TCTGCTTGTAGAAACTGCTGAT | GSR583 | CATTGTGACTGAATCGCCT | 54.1 |
| CI01000020_05320359_05335741 | CRF673 | CCATCCCACCCGCTTCA | CRR674 | ATCGCTGTCGCCTCGGA | 60.0 |
| CI01000000_15829395_15841876 | AHF719 | GATTTCCATGCCAGATGTTGTCT | AHR720 | TCTACTTCTGGTGCTTTAATGTCTCC | 60.0 |
| CI01000001_04462781_04463560 | FBF588 | TGAATGACAAGATGGTGAGCAA | FBR589 | GTGAGTGAAACGAATAAACCGAA | 58.9 |
| CI01000001_05575469_05577709 | HYF715 | ATAAAGCAGCACCAAAAGG | HYR716 | TTCATTAATGTAATGAAGATCTCAC | 53.2 |
| CI01000001_10604980_10633286 | MLCF596 | CGAGTCAGTGGATGGAAAAG | MLCR597 | GGTTATGGATGGGTTAGAGACT | 54.6 |
| CI01000004_11432849_11451784 | YEF586 | TTTCCGTCTGAGATTGTTTGTT | YER587 | ACCAGATTGTCTCCCACCAG | 56.7 |
| CI01000004_15217903_15228632 | ELF679 | TTTAGATTGATTTCTGAAGGAT | ELR680 | TTTTACAATTGTCTACTTGAATATA | 50.9 |
| CI01000004_15326982_15360355 | SKF600 | GCATCCTGCCTGTCAACG | SKR601 | AGTTTGCTGTTGGGGTCG | 56.8 |
| CI01000006_12684154_12695960 | DNF711 | CAACAACGAGGACACCCC | DNR712 | TTTCTTCCCTTTTGTCTCTGC | 56.0 |
| CI01000012_07509419_07535325 | ZFF723 | TGGCTCTCCCAGTGCTCTA | ZFR724 | TTGCTGACGGAGGTTCTGT | 56.0 |
| CI01000013_04750261_04754885 | DEF592 | GATGCCAGTTCACTCTTTG | DER593 | ACACTGATGACGAAACTCTATT | 50.5 |
| CI01000013_04805165_04810980 | C7NF580 | GACTGAGAATGCTGTGAAAGATG | C7NR581 | CAATGAGGTGGGATTTTAGTGA | 56.7 |
| CI01000016_04557546_04579200 | ERF594 | CCCTTCTTCGTCCCTCCTA | ERR595 | CGGAGCTTCAGTGGGATAC | 55.8 |
| CI01000016_05878037_05885761 | FLF702 | AAATTTCCTCTTTGAACTTTTT | FLR703 | CTTTACCTCTGCCATCCAC | 52.5 |
| CI01000021_00349073_00352551 | ZHF654 | AGATTGTTGTTCTGGTGCCTGA | ZHR655 | ACGGTGGACTTCCTCTTGCTA | 59.0 |
| CI01000021_06435903_06449790 | NAF700 | CACTTCTTTTTCCACATCTG | NAR701 | TTTGGCCAATCGGATAG | 50.3 |
| CI01000024_01747187_01758168 | INF721 | CGATATCATTATAAATTGAATGATG | INR722 | CAGTATGCCTGTTGATAAAAGC | 54.5 |
| CI01000027_07709714_07744049 | MEF675 | TGTTTCATTAGGAGTCGGATT | MER676 | CGAGAGATAAACAAGTCCAAAG | 54.1 |
| CI01000030_07749038_07754607 | HIF584 | CAGCGTGTTCCCCTTATCAG | HIR585 | CAAAGTCTGAAACTGAACTCGGT | 58.1 |
| CI01000036_01250575_01252253 | CHF713 | GCTTTCTTAATGTGCCCGTCT | CHR714 | TCCTCCTCACCATCATTCCC | 59.0 |
| CI01000000_08975466_08977875 | HF731 | ACTGCGTGGTGGTCCAAAAC | HR732 | GCCGAACTGCGAGAAATAATC | 60.0 |
| CI01000004_11432849_11451784* | YF735 | ACAACTTCAACAGCCGCACA | YR736 | AGGGAAGGGGTTGCTCACA | 60.0 |
| CI01000012_12132327_12133572* | GF744 | GAAGAATCCTTTTGGGACGGTA | GR745 | TTCTCAGGGTAGACCTCATCCAG | 60.0 |
| CI01000013_04805165_04810980* | C7F739 | AAATCCCACCTCATTGCCTTA | C7R740 | TGAAGACGGGCTTGTTTGC | 60.0 |
| CI01000021_06435903_06449790* | NF741 | GAACCAAACGGAAACCATCAA | NR742 | CAACAAAATCAACCCCACAGC | 60.0 |
| CI01000036_01250575_01252253* | CF737 | CAGCGAGAATGCCATCTTGAC | CR738 | GGGATTTTGACCGTAGGATAGC | 60.0 |
Note: *Indicates the primers used in the qRT-PCR, other primers are used in the SNPs verification.
Figure 6Gene structure schematics and polymorphism site confirmation. 22 SNPs associated with resistance/susceptibility to GCRV in individuals were selected for SNP verification, and 9 SNPs were confirmed. (A–I) show the 9 proved SNPs, including Hnrpa, Yes, Zfyve26, Gsto, C7N1, Napi-llb2, Mef2d, Hiat and Chsg genes, respectively. Double slash, grey box and polygonal line represent ellipsis, exons and introns, respectively. The polymorphism sites are marked by vertical arrows in gene organization plots and the genotypes of SNPs are demonstrated on the corresponding electropherogram. M, Maker 1.
Distribution of the SNPs in resistant and susceptible groups.
| Gene | Locus | Genotype | Resistant | Susceptible |
| Allele | Resistant | Susceptible | OR (95% CI) |
|
|---|---|---|---|---|---|---|---|---|---|---|
| NO (%) | NO (%) | NO (%) | NO (%) | |||||||
| Hnrpa | 8975656 T/C | TT | 27 (90.0) | 30 (100.0) | 3.157 (0.075) | T | 54 (90.0) | 60 (100.0) | NA | 6.315 (0.011*) |
| TC | 0 (0.0) | 0 (0.0) | C | 6 (10.0) | 0 (0.0) | |||||
| CC | 3 (10.0) | 0 (0.0) |
|
|
| |||||
| Yes | 11446782 A/G | AA | 3 (10.0) | 5 (16.7) | 1.731 (0.421) | A | 25 (41.7) | 24 (40.0) | 1.071 (0.517–2.219) | 0.034 (0.853) |
| AG | 19 (63.3) | 14 (46.7) | G | 35 (58.3) | 36 (60.0) | |||||
| GG | 8 (26.7) | 11 (36.7) |
|
|
| |||||
| Gsto | 12133107 A/G | AA | 7 (23.3) | 7 (23.3) | 0.138 (0.933) | A | 33 (55.0) | 32 (53.3) | 1.069 | 0.034 (0.855) |
| AG | 19 (63.3) | 18 (60.0) | G | 27 (45.0) | 28 (46.7) | |||||
| GG | 4 (13.3) | 5 (16.7) |
|
|
| |||||
| C7N1 | 4807309 A/G | AA | 3 (10.0) | 4 (13.3) | 8.174 (0.017*) | A | 30 (50.0) | 22 (36.7) | 1.727 (0.833–3.582) | 2.172 (0.141) |
| AG | 24 (80.0) | 14 (46.7) | G | 30 (50.0) | 38 (63.3) | |||||
| GG | 3 (10.0) | 12 (40.0) |
|
|
| |||||
| Napi-llb2 | 6436410 A/G | AA | 1 (3.3) | 1 (3.3) | 5.612 (0.060) | A | 11 (18.3) | 20 (33.3) | 0.448 (0.192–1.046) | 3.523 (0.060) |
| AG | 9 (30.0) | 18 (60.0) | G | 49 (81.7) | 40 (66.7) | |||||
| GG | 20 (66.7) | 11 (36.7) |
|
|
| |||||
| Chsg | 1260503 A/G | AA | 0 (0.0) | 0 (0.0) | 0.077 (0.781) | A | 10 (17.2) | 9 (15.0) | 1.133 (0.424–3.023) | 0.062 (0.802) |
| AG | 10 (33.3) | 9 (30.0) | G | 48 (82.8) | 51 (85.0) | |||||
| GG | 20 (66.7) | 21 (70.0) |
|
|
|
Note: *The distributions of corresponding SNPs between resistant and susceptible groups are significantly different (P < 0.05). ‘NA’ indicates not available. P values for HWE test are shown with boldface.
Figure 7mRNA expression profiles in spleen tissue and CIK cells. Transcripts of 6 genes with more than one genotype in 9 genes with confirmed SNPs were quantified in resistant and susceptible groups in spleen (A) and CIK (B) by qRT-PCR (n = 3). 18 S rRNA and EF1α as reference genes were used in individuals and cells respectively. Fold changes of mRNA expressions were relative to corresponding susceptible samples (defined as 1). *P < 0.05; **P < 0.01; ***P < 0.001; ‘NS’, not significant, P > 0.05. The normalized absolute quantification FPKM values of corresponding genes in RNA-Seq datasets of spleen and CIK were demonstrated in (C) and (D), respectively.