| Literature DB >> 25369077 |
Xuelin Zhao1, Hong Yu1, Lingfeng Kong1, Shikai Liu1, Qi Li1.
Abstract
Environmental salinity creates a key barrier to limit the distribution of most aquatic organisms. Adaptation to osmotic fluctuation is believed to be a factor facilitating species diversification. Adaptive evolution often involves beneficial mutations at more than one locus. Bivalves hold great interest, with numerous species living in waters, as osmoconformers, who maintain the osmotic pressure balance mostly by free amino acids. In this study, 107,076,589 reads from two groups of Crassostrea hongkongensis were produced and the assembled into 130,629 contigs. Transcripts putatively involved in stress-response, innate immunity and cell processes were identified according to Gene ontology and KEGG pathway analyses. Comparing with the transcriptome of C. gigas to characterize the diversity of transcripts between species with osmotic divergence, we identified 182,806 high-quality single nucleotide polymorphisms (SNPs) for C. hongkongensis, and 196,779 SNPs for C. gigas. Comparison of 11,602 pairs of putative orthologs allowed for identification of 14 protein-coding genes that experienced strong positive selection (Ka/Ks>1). In addition, 45 genes that may show signs of moderate positive selection (1 ≥ Ka/Ks>0.5) were also identified. Based on Ks ratios and divergence time between the two species published previously, we estimated a neutral transcriptome-wide substitution mutation rate of 1.39 × 10(-9) per site per year. Several genes were differentially expressed across the control and treated groups of each species. This is the first time to sequence the transcriptome of C. hongkongensis and provide the most comprehensive transcriptomic resource available for it. The increasing amount of transcriptome data on Crassostrea provides an excellent resource for phylogenetic analysis. A large number of SNPs identified in this work are expected to provide valuable resources for future marker and genotyping assay development. The analysis of natural selection provides an innovative view on the adaptation within species and sets the basis for future genetic and evolutionary studies.Entities:
Mesh:
Year: 2014 PMID: 25369077 PMCID: PMC4219811 DOI: 10.1371/journal.pone.0111915
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of the transcriptome assembly for C. hongkongensis and C. gigas.
|
|
| |||
| HT | HC | PT | PC | |
| Raw data | 61,096,062 | 62,643,682 | 13 719 859 | 15 354 006 |
| Clean data | 53 044 352 | 54 032 237 | 13 573 056 | 15 237 315 |
| Read length (bp) | 100 | 100 | 92 | 92 |
| N50 length of assembly (bp) | 1117 | 831 | ||
| Mean length of assembly (bp) | 645 | 419 | ||
| Total number of transcripts | 130,629 | 300,382 | ||
| Number of putative orthologs | 11,602 | |||
Figure 1The length distribution of contigs and coding sequences (CDSs) of C. hongkongensis.
Contigs were generated from de novo assembly of Illumina sequencing reads. The minimum length of contigs was limited to 200 bp and that of CDSs was limited to 100 bp.
Figure 2GO comparison between the C. hongkongensis and C. gigas transcriptome.
The C. gigas data was from a previous study by Zhao et al. (2012).
Figure 3Distribution of the KEGG pathways in both C. hongkongensis and C. gigas.
The bar chart shows the percentages of sequences that are assigned within different pathway categories.
Figure 4Classification of SNPs identified from the C. hongkongensis and C. gigas transcriptomes.
For both species, transitions occurred more frequently than transerversions. The overall frequency of all types of SNPs was one per 460 bp for C. hongkongensis and one per 188 bp for C. gigas.
Figure 5Distribution of GO terms between orthologs of C. hongkongensis and C. gigas.
Function annotation of the 754 ortholog pairs plotted as categories.
Figure 6Distribution of Ka/Ks ratio.
Ortholog pairs with Ka/Ks ratio >1 are above the grey line, while ortholog pairs with Ka/Ks ratio between 0.5–1 reside between the black and grey lines.
Identification of candidate genes under positive selection (Ka/Ks>1).
| Transcript ID | Ka/Ks ratio | Gene name | Gene Ontology |
| comp104168_c0_seq1 CGI_10007982 | 1.40 | Transient receptorpotential cationchannel subfamilyM member 3 | P: GO:0006816 calcium ion transport; P: GO:0055085 transmembrane transport; F: GO:0005262 calcium channel activity; C: GO:0016020 membrane; C: GO:0016021 integral component of membrane |
| comp26473_c0_seq1 CGI_10001634 | 1.27 | Tripartitemotif-containingprotein 2 | P: GO:0016567 protein ubiquitination; F: GO:0008270 - zinc ion binding; F: GO:0004842 ubiquitin-protein ligase activity; F: GO:0005515 protein binding; C: GO:0005622 intracellular; C: GO:0005737 cytoplasm |
| comp21733_c0_seq1 CGI_10006649 | 1.23 | Tripartitemotif-containingprotein 2 | P: GO:0016567 protein ubiquitination; F: GO:0008270 - zinc ion binding; F: GO:0004842 ubiquitin-protein ligase activity; F: GO:0005515 protein binding; C: GO:0005622 intracellular; C: GO:0005737 cytoplasm |
| comp26605_c0_seq1 CGI_10009487 | 1.10 | Tripartitemotif-containingprotein 2 | P: GO:0016567 protein ubiquitination; F: GO:0008270 - zinc ion binding; F: GO:0004842 ubiquitin-protein ligase activity; F: GO:0005515 protein binding; C: GO:0005622 intracellular; C: GO:0005737 cytoplasm |
| comp15269_c0_seq1 CGI_10015178 | 1.06 | Tripartitemotif-containingprotein 45 | F: GO:0008270 zinc ion binding; C: GO:0005622 intracellular; C: GO:0005737 cytoplasm |
| comp9941_c0_seq1 CGI_10019616 | 1.03 | Transcriptionfactor HES-1-B | P: GO:0000122 negative regulation of transcription from RNA polymerase II promoter; P: GO:0006351 transcription, DNA-templated; P: GO:0007219 Notch signaling pathway; F: GO:0003677 DNA binding; F: GO:0043425 bHLH transcription factor binding; F: GO:0046982 protein heterodimerization activity; C: GO:0005634 nucleus |
Only genes with known identities were shown. The full list of genes were provided in Table S1.