| Literature DB >> 31923275 |
Fang-Chao Zhu1,2, Jin Sun3, Guo-Yong Yan1, Jiao-Mei Huang1,2, Chong Chen4, Li-Sheng He1.
Abstract
Diffusing fluid at a deep-sea hydrothermal vent creates rapid, acute physico-chemical gradients that correlate strongly with the distribution of the vent fauna. Two alvinocaridid shrimps, Alvinocaris longirostris and Shinkaicaris leurokolos occupy distinct microhabitats around these vents and exhibit different thermal preferences. S. leurokolos inhabits the central area closer to the active chimney, while A. longirostris inhabits the peripheral area. In this study, we screened candidate genes that might be involved in niche separation and microhabitat adaptation through comparative transcriptomics. The results showed that among the top 20% of overexpressed genes, gene families related to protein synthesis and structural components were much more abundant in S. leurokolos compared to A. longirostris. Moreover, 15 out of 25 genes involved in cellular carbohydrate metabolism were related to trehalose biosynthesis, versus 1 out of 5 in A. longirostris. Trehalose, a non-reducing disaccharide, is a multifunctional molecule and has been proven to act as a protectant responsible for thermotolerance in Saccharomyces cerevisiae. Putative positively selected genes involved in chitin metabolism and the immune system (lectin, serine protease and antimicrobial peptide) were enriched in S. leurokolos. In particular, one collagen and two serine proteases were found to have experienced strong positive selection. In addition, sulfotransferase-related genes were both overexpressed and positively selected in S. leurokolos. Finally, genes related to structural proteins, immune proteins and protectants were overexpressed or positively selected. These characteristics could represent adaptations of S. leurokolos to its microhabitat, which need to be confirmed by more evidence, such as data from large samples and different development stages of these alvinocaridid shrimps.Entities:
Mesh:
Year: 2020 PMID: 31923275 PMCID: PMC6953826 DOI: 10.1371/journal.pone.0227587
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Information on the de novo transcriptome assembly.
| Data filtering: | ||
| Number of read pairs | 33,324,253 | 39,266,315 |
| Number of clean read pairs | 27,651,484 (82.98%) | 32,360,581 (82.41%) |
| Trinity assembly: | ||
| Number of transcripts | 158,408 | 173,354 |
| Percent GC | 39.33% | 39.23% |
| Contig N50 | 1,460 | 1,198 |
| Median contig length (bp) | 356 | 340 |
| Average contig length (bp) | 757.75 | 677.35 |
| Number of unigenes | 129,409 | 143,754 |
Annotation of protein-coding sequences.
| Predicted ORFs | 28,782 | 34,390 |
| NCBI nr | 20,409 (70.9%) | 23,392 (68.0%) |
| Gene Ontology (GO) | 16,898 (58.7%) | 19,559 (56.9%) |
| KEGG Ortholog (KO) | 6,738 (23.4%) | 6,835 (19.9%) |
| Pfam | 14,863 (51.6%) | 16,680 (48.5%) |
| Annotated in all databases | 4,305 (15.0%) | 4,388 (12.8%) |
| Annotated at least in one database | 20,730 (72.0%) | 23,720 (69.0%) |
Fig 1Maximum-likelihood phylogenetic tree of alvinocaridid shrimps based on The sequences in red are from this study. Statistical supports is indicated as bootstrap values, and the values of less than 50 are omitted. Subfamilies are masked by different colours: Alvinocaridinae (blue), Mirocaridinae (yellow), Rimicaridinae (green). Heterocarpus ensifer (family: Pandalidae) serves as the outgroup. Accession numbers are labeled in parentheses. The tree scale bar represents the number of expected substitutions per site.
GO families with significant differences between A. longirostris and S. leurokolos.
| GO ID | GO terms | Gene Number | Gene Ratio (%) | |||
|---|---|---|---|---|---|---|
| Biological Process | ||||||
| GO:0006412 | translation | 164 | 270 | 2.847 | 3.925 | 0.001 |
| GO:0044262 | cellular carbohydrate metabolic process | 5 | 23 | 0.087 | 0.334 | 0.003 |
| GO:0032507 | maintenance of protein location in cell | 5 | 16 | 0.087 | 0.233 | 0.045 |
| Cellular Component | ||||||
| GO:0031012 | extracellular matrix | 10 | 26 | 0.174 | 0.378 | 0.032 |
| GO:0016021 | integral component of membrane | 469 | 647 | 8.141 | 9.405 | 0.013 |
| GO:0005840 | ribosome | 137 | 245 | 2.378 | 3.562 | 0.000 |
| GO:0005938 | cell cortex | 7 | 22 | 0.122 | 0.320 | 0.020 |
| Molecular Function | ||||||
| GO:0008484 | sulfuric ester hydrolase activity | 6 | 18 | 0.104 | 0.262 | 0.043 |
| GO:0016491 | oxidoreductase activity | 229 | 323 | 3.975 | 4.695 | 0.048 |
| GO:0008146 | sulfotransferase activity | 10 | 27 | 0.174 | 0.392 | 0.023 |
| GO:0005509 | calcium ion binding | 99 | 158 | 1.718 | 2.297 | 0.022 |
| GO:0008234 | cysteine-type peptidase activity | 56 | 44 | 0.972 | 0.640 | 0.036 |
| GO:0003950 | NAD+ ADP-ribosyltransferase activity | 16 | 7 | 0.278 | 0.102 | 0.021 |
a: indicates the gene number in each GO group.
b: indicates the gene percentage calculated as the gene number of each GO group divided by the total number of the top 20% of highly expressed genes.
c: indicates the value of Pearson’s Chi-square tests, where a p-value<0.05 indicates a significant difference.
Fig 2Validation of gene expression by absolute qPCR.
Six single-copy orthologues for each species were randomly selected and quantified in the cephalothorax of S. leurokolos (A), abdomen of S. leurokolos (B), cephalothorax of A. longirostris (C) and abdomen of A. longirostris (D). The histograms show the gene copy number per μl (mean ± SD) with three technical replicates quantified by qPCR. The line charts show the TPM value quantified with RSEM software. Gene names are indicated on the x-axis.
Enriched positively selected genes in S. leurokolos.
| Gene ID | E value | Nr annotation |
|---|---|---|
| sulfotransferase activity (GO:0008146) | ||
| S50007_c0_g1 | 1.87E-65 | protein-tyrosine sulfotransferase [ |
| S54388_c0_g1 | 4.00E-103 | sulfotransferase 1C4-like [ |
| S52224_c0_g2 | 3.24E-80 | galactosylceramide sulfotransferase-like [ |
| S51757_c0_g1 | 5.26E-50 | carbohydrate sulfotransferase 11-like [ |
| S53761_c0_g1 | 9.83E-68 | carbohydrate sulfotransferase 1-like [ |
| carbohydrate binding (GO:0030246) | ||
| S48621_c1_g10 | 4.57E-30 | lectin B isoform 2, partial [ |
| S49766_c0_g1 | 1.71E-85 | C-type lectin 3 [ |
| S43185_c0_g1 | 1.10E-08 | C-type lectin [ |
| S45873_c1_g1 | 1.97E-31 | C-type lectin 2 [ |
| S46700_c0_g1 | 3.15E-24 | C-type lectin-like domain-containing protein [ |
| S54525_c4_g1 | 3.55E-37 | lectin B isoform 2, partial [ |
| S45454_c0_g1 | 8.76E-17 | natterin-4-like [ |
| S53662_c1_g1 | 3.79E-101 | scavenger receptor C [ |
| chitin metabolic process (GO:0006030), extracellular region (GO:0005576) | ||
| S51908_c0_g1 | 0.00E+00 | chitinase 10 [ |
| S52856_c0_g4 | 1.34E-15 | peritrophin-44-like protein [ |
| S53991_c0_g1 | 0.00E+00 | mucin-3A [ |
| S54047_c0_g1 | 3.11E-07 | mucin-5AC, partial [ |
| S24173_c0_g1 | NA | NA |
| S51404_c1_g2 | NA | NA |
| S51697_c0_g1 | NA | NA |
| S48582_c0_g1 | NA | NA |
| molecular function regulator (GO:0098772) | ||
| S46156_c0_g2 | 2.35E-36 | antimicrobial peptide type 2 precursor IIc [ |
| S38510_c0_g1 | 4.08E-25 | antimicrobial peptide type 1 precursor Ie [ |
| S54779_c0_g10 | 2.81E-29 | antimicrobial peptide type 2 precursor IIc [ |
| S47957_c0_g1 | 2.26E-127 | serine/threonine-protein phosphatase 2A activator-like [ |
| S43281_c0_g1 | 1.46E-14 | serine/threonine-protein kinase samkC [ |
| S41765_c0_g3 | 3.18E-107 | alpha-2-macroglobulin [ |
| S1809_c0_g1 | 2.62E-10 | macroglobulin [ |
| S53590_c2_g2 | 9.01E-17 | annexin A11 isoform X1 [ |
| S51629_c0_g1 | 4.13E-178 | ADP-ribosylation factor GTPase-activating protein 2-like isoform X1 [ |
| S53169_c0_g1 | 0.00E+00 | PH and SEC7 domain-containing protein 1 isoform X1 [ |
| S21248_c0_g1 | 1.20E-20 | puratrophin-1, partial [ |
| S49552_c0_g1 | NA | NA |
| endopeptidase activity (GO:0004175) | ||
| S48676_c0_g1 | 5.55E-65 | trypsin-like serine proteinase [ |
| S53405_c0_g1 | 3.18E-15 | caspase [ |
| S49898_c0_g1 | 0.00E+00 | lon protease homolog 2, peroxisomal-like [ |
| S48935_c0_g1 | 8.25E-07 | CUB-serine protease [ |
| S42913_c0_g1 | 8.78E-35 | trypsin 3A1-like [ |
| S51196_c0_g1 | 7.32E-140 | serine protease [ |
| S45313_c0_g1 | 3.83E-37 | chymotrypsin-like protein [ |
| S53118_c0_g2 | 9.09E-59 | trypsin [ |
| S43774_c0_g1 | 4.44E-49 | tryptase-like [ |
| S50445_c0_g1 | 2.18E-62 | trypsin-1-like [ |
| S54451_c0_g2 | 3.42E-51 | SpAN-like protein, partial [ |
| S55389_c2_g5 | 2.90E-59 | protein SpAN-like isoform X2 [ |
| S54596_c0_g2 | 9.78E-95 | protein SpAN-like [ |
| S46490_c0_g1 | 1.24E-07 | ADAM metalloprotease, partial [ |
| extracellular region (GO:0005576) | ||
| S53590_c2_g2 | 9.01E-17 | annexin A11 isoform X1 [ |
| S27614_c0_g1 | 1.69E-45 | serine proteinase inhibitor 8 [ |
| S1809_c0_g1 | 2.62E-10 | macroglobulin [ |
| S43856_c0_g1 | 3.18E-12 | putative insulin-like protein growth factor binding protein [ |
| S51978_c0_g6 | 1.32E-34 | serum amyloid A-5 protein-like isoform X1 [ |
| S45464_c0_g1 | 1.04E-95 | putative endothelial lipase [ |
| S49552_c0_g1 | 1.13E-27 | oxidoreductase NAD-binding domain-containing protein 1 isoform X4 [ |
| S50357_c0_g1 | 7.96E-69 | fibrinogen C domain-containing protein 1-A-like isoform X1 [ |
| S38510_c0_g1 | 4.08E-25 | antimicrobial peptide type 1 precursor Ie [ |
| S46156_c0_g2 | 2.35E-36 | antimicrobial peptide type 2 precursor IIc [ |
| S54779_c0_g10 | 2.81E-29 | antimicrobial peptide type 2 precursor IIc [ |
| S3003_c0_g1 | 8.40E-13 | oxygenase [ |
| S51450_c0_g1 | 8.12E-64 | oxygenase [ |
| S46892_c0_g2 | 1.76E-45 | oxygenase [ |