| Literature DB >> 27336696 |
Juan Wang1,2, Dong-Xiu Xue1,2, Bai-Dong Zhang1,2,3, Yu-Long Li1,2,3, Bing-Jian Liu1,2,3, Jin-Xian Liu1,2.
Abstract
Next-generation sequencing and the collection of genome-wide single-nucleotide polymorphisms (SNPs) allow identifying fine-scale population genetic structure and genomic regions under selection. The spotted sea bass (Lateolabrax maculatus) is a non-model species of ecological and commercial importance and widely distributed in northwestern Pacific. A total of 22 648 SNPs was discovered across the genome of L. maculatus by paired-end sequencing of restriction-site associated DNA (RAD-PE) for 30 individuals from two populations. The nucleotide diversity (π) for each population was 0.0028±0.0001 in Dandong and 0.0018±0.0001 in Beihai, respectively. Shallow but significant genetic differentiation was detected between the two populations analyzed by using both the whole data set (FST = 0.0550, P < 0.001) and the putatively neutral SNPs (FST = 0.0347, P < 0.001). However, the two populations were highly differentiated based on the putatively adaptive SNPs (FST = 0.6929, P < 0.001). Moreover, a total of 356 SNPs representing 298 unique loci were detected as outliers putatively under divergent selection by FST-based outlier tests as implemented in BAYESCAN and LOSITAN. Functional annotation of the contigs containing putatively adaptive SNPs yielded hits for 22 of 55 (40%) significant BLASTX matches. Candidate genes for local selection constituted a wide array of functions, including binding, catalytic and metabolic activities, etc. The analyses with the SNPs developed in the present study highlighted the importance of genome-wide genetic variation for inference of population structure and local adaptation in L. maculatus.Entities:
Mesh:
Year: 2016 PMID: 27336696 PMCID: PMC4919078 DOI: 10.1371/journal.pone.0157809
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Statistics describing the distribution of different properties of each sequenced individual.
| Sample | Clean reads | Removed duplication reads | Clean duplication rate (%) | Digestion reads | Digestion ratio (%) |
|---|---|---|---|---|---|
| BHZL7 | 23,573,826 | 20,895,973 | 11.36 | 19,506,713 | 93.35 |
| BHZL2 | 6,514,378 | 5,793,497 | 11.07 | 5,482,926 | 94.64 |
| BHZL3 | 6,027,708 | 4,418,570 | 26.70 | 3,953,967 | 89.49 |
| BHZL4 | 5,528,681 | 3,421,002 | 38.12 | 3,028,070 | 88.51 |
| BHZL5 | 4,709,862 | 3,100,873 | 34.16 | 2,656,050 | 85.65 |
| BHGX10 | 6,489,759 | 4,272,832 | 34.16 | 4,164,049 | 97.45 |
| BHGX11 | 6,482,774 | 4,588,191 | 29.22 | 4,408,480 | 96.08 |
| BHGX4 | 5,181,443 | 4,856,074 | 6.28 | 4,498,246 | 92.63 |
| BHGX8 | 4,310,893 | 2,756,430 | 36.06 | 2,418,986 | 87.76 |
| BHGX9 | 6,484,100 | 5,895,274 | 9.08 | 5,555,381 | 94.23 |
| BHWS1 | 7,098,008 | 6,580,377 | 7.29 | 6,167,882 | 93.73 |
| BHZL1 | 6,532,443 | 5,994,190 | 8.24 | 5,590,782 | 93.27 |
| BHZL10 | 5,874,843 | 5,493,522 | 6.49 | 5,117,905 | 93.16 |
| BHZL11 | 6,479,435 | 5,987,951 | 7.59 | 5,625,429 | 93.95 |
| BHZL8 | 7,355,495 | 5,631,322 | 23.44 | 5,324,589 | 94.55 |
| BHZL9 | 6,482,580 | 5,930,894 | 8.51 | 5,584,640 | 94.16 |
| LNDD1 | 5,200,570 | 3,259,943 | 37.32 | 2,789,161 | 85.56 |
| LNDD11 | 3,436,429 | 2,303,697 | 32.96 | 1,880,963 | 81.65 |
| LNDD12 | 6,547,518 | 6,028,989 | 7.92 | 5,727,409 | 95.00 |
| LNDD13 | 2,701,201 | 1,680,911 | 37.77 | 1,287,849 | 76.62 |
| LNDD15 | 5,970,185 | 4,599,814 | 22.95 | 4,402,605 | 95.71 |
| LNDD16 | 6,543,661 | 5,877,104 | 10.19 | 5,704,539 | 97.06 |
| LNDD17 | 6,550,388 | 5,432,012 | 17.07 | 5,274,525 | 97.10 |
| LNDD18 | 6,525,016 | 5,226,347 | 19.90 | 5,125,077 | 98.06 |
| LNDD19 | 6,568,662 | 5,581,263 | 15.03 | 5,483,696 | 98.25 |
| LNDD20 | 6,489,129 | 4,913,880 | 24.28 | 4,734,983 | 96.36 |
| LNDD3 | 6,551,210 | 4,222,309 | 35.55 | 4,072,621 | 96.45 |
| LNDD5 | 6,561,516 | 5,604,561 | 14.58 | 5,427,559 | 96.84 |
| LNDD6 | 4,409,786 | 3,597,430 | 18.42 | 3,298,330 | 91.69 |
| LNDD7 | 7,101,922 | 5,113,206 | 28.00 | 4,896,347 | 95.76 |
Summary statistics of different properties of assembling into reference sequences.
| Feature | Value |
|---|---|
| (i) RAD-PE assembly statistics | |
| Total contig base (bp) | 113,529,353 |
| The number of contigs retained | 285,408 |
| Average contig length (bp) | 397 |
| N50 contig length (bp) | 509 |
| GC content (%) | 40.11 |
| (ii) Match statistics | |
| Mapping rate (%) | 90.54 |
| Average depth | 31.56 |
| Coverage (> 4×) (%) | 87.22 |
| SNP number | 217,531 |
Counts of putative loci after different filtering steps.
| Filtering No. | Feature | Value |
|---|---|---|
| Total number of SNPs | 1,184,075 | |
| i | Bi-allelic SNPs | 1,166,783 |
| ii | SNPs with quality score > 20 | 1,052,835 |
| iii | The average depth of reads > 4 and < 100 and > 80% coverage for each population | 109,307 |
| iv | A global minor allele frequency (MAF ≥ 0.05) in two populations | 64,008 |
| v | 42,733 | |
| vi | One SNP per contig | 22, 648 ( |
Fig 1Transitions and transversions occurring within a set of filtered SNPs.
Fig 2Graphical representation of outlier tests results.
(A) results from the LOSITAN. Above the top line is a 0.995 probability for being candidates of selection. A subset of the loci between the two lines is within 0.005–0.995 probability and is considered neutral. The remaining SNPs are conservatively considered undetermined. (B) results from BAYESCAN. The vertical line represents a false discovery threshold of 0.05. The candidate loci under directional selection are on the right side of the vertical line.
Fig 3Admixture analysis of L. maculatus based on all, neutral and outlier SNPs.
Each vertical line represents one individual, partitioned into segments according to admixture proportion of the spotted sea bass sampled from Dandong (green) and Beihai (red).
Fig 4The genetic relationships among individuals of Beihai and Dandong illustrated by the NetView P analysis.
Results of leave-one-out tests for individual assignment with four SNP panels.
| Populations | % Correct assignment | |||
|---|---|---|---|---|
| 298 outliers SNPs | 298 neutral SNPs | 20 outliers SNPs | 20 neutral SNPs | |
| Beihai | 100 | 100 | 93.8 | 93.8 |
| Dandong | 100 | 100 | 100 | 78.6 |
Characterization of 22 GO annotations obtained from Blast2Go analysis.
| Contig Name | Description | Length | Hits | min. E-value | mean Similarity | GOs | GOs | Enzyme Codes |
|---|---|---|---|---|---|---|---|---|
| 16835 | mitogen-activated protein kinase 14a-like | 476 | 20 | 4.79E-10 | 66.85% | 3 | F:nucleotide binding; F:protein serine/threonine kinase activity; P:protein phosphorylation | EC:2.7.11 |
| 352269 | hba1_cotgo ame: full = hemoglobin subunit alpha-1 ame: full = alpha-1-globin ame: full = hemoglobin alpha-1 chain | 509 | 5 | 2.59E-10 | 56.80% | 4 | C:hemoglobin complex; F:oxygen transporter activity; F:metal ion binding; P:oxygen transport | - |
| 2220874 | mrg morf4l-binding protein | 534 | 20 | 2.57E-11 | 62.15% | 2 | C:H4/H2A histone acetyltransferase complex; P:regulation of transcription, DNA-templated | - |
| 474018 | transcription factor | 522 | 2 | 4.02E-13 | 70.50% | 4 | C:nucleus; C:cytoplasm; F:sequence-specific DNA binding transcription factor activity; P:regulation of transcription from RNA polymerase II promoter | - |
| 3052505 | aryl hydrocarbon receptor nuclear translocator-like protein 2 isoform x1 | 553 | 20 | 2.55E-14 | 90.25% | 10 | C:nucleus; C:transcription factor complex; C:cytoplasm; F:DNA binding; F:sequence-specific DNA binding transcription factor activity; F:signal transducer activity; F:protein dimerization activity; P:transcription, DNA-templated; P:regulation of transcription, DNA-templated; P:signal transduction | - |
| 2031533 | agouti-related | 570 | 20 | 4.39E-16 | 71.30% | 3 | C:extracellular region; F:receptor binding; P:hormone-mediated signaling pathway | - |
| 1205767 | r-spondin-2- partial | 495 | 20 | 9.87E-17 | 89.90% | 3 | C:extracellular space; F:G-protein coupled receptor binding; P:positive regulation of canonical Wnt signaling pathway | - |
| 283613 | RNA-directed DNA polymerase from transposon BS | 586 | 20 | 1.93E-17 | 80.55% | 2 | F:RNA-directed DNA polymerase activity; P:RNA-dependent DNA replication | EC:2.7.7.49 |
| 427869 | stam-binding protein | 573 | 20 | 1.60E-17 | 77.35% | 4 | F:metallopeptidase activity; F:pyroglutamyl-peptidase activity; F:metal ion binding; P:proteolysis | EC:3.4.19 |
| 2441721 | insulin-like growth factor-binding protein 3 | 497 | 20 | 8.30E-20 | 88.95% | 10 | C:extracellular region; C:nucleus; F:insulin-like growth factor I binding; F:insulin-like growth factor II binding; P:skeletal system development; P:regulation of cell growth; P:negative regulation of BMP signaling pathway; P:otic vesicle formation; P:insulin-like growth factor receptor signaling pathway; P:pharyngeal system development | - |
| 2174006 | terminal uridylyltransferase 4 | 545 | 20 | 4.00E-21 | 100.00% | 4 | F:nucleic acid binding; F:zinc ion binding; F:nucleotidyltransferase activity; P:metabolic process | - |
| 3027366_2 | PREDICTED: uncharacterized protein LOC103908834 | 266 | 20 | 1.52E-21 | 63.45% | 1 | F:transferase activity, transferring glycosyl groups | - |
| 2645499 | protein fam50a | 368 | 20 | 2.00E-23 | 86.35% | 1 | C:nucleus | - |
| 1782285 | diacylglycerol kinase zeta isoform x1 | 520 | 20 | 3.33E-29 | 76.95% | 3 | F:nucleotide binding; F:kinase activity; P:signal transduction | - |
| 136359 | Golgi apparatus protein 1 | 518 | 20 | 8.27E-31 | 98.20% | 1 | C:Golgi membrane | - |
| 1075810 | Golgi apparatus protein 1 | 570 | 20 | 3.47E-32 | 79.45% | 1 | C:Golgi membrane | - |
| 2722643 | spatacsin | 484 | 20 | 5.36E-33 | 79.30% | 1 | P:axonogenesis | - |
| 1699444 | RNA-directed DNA polymerase from mobile element jockey | 578 | 20 | 9.55E-36 | 61.60% | 2 | F:RNA-directed DNA polymerase activity; P:RNA-dependent DNA replication | EC:2.7.7.49 |
| 1733564 | probable e3 ubiquitin-protein ligase herc1 | 564 | 20 | 1.95E-39 | 97.25% | 3 | F:ubiquitin-protein transferase activity; F:ligase activity; P:protein ubiquitination | - |
| 628717 | zinc finger protein 423-like isoform x3 | 580 | 20 | 1.58E-41 | 90.95% | 2 | F:nucleic acid binding; F:metal ion binding | - |
| 1450955 | RNA-directed DNA polymerase from mobile element jockey- partial | 626 | 20 | 1.08E-79 | 82.40% | 2 | F:RNA-directed DNA polymerase activity; P:RNA-dependent DNA replication | EC:2.7.7.49 |
| 2312120 | reverse transcriptase-like protein | 461 | 20 | 2.90E-83 | 68.85% | 2 | F:RNA-directed DNA polymerase activity; P:RNA-dependent DNA replication | EC:2.7.7.49 |
Fig 5Gene ontology assignment plot.
The plot shows GO of candidate genes for adaptive differentiation.