| Literature DB >> 35923702 |
Baohua Duan1, Shumei Mu1, Yueqiang Guan1, Weibiao Liu1, Tongxu Kang1, Yana Cheng1, Zejian Li2, Yang Tian3, Xianjiang Kang1,4,5.
Abstract
P. trituberculatus is an economically important mariculture species in China. Evaluating its genetic diversity and population structure can contribute to the exploration of germplasm resources and promote sustainable aquaculture production. In this study, a total of 246,243 SSRs were generated by transcriptome sequencing of P. trituberculatus. Among the examined 254,746 unigenes, 66,331 had more than one SSR. Among the different SSR motif types, dinucleotide repeats (110,758, 44.98%) were the most abundant. In 173 different base repeats, A/T (96.86%), AC/GT (51.46%), and ACC/GGT (26.20%) were dominant in mono-, di-, and trinucleotide, respectively. GO annotations showed 87,079 unigenes in 57 GO terms. Cellular process, cell, and binding were the most abundant terms in biological process, cellular component, and molecular function categories separately. A total of 34,406 annotated unigenes were classified into 26 functional categories according to the functional annotation analysis of KOG, of which "general function prediction only" was the biggest category (6,028 unigenes, 17.52%). KEGG pathway annotations revealed the clustering of 34,715 unigenes into 32 different pathways. Nineteen SSRs were identified as polymorphic and, thus, used to assess the genetic diversity and structure of 240 P. trituberculatus individuals from four populations in the Bohai Sea. Genetic parameter analysis showed a similar level of genetic diversity within wild populations, and the cultured population indicated a reduction in genetic diversity compared with wild populations. The pairwise FST values were between 0.001 and 0.04 with an average of 0.0205 (p < 0.05), suggesting a low but significant level of genetic differentiation among the four populations. Structure analysis demonstrated that the four populations were classified into two groups including the cultured group and other populations. The phylogenetic tree and PCA revealed that a vast number of samples were clustered together and that cultivated individuals were distributed more centrally than wild individuals. The findings contribute to the further assessment of germplasm resources and assist to provide valuable SSRs for marker-assisted breeding of P. trituberculatus in the future.Entities:
Keywords: Bohai Sea; Portunus trituberculatus; SSR; genetic diversity; transcriptome
Year: 2022 PMID: 35923702 PMCID: PMC9340201 DOI: 10.3389/fgene.2022.932173
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Collection map of P. trituberculatus samples.
Characteristics of 19 SSR loci for P. trituberculatus.
| Locus | Primer sequences | Repeat motif | Anneal (°C) |
|---|---|---|---|
| PrMa01 | F:CCTTGCCTCGTCAGTGTCAT | (CTG)6 | 60 |
| R:TGGCTGTAGACACCCTCCAT | |||
| PrMa02 | F:AGAGCTGACCTCGCTTTGAC | (GTG)8 | 60 |
| R:TCCAGCTCCTCCTGTCCAAT | |||
| PrMa03 | F:CTTGATTGCCTCTCGCTTGT | (TG)10 | 60 |
| R:GGGGGAGAGGGAGAGAATGT | |||
| PrMa04 | F:TCCTGGACCTTGTTCAGTCC | (TCC)10 | 60 |
| R:GCAATCCCACACACACTCCT | |||
| PrMa05 | F:GCGTTGCGTGTACTGAAAGT | (TG)31 | 60 |
| R:GCGGCTCTGGTCAGGAATAC | |||
| PrMa06 | F:TCCTGCAACTTACATTCTTGGTC | (CA)15 | 60 |
| R:GTGTGCACAGGATACAGCCT | |||
| ZL05 | F:AGAATGTTGCCATGGCTGGA | (GGT)7 | 60 |
| R:ACCCTGTATCAGTGCGTTGG | |||
| ZL06 | F:CCCGCCCCTGTACATTTTCA | (TAT)10 | 60 |
| R:TGTTGGTAGGCTTGGTGGTC | |||
| ZL08 | F:GCTTCTGCTGCTGGTCCTTA | (CAAC)10 | 60 |
| R:ACCAGACATTGCTGAGCATG | |||
| DX05 | F:GTGGGCCGCCAATATCACTA | (TG)12 | 60 |
| R:AATCCACCACTTGCACCCAA | |||
| DX07 | F:CGTGCATCCGTGTGTTTGTT | (TG)10 | 60 |
| R:GCCATCTTTTCGCCGAGTTG | |||
| DX09 | F:TAGGCATGGGATGGGTGAGA | (CA)17 | 60 |
| R:CGGGAAGGAGTGTTGTTGAGT | |||
| DX10 | F:AATCACAACCCAGCCGCATA | (TG)12 | 60 |
| R:ACAACGAAGGAGAGATGCGG | |||
| DX14 | F:CCCGCTACCCCATAACTCAC | (GTG)7 | 60 |
| R:TCTTCCTCCCCACAGCCATA | |||
| DX15 | F:CGTCCCATCATCTGACAAAGG | (GAG)6 | 60 |
| R:TCCTTCACCTCTTCCTCTTTTCT | |||
| DX16 | F:GAGGCAAGCAAGTTAACCATTAG | (GT)7 | 60 |
| R:CTTCCTGGTTACCTCATCCTACC | |||
| DX19 | F:CACACTCGTTGCAGACACTACTT | (TG)11 | 60 |
| R:CTGTTACTTACTCGGTGCTTTGG | |||
| TRAN2 | F:TCACTACCACTACCGCTTTGTTT | (CAC)8 | 60 |
| R:GATGTCAGTAACGGGAGAGTGAG | |||
| TRAN3 | F:GCTGTTGTAGAAACCCATGAAAG | (GTG)7 | 60 |
| R:AGGGAGATACACGACCAACACTA |
Summary statistics for transcriptome sequencing of P. trituberculatus.
| Category | Number |
|---|---|
| Total raw reads | 661,922,456 |
| Total clean reads | 637,983,466 |
| Clean reads proportion (%), Q20 (%), Q30 (%) | 99.38, 96.3, 91.07 |
| Total number of unigenes examined | 254,746 |
| Mean length of unigenes (bp), N50 (bp) | 1,077, 1,936 |
| GC content (%) | 50.88 |
| Total amount of transcripts | 338,285 |
| Mean length of transcripts (bp), N50 (bp) | 879, 1,730 |
| Total size of examined sequences (bp) | 274,270,543 |
| Total number of identified SSRs | 246,243 |
| Number of SSR containing sequences | 132,908 |
| Number of sequences containing more than one SSR | 66,331 |
FIGURE 2Annotation of the P. trituberculatus transcriptome. (A) GO annotation and classification of unigenes. (B) KOG annotation and classification of unigenes. The x-axis indicates the categories, and the y-axis indicates the number of the unigenes.
FIGURE 3Function annotation and classification of P. trituberculatus unigenes in the KEGG category.
Genetic diversity parameters for 19 SSR loci.
| Locus |
|
|
|
|
|
| HWE |
|---|---|---|---|---|---|---|---|
| PrMa01 | 12 | 5.124 | 0.925 | 0.805 | 0.779 | 0.0000 | ** |
| PrMa02 | 11 | 7.354 | 0.532 | 0.864 | 0.849 | 0.0706 | ** |
| PrMa03 | 21 | 11.161 | 0.833 | 0.91 | 0.903 | 0.0486 | ** |
| PrMa04 | 12 | 6.672 | 0.808 | 0.85 | 0.834 | 0.0157 | ** |
| PrMa05 | 30 | 20.156 | 0.791 | 0.95 | 0.948 | 0.084 | ** |
| PrMa06 | 20 | 9.693 | 0.488 | 0.897 | 0.888 | 0.2086 | ** |
| ZL05 | 9 | 4.239 | 0.467 | 0.764 | 0.727 | 0.0991 | ** |
| ZL06 | 11 | 5.079 | 0.708 | 0.803 | 0.782 | 0.0788 | ** |
| ZL08 | 12 | 6.644 | 0.82 | 0.85 | 0.833 | 0.000 | ns |
| DX05 | 10 | 6.363 | 0.871 | 0.843 | 0.824 | 0.000 | ns |
| DX07 | 7 | 3.892 | 0.613 | 0.743 | 0.702 | 0.0254 | ** |
| DX09 | 21 | 12.708 | 0.774 | 0.921 | 0.916 | 0.0835 | ** |
| DX10 | 10 | 4.938 | 0.699 | 0.798 | 0.773 | 0.0525 | ** |
| DX14 | 8 | 4.052 | 0.525 | 0.753 | 0.714 | 0.0289 | ** |
| DX15 | 13 | 8.75 | 0.571 | 0.886 | 0.875 | 0.1829 | ** |
| DX16 | 7 | 4.222 | 0.679 | 0.763 | 0.727 | 0.0359 | ** |
| DX19 | 11 | 6.647 | 0.938 | 0.85 | 0.832 | 0.000 | ** |
| TRAN2 | 9 | 4.367 | 0.629 | 0.771 | 0.736 | 0.000 | ** |
| TRAN3 | 9 | 2.751 | 0.483 | 0.636 | 0.598 | 0.000 | ** |
| Mean | 12.79 | 7.095 | 0.692 | 0.824 | 0.802 |
Na, number of alleles; Ne, number of effective alleles; Ho, observed heterozygosity; H, expected heterozygosity; PIC, polymorphism information content; Fna, frequency of null alleles; HWE, Hardy–Weinberg equilibrium; **p < 0.01. ns, no deviations from HWE.
Mean genetic parameters of four P. trituberculatus populations.
| Population |
|
|
|
|
|
|---|---|---|---|---|---|
| HW | 11.421 | 5.572 | 0.706 | 0.756 | 0.728 |
| PL | 11.842 | 5.595 | 0.7 | 0.76 | 0.731 |
| QHD | 11.263 | 5.802 | 0.675 | 0.752 | 0.724 |
| HC | 8.737 | 4.507 | 0.688 | 0.716 | 0.679 |
Estimates of pairwise F values among the four P. trituberculatus populations.
| HW | HC | PL | QHD | |
|---|---|---|---|---|
| HW | — | |||
| HC | 0.040* | — | ||
| PL | 0.016* | 0.023* | — | |
| QHD | 0.020* | 0.023* | 0.001* | — |
*Significant difference (p < 0.05).
Analysis of molecular variance (AMOVA) from four P. trituberculatus populations.
| Source of variation |
|
|
| Variance component | Percentage of variation (%) |
|
|---|---|---|---|---|---|---|
| Among populations | 3 | 77.042 | 25.681 | 0.149 | 2 | 0.001 |
| Within populations | 476 | 3408.842 | 14.333 | 7.166 | 98 | 0.001 |
| Total | 479 | 3485.883 | 7.316 | 100 | 0.001 |
df, degrees of freedom; SS, sum of squares; MS, mean square.
FIGURE 4(A) Relationships between the number of clusters (K) and the corresponding. ΔK statistics calculated based on STRUCTURE analysis. (B) Population genetic structure based on the Bayesian clustering model among 240 P. trituberculatus samples at K = 2.
FIGURE 5PCA analyses based on SSR data for 240 individuals from 4 P. trituberculatus populations.
FIGURE 6Neighbor-joining phylogenetic tree of 240 individuals of P. trituberculatus based on SSR data.