| Literature DB >> 35886050 |
Yajun Wang1,2, Hang Sha2, Xiaohui Li2, Tong Zhou2, Xiangzhong Luo2, Guiwei Zou2, Yi Chai1, Hongwei Liang2.
Abstract
Hypophthalmichthys molitrix is one of the four most important fish in China and has high breeding potential. However, simple sequence repeat (SSR) markers developed on H. molitrix genome level for genetic diversity analysis are limited. In this study, the distribution characteristics of SSRs in the assembled H. molitrix genome were analyzed, and new markers were developed to preliminarily evaluate the genetic diversity of the four breeding populations. A total of 368,572 SSRs were identified from the H. molitrix genome. The total length of SSRs was 6,492,076 bp, accounting for 0.77% of the total length of the genome sequence. The total frequency and total density were 437.73 loci/Mb and 7713.16 bp/Mb, respectively. Among the 2-6 different nucleotide repeat types, SSRs were dominated by di-nucleotide repeats (204,873, 55.59%), and AC/GT was the most abundant motif. The number of SSRs on each chromosome was positively correlated with the length. The 13 pairs of markers developed were used to analyze the genetic diversity of four cultivated populations in Hubei Province. The results showed that the genetic diversity of the four populations was low, and the ranges of alleles (Na), effective alleles (Ne), observed heterozygosity (Ho), and Shannon's index information (I) were 3.538-4.462, 2.045-2.461, 0.392-0.450, and 0.879-0.954, respectively. Genetic variation occurs mainly among individuals within populations (95.35%). UPGMA tree and Bayesian analysis showed that four populations could be divided into two different branches. Therefore, the genome-wide SSRs were effectively in genetic diversity analysis on H. molitrix.Entities:
Keywords: Hypophthalmichthys molitrix; SSR; genetic diversity; genome-wide identification; population structure
Mesh:
Substances:
Year: 2022 PMID: 35886050 PMCID: PMC9320178 DOI: 10.3390/genes13071267
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Sample information of the cultured H. molitrix populations in this study.
| Population | Location | Coordinates | Sampling Time | Number |
|---|---|---|---|---|
| SS | Shishou, Hubei Province, China | 112°48′E,29°83′N | 2021 | 30 |
| WH | Wuhan, Hubei Province, China | 114°48′E,30°77′N | 2021 | 30 |
| XC | Xiaochang, Hubei Province, China | 113°59′E,30°76′N | 2021 | 30 |
| YW | Yaowan, Hubei Province, China | 112°31′E,30°26′N | 2021 | 30 |
Information of SSR markers analyzed in this study from H. molitrix.
| Locus | Motif | Primer (5′–3′) | Size/Bp | |
|---|---|---|---|---|
| P020 | (AATA)5 | F: CTGCTCACCTCAGCTCATCC | 60 | 152~176 |
| P021 | (ATAG)4 | F: GCGAGCGCATGTTTGATCAA | 60 | 130~140 |
| P030 | (AAC)5 | F: GGTATCTGCTCGCTGGATCC | 60 | 201~213 |
| P034 | (ATA)11 | F: GGGCGATGATCCCTGAATCC | 60 | 199~221 |
| P036 | (ATT)5 | F: GCTTGCTCAAGGGCACAATG | 60 | 230~243 |
| P037 | (ATTAT)4 | F: AGAGCACGTTCACCTCACTG | 60 | 230~273 |
| P041 | (TAG)8 | F: AGAGGGAGACACGGCTACAT | 60 | 212~240 |
| P043 | (ACA)9 | F: TCACATCCTGCAACAGGGTC | 60 | 231~250 |
| P054 | (TAA)6 | F: TTGTTCGCTCCTTGGAAGGT | 60 | 241~278 |
| P056 | (TATC)6 | F: CGACCTGCTAGCCCAAACAT | 60 | 280~296 |
| P058 | (GTTT)6 | F: AACTGTCTATGCGATGCCGT | 60 | 280~293 |
| P062 | (TGTTT)37 | F: ATGCTGGCGATATGTGGCAA | 60 | 296~305 |
| P086 | (TAA)5 | F: TATTGCAGTGGTCGGACACA | 60 | 322~364 |
Information of SSR repeats in H. molitrix.
| SSR Types | Total Counts | Total | Average Length (bp) | Frequency (loci/Mb) | Density | Percent (%) |
|---|---|---|---|---|---|---|
| Di- | 204,873 | 4,911,266 | 23.97 | 243.31 | 5832.79 | 55.59 |
| Tri- | 38,048 | 468,878 | 12.32 | 45.19 | 556.86 | 10.32 |
| Tetra- | 70,012 | 712,888 | 10.18 | 83.15 | 846.65 | 19.00 |
| Penta- | 44,921 | 330,472 | 7.36 | 53.35 | 392.48 | 12.19 |
| Hexa- | 10,718 | 68,572 | 30.83 | 12.73 | 84.38 | 2.90 |
| Total | 368,572 | 6,492,076 | 84.66 | 437.73 | 7713.16 | 100.00 |
Figure 1Motif proportions of different repeat types in H. molitrix genome. (a) The most abundant SSRs motifs in the H. molitrix genome; (b) mono-nucleotide repeat types, (c) tri-nucleotide repeat types, (d) tera-nucleotide repeat types, (e) penta-nucleotide repeat types, and (f) hexa-nucleotide repeat types.
The most abundant motif categories in the H. molitrix genome SSRs.
| Motif | Categories | Number | Frequency | Density | Length |
|---|---|---|---|---|---|
| Di- | AC | 89,924 | 106.80 | 2111.18 | 1,777,632 |
| AT | 83,122 | 98.72 | 3045.83 | 2,564,620 | |
| AG | 31,602 | 37.53 | 672.34 | 566,120 | |
| CG | 225 | 0.27 | 3.44 | 2894 | |
| Tri- | AAT | 24,409 | 28.99 | 365.86 | 308,054 |
| AAC | 4696 | 5.58 | 63.67 | 53,608 | |
| ATC | 2572 | 3.05 | 37.90 | 31,910 | |
| AAG | 2038 | 2.42 | 29.29 | 24,660 | |
| Tetra- | AAAT | 16,819 | 19.97 | 178.24 | 150,084 |
| AGAT | 14,755 | 17.52 | 208.83 | 175,838 | |
| Penta- | AAAAT | 12,660 | 15.04 | 111.11 | 93,554 |
| Hexa- | AAAAAT | 3384 | 4.02 | 24.32 | 20,474 |
Figure 2Different copy number distribution of H. molitrix SSRs.
Figure 3Analysis of SSR frequencies on H. molitrix chromosomes: (a) number of SSRs, (b) length of SSRs, (c) frequency of SSRs, and (d) density of SSRs.
Characteristics of 13 polymorphic SSR sites in H. molitrix.
| Locus | Na | Ne | Ho | He | I | PIC |
|---|---|---|---|---|---|---|
| P020 | 2 | 1.363 | 0.250 | 0.267 | 0.437 | 0.231 |
| P021 | 2 | 1.342 | 0.267 | 0.255 | 0.423 | 0.222 |
| P030 | 5 | 1.941 | 0.517 | 0.485 | 0.954 | 0.449 |
| P034 | 3 | 2.044 | 0.450 | 0.511 | 0.819 | 0.442 |
| P036 | 3 | 1.482 | 0.317 | 0.325 | 0.606 | 0.298 |
| P037 | 3 | 1.052 | 0.017 | 0.049 | 0.133 | 0.048 |
| P041 | 5 | 2.636 | 0.467 | 0.621 | 1.221 | 0.581 |
| P043 | 7 | 3.084 | 0.600 | 0.676 | 1.446 | 0.643 |
| P054 | 2 | 1.654 | 0.217 | 0.395 | 0.584 | 0.319 |
| P056 | 7 | 4.332 | 0.683 | 0.769 | 1.639 | 0.735 |
| P058 | 6 | 2.092 | 0.583 | 0.520 | 1.083 | 0.487 |
| P062 | 4 | 2.332 | 0.283 | 0.571 | 1.047 | 0.520 |
| P086 | 7 | 4.765 | 0.450 | 0.800 | 1.747 | 0.768 |
| Mean | 4.308 | 2.317 | 0.392 | 0.479 | 0.934 | 0.442 |
| St. Dev | 1.974 | 1.140 | 0.186 | 0.216 | 0.494 | 0.213 |
Statistical values of genetic diversity of 13 SSR sites in four H. molitrix populations.
| Population | N | Na | Ne | Ho | He | I | Fst |
|---|---|---|---|---|---|---|---|
| SS | 30 | 4.385 ± 0.583 | 2.387 ± 0.335 | 0.421 ± 0.071 | 0.469 ± 0.068 | 0.912 ± 0.520 | 0.084 ± 0.066 |
| WH | 30 | 4.077 ± 0.487 | 2.461 ± 0.330 | 0.450 ± 0.006 | 0.504 ± 0.063 | 0.954 ± 0.482 | 0.100 ± 0.068 |
| XC | 30 | 4.462 ± 0.489 | 2.334 ± 0.352 | 0.392 ± 0.075 | 0.451 ± 0.072 | 0.879 ± 0.525 | 0.103 ± 0.069 |
| YW | 30 | 3.538 ± 0.368 | 2.045 ± 0.386 | 0.392 ± 0.103 | 0.402 ± 0.067 | 0.900 ± 0.455 | 0.178 ± 0.132 |
Molecular variance (AMOVA) results of four H. molitrix populations.
| Source of Variation | df | Sun of Squares | Variance Components | Percentage of Variation/% | Fixation Index |
|---|---|---|---|---|---|
| Among populations | 3 | 29.704 | 0.12299 | 4.65 | |
| Within populations | 236 | 595.217 | 2.5221 | 95.35 | Fst = 0.04650 |
| Total variation | 239 | 623.921 | 2.64509 | 100 |
Genetic differentiation index (Fst) (above diagonal) and Nei’s genetic distance (below diagonal).
| SS | WH | XC | YW | |
|---|---|---|---|---|
| SS | 0.02718 | 0.00714 | 0.05402 | |
| WH | 0.0328 | 0.03796 | 0.08709 | |
| XC | 0.0157 | 0.0382 | 0.06319 | |
| YW | 0.0673 | 0.1092 | 0.0823 |
Figure 4UPGMA tree based on the genetic distance among four H. molitrix populations.
Figure 5Structure analysis of H. molitrix populations using genotype data from 13 SSR sites. (K = 2).