| Literature DB >> 32370137 |
Xiang Li1, Xiaoting Liu1, Jiatong Wei1, Yan Li1, Mulualem Tigabu2, Xiyang Zhao1.
Abstract
Pinus koraiensis has significant economic and ecological value in Northeast China. However, due to the lack of suitable molecular markers, only a few available microsatellite markers were developed for further population genetics studies. In this study, for the first time we developed expressed sequence tag-simple sequence repeat (EST-SSR) markers from the cold-stressed transcriptome of P. koraiensis using Illumina Sequencing. We identified a total of 7,235 EST-SSRs from 97,376 sequences, and we tested their transferability among seven related Pinus species. The results showed that trinucleotides were the most abundant type of repeat (1287, 18.74%) excluding mononucleotides, followed by dinucleotides (1284, 18.7%) and tetranucleotides (72, 1.05%). The most dominant dinucleotides and trinucleotide repeat motifs were AT/AT (535, 7.79%) and AAT/ATT (103, 1.5%). The observed heterozygosity (Ho) and expected heterozygosity (He) ranged from 0.002 to 0.986 and 0.017 to 0.743, respectively, and the polymorphism information content (PIC) values and number of alleles (Na) varied from 0.029 to 0.794 and 2 to 23, respectively. A total of 8 natural P. koraiensis populations were divided into two main genetic clusters. Furthermore, nine of twenty polymorphic primer pairs were successfully amplified in seven Pinus species, and at least 80% of the successful P. koraiensis EST-SSR primers could be amplified in more than four species (16, 80%). Combined results for the development of EST-SSR markers in P. koraiensis and transferability among related species would contribute to improved studies on the genetic diversity and population structure in P. koraiensis and phylogenetic relationships among Pinus species. They would also provide a significant source for quantitative trait locus analysis.Entities:
Keywords: Pinus koraiensis; RNA-Seq; interspecific transferability; microsatellite markers; population structure
Mesh:
Substances:
Year: 2020 PMID: 32370137 PMCID: PMC7291311 DOI: 10.3390/genes11050500
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Distribution of the collected samples.
Summary of analyses of expressed sequence tag–simple sequence repeat (EST-SSRs) in Pinus koraiensis.
| Item | Parameters | Number |
|---|---|---|
| EST-SSR | Total number of sequences examined | 97,376 |
| Total size of examined sequences (bp) | 75,061,632 | |
| Total number of identified SSRs | 7235 | |
| Number of sequences containing SSRs | 6656 | |
| Number of sequences containing more than one SSR | 509 | |
| Number of SSRs present in compound formation | 354 |
The number of repeats in each different motif length in Pinus koraiensis.
| Number of Repeats | Mono- | Di- | Tri- | Tetra- | Penta- | Hexa- | Total | Percentage (%) |
|---|---|---|---|---|---|---|---|---|
| 5 | 815 | 53 | 21 | 40 | 929 | 13.53 | ||
| 6 | 420 | 260 | 16 | 4 | 9 | 709 | 10.32 | |
| 7 | 250 | 92 | 2 | 2 | 346 | 5.04 | ||
| 8 | 144 | 57 | 1 | 2 | 204 | 2.97 | ||
| 9 | 104 | 29 | 133 | 1.94 | ||||
| 10 | 1777 | 68 | 19 | 1864 | 27.14 | |||
| 11 | 873 | 57 | 7 | 937 | 13.64 | |||
| 12 | 485 | 44 | 1 | 530 | 7.72 | |||
| 13 | 302 | 38 | 5 | 345 | 5.02 | |||
| 14 | 214 | 34 | 2 | 250 | 3.64 | |||
| 15 | 137 | 28 | 165 | 2.4 | ||||
| 16 | 97 | 19 | 116 | 1.69 | ||||
| 17 | 68 | 13 | 81 | 1.18 | ||||
| 18 | 47 | 30 | 77 | 1.12 | ||||
| 19 | 35 | 23 | 58 | 0.85 | ||||
| 20 | 32 | 12 | 44 | 0.64 | ||||
| 21 | 16 | 16 | 0.23 | |||||
| 22 | 9 | 9 | 0.13 | |||||
| 23 | 8 | 8 | 0.12 | |||||
| 24 | 11 | 11 | 0.16 | |||||
| 25 | 5 | 5 | 0.07 | |||||
| 26 | 5 | 5 | 0.07 | |||||
| Others | 26 | 26 | 0.38 | |||||
| Total | 4147 | 1284 | 1287 | 72 | 25 | 53 | 6868 | 100 |
| Percentage (%) | 60.38 | 18.7 | 18.74 | 1.05 | 0.36 | 0.77 |
Frequencies of different repeat motifs in SSRs in Pinus koraiensis.
| Repeats | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16+ | Total | Percentage | Rank |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| C/G | 23 | 10 | 5 | 4 | 1 | 2 | 12 | 57 | 0.83 | 21 | |||||
| A/T | 1754 | 863 | 480 | 298 | 213 | 135 | 347 | 4090 | 59.55 | 1 | |||||
| CG/CG | 1 | 1 | 2 | 0.03 | 40 | ||||||||||
| GC/GC | 2 | 1 | 3 | 0.04 | 38 | ||||||||||
| AC/GT | 20 | 15 | 10 | 5 | 1 | 1 | 3 | 1 | 1 | 1 | 58 | 0.84 | 18 | ||
| CA/TG | 35 | 15 | 8 | 5 | 4 | 1 | 2 | 2 | 72 | 1.05 | 13 | ||||
| TA/TA | 108 | 81 | 44 | 33 | 27 | 25 | 9 | 8 | 11 | 8 | 40 | 394 | 5.74 | 3 | |
| AT/AT | 134 | 90 | 56 | 54 | 33 | 27 | 26 | 24 | 18 | 19 | 54 | 535 | 7.79 | 2 | |
| GA/TC | 58 | 21 | 6 | 4 | 2 | 1 | 3 | 1 | 96 | 1.40 | 7 | ||||
| AG/CT | 62 | 26 | 20 | 3 | 2 | 2 | 3 | 1 | 4 | 1 | 124 | 1.81 | 5 | ||
| CTA/TAG | 3 | 3 | 0.04 | 38 | |||||||||||
| GTA/TAC | 4 | 1 | 1 | 6 | 0.09 | 36 | |||||||||
| CCG/CGG | 12 | 2 | 2 | 16 | 0.23 | 30 | |||||||||
| ACT/AGT | 2 | 2 | 0.03 | 40 | |||||||||||
| CGC/GCG | 11 | 4 | 1 | 2 | 18 | 0.26 | 29 | ||||||||
| ACG/CGT | 2 | 1 | 2 | 5 | 0.07 | 37 | |||||||||
| CGA/TCG | 3 | 2 | 1 | 1 | 1 | 8 | 0.12 | 35 | |||||||
| GAC/GTC | 7 | 2 | 9 | 0.13 | 33 | ||||||||||
| GCC/GGC | 8 | 1 | 9 | 0.13 | 33 | ||||||||||
| AGG/CCT | 35 | 13 | 4 | 3 | 1 | 1 | 1 | 58 | 0.84 | 18 | |||||
| GGA/TCC | 26 | 16 | 3 | 45 | 0.66 | 24 | |||||||||
| CAG/CTG | 58 | 18 | 7 | 2 | 1 | 1 | 1 | 88 | 1.28 | 8 | |||||
| GCA/TGC | 43 | 20 | 8 | 5 | 4 | 1 | 81 | 1.18 | 10 | ||||||
| AGC/GCT | 44 | 8 | 4 | 5 | 1 | 1 | 63 | 0.92 | 16 | ||||||
| CTC/GAG | 41 | 8 | 6 | 3 | 58 | 0.84 | 18 | ||||||||
| ATG/CAT | 30 | 9 | 1 | 2 | 1 | 1 | 44 | 0.64 | 25 | ||||||
| ACC/GGT | 6 | 5 | 1 | 1 | 13 | 0.19 | 32 | ||||||||
| ATA/TAT | 29 | 18 | 6 | 2 | 2 | 57 | 0.83 | 21 | |||||||
| ACA/TGT | 17 | 3 | 2 | 3 | 1 | 1 | 2 | 29 | 0.42 | 28 | |||||
| AAC/GTT | 22 | 3 | 2 | 3 | 1 | 1 | 32 | 0.47 | 27 | ||||||
| CAC/GTG | 9 | 4 | 1 | 1 | 15 | 0.22 | 31 | ||||||||
| TAA/TTA | 45 | 19 | 7 | 5 | 2 | 1 | 1 | 80 | 1.16 | 11 | |||||
| CCA/TGG | 25 | 5 | 3 | 2 | 1 | 2 | 1 | 39 | 0.57 | 26 | |||||
| CAA/TTG | 33 | 13 | 7 | 2 | 3 | 1 | 59 | 0.86 | 17 | ||||||
| TCA/TGA | 38 | 15 | 2 | 3 | 4 | 2 | 64 | 0.93 | 15 | ||||||
| ATC/GAT | 37 | 8 | 1 | 46 | 0.67 | 23 | |||||||||
| AAT/ATT | 65 | 12 | 14 | 6 | 2 | 2 | 1 | 1 | 103 | 1.50 | 6 | ||||
| AAG/CTT | 44 | 14 | 3 | 3 | 1 | 3 | 1 | 69 | 1.00 | 14 | |||||
| AGA/TCT | 53 | 19 | 2 | 5 | 1 | 80 | 1.16 | 11 | |||||||
| GAA/TTC | 60 | 17 | 3 | 4 | 3 | 1 | 88 | 1.28 | 8 | ||||||
| Others | 114 | 29 | 4 | 3 | 150 | 2.18 | 4 | ||||||||
| Total | 924 | 709 | 347 | 206 | 133 | 1865 | 938 | 530 | 345 | 250 | 165 | 456 | 6868 | 100 |
Characteristics of 20 EST-SSR loci in 80 unique genotypes in Pinus koraiensis.
| Locus | Primer Sequence (5′–3′) | Motif | Tm (°C) | Size (bp) | Na | Ne | I | Ho | He | PIC |
|---|---|---|---|---|---|---|---|---|---|---|
| NFPK-218 | F:AGTGGAACGAATTTGAACCG R:GGGCTTTGAAACAGGTGAAA | (TC)6 | 60 | 188 | 4 | 1.073 | 0.108 | 0.030 | 0.059 | 0.380 |
| NFPK-40 | F:TCGCTCTCTTCTTGACCACA R:CCGCTACTTCATCAGGGTTC | (TGA)6 | 60 | 196 | 4 | 1.070 | 0.119 | 0.040 | 0.061 | 0.063 |
| NFPK-32 | F:AAATGGACGAAGTTGGATGG R:CTCAGTGTCTTCAGGCAGGA | (GCT)6 | 59 | 197 | 6 | 2.052 | 0.759 | 0.603 | 0.510 | 0.582 |
| NFPK-53 | F:TGGAGATGCAGCAGATTAGG R:CTGCACACAGGATGTCACAA | (ATG)6 | 59 | 197 | 2 | 1.122 | 0.086 | 0.020 | 0.062 | 0.375 |
| NFPK-65 | F:ATGGGTATGGTGTTGGAAGG R:CTGGAGGAGCAAAATCGTGT | (TGC)6 | 59 | 199 | 11 | 1.451 | 0.491 | 0.325 | 0.281 | 0.588 |
| NFPK-71 | F:TTGGTGAGGATTGGTTCGAT R:CAAACTTCCGATTCGAGTGA | (AAG)6 | 60 | 200 | 5 | 1.957 | 0.725 | 0.622 | 0.484 | 0.405 |
| NFPK-117 | F:GCCCAATGGATGTGTCTCTT R:TCGGCCTGCAATTAGTCTCT | (TC)12 | 60 | 203 | 8 | 2.609 | 1.047 | 0.986 | 0.610 | 0.614 |
| NFPK-72 | F:ATCACCGCTGCCTTTCAGTA R:TCACTTCCCCAATCAATTCC | (ATA)8 | 60 | 204 | 3 | 1.045 | 0.070 | 0.014 | 0.038 | 0.387 |
| NFPK-67 | F:TGACCACTTCAGGCTTCTGAT R:ATGGCATCTGCTCTTTTTGC | (TGC)6 | 59 | 208 | 3 | 2.037 | 0.784 | 0.536 | 0.475 | 0.543 |
| NFPK-34 | F:AACCCACAGAAAGCTGAGGA R:CACCCCTGAACAGAGAGGAG | (TAA)6 | 60 | 221 | 2 | 0.894 | 0.032 | 0.018 | 0.017 | 0.032 |
| NFPK-43 | F:ATGCAGGGTTTGCAATACAG R:AATACGAGCACCGCGTTATC | (GAG)6 | 60 | 227 | 4 | 1.078 | 0.105 | 0.038 | 0.054 | 0.069 |
| NFPK-38 | F:TGATGGTGTGGTGAGGGTTA R:AGCGTGGGAGGAGTGTGTAG | (AAG)6 | 60 | 229 | 5 | 1.273 | 0.269 | 0.109 | 0.168 | 0.266 |
| NFPK-150 | F:AAATAACGGGGCTGTGTGTC R:ACGGATGTTGTAATCCCCAA | (GA)6 | 60 | 241 | 2 | 1.034 | 0.059 | 0.032 | 0.030 | 0.029 |
| NFPK-145 | F:ATGCGGAGGGATCAATTCTA R:CCAAGGCGCATCAATATTTC | (TA)6 | 60 | 241 | 14 | 2.985 | 0.989 | 0.296 | 0.513 | 0.794 |
| NFPK-175 | F:AAGGTCACGGCGTTCATTAC R:CCTGTGACCTCAACTGGGAT | (GA)6 | 60 | 259 | 5 | 1.075 | 0.133 | 0.069 | 0.065 | 0.074 |
| NFPK-181 | F:CTAAAGCGCTCAACCCAGAC R:GGACCACAGCGTGTTAGGAT | (AT)6 | 60 | 261 | 6 | 1.492 | 0.438 | 0.160 | 0.248 | 0.456 |
| NEPK-179 | F: CCAAGCCAGGTAAGGCACTA R: TGGACAAGGGAGATGAGACA | (CA)10 | 60 | 246 | 9 | 3.393 | 1.506 | 0.111 | 0.705 | 0.659 |
| NEPK-168 | F: CGGCTGTTCTGTTCCACATA R: GCCTTTGCAGTAGGATCGAG | (TA)11 | 60 | 261 | 23 | 3.884 | 2.063 | 0.193 | 0.743 | 0.723 |
| NFPK-213 | F:ATGTGTCACCACCCCTCATT R:ATGAGTGCGGCCTAAAGAGA | (TC)6 | 60 | 274 | 10 | 2.625 | 1.046 | 0.802 | 0.571 | 0.636 |
| NFPK-184 | F:AAGTCTCCACTGCATCAACCTT R:TGTCTCCCAACTTCCTGCTT | (TC)8 | 60 | 275 | 3 | 2.092 | 0.757 | 0.967 | 0.518 | 0.400 |
| Mean | 6.45 | 1.812 | 0.579 | 0.299 | 0.311 | 0.404 | ||||
| Total | 129 |
Na: number of alleles; Ne: number of effective alleles; Ho: observed heterozygosity; He: expected heterozygosity; I: information index; and PIC: polymorphic information content.
Transferability in related species of Pinus using 20 EST-SSR primers developed from the cold-stressed transcriptome of Pinus koraiensis.
| Locus |
|
|
|
|
|
|
| Total |
|---|---|---|---|---|---|---|---|---|
| NFPK-218 | + | + | + | + | - | - | - | 4 |
| NFPK-40 | + | + | + | + | + | + | + | 7 |
| NFPK-32 | + | + | + | + | + | + | + | 7 |
| NFPK-53 | + | + | + | + | + | + | + | 7 |
| NFPK-65 | + | + | + | + | + | - | - | 5 |
| NFPK-71 | + | + | + | + | + | - | - | 5 |
| NFPK-117 | + | + | + | + | + | - | + | 6 |
| NFPK-72 | + | + | + | + | + | + | + | 7 |
| NFPK-67 | + | + | + | + | + | - | - | 5 |
| NFPK-34 | - | - | - | - | - | - | - | 0 |
| NFPK-43 | + | + | + | + | + | + | + | 7 |
| NFPK-38 | + | + | + | + | + | + | + | 7 |
| NFPK-150 | + | + | + | + | + | + | + | 7 |
| NFPK-145 | + | + | + | + | - | + | - | 5 |
| NFPK-179 | + | + | + | + | + | + | + | 7 |
| NFPK-175 | + | + | + | + | - | + | + | 6 |
| NFPK-168 | + | + | - | + | + | - | - | 4 |
| NFPK-181 | - | + | - | - | - | - | - | 1 |
| NFPK-213 | + | + | + | + | + | - | - | 5 |
| NFPK-184 | + | + | + | + | + | + | + | 7 |
| Total | 18 | 19 | 17 | 18 | 15 | 11 | 11 | 109 |
The “+” and “-” indicate successful and failed PCR amplification, respectively.
Figure 2Results of STRUCTURE analysis for 80 individuals using 20 EST-SSR markers. (a) Estimation of population using Delta K value with cluster K ranging from 1 to 8. (b) Estimation of population structure based on STRUCTURE analysis.
Figure 3Principal component analysis based on 20 EST-SSR markers in P. koraiensis.