| Literature DB >> 32092852 |
Yingling Wan1, Min Zhang1, Aiying Hong2, Yixuan Zhang1, Yan Liu1,3,4,5.
Abstract
The insufficient number of available simple sequence repeats (SSRs) inhibits genetic research on and molecular breeding of Paeonia lactiflora, a flowering crop with great economic value. The objective of this study was to develop SSRs for P. lactiflora with Illumina RNA sequencing and assess the role of SSRs in gene regulation. The results showed that dinucleotides with AG/CT repeats were the most abundant type of repeat motif in P. lactiflora and were preferentially distributed in untranslated regions. Significant differences in SSR size were observed among motif types and locations. A large number of unigenes containing SSRs participated in catalytic activity, metabolic processes and cellular processes, and 28.16% of all transcription factors and 21.74% of hub genes for inflorescence stem straightness were found to contain SSRs. Successful amplification was achieved with 89.05% of 960 pairs of SSR primers, 55.83% of which were polymorphic, and most of the 46 tested primers had a high level of transferability to the genus Paeonia. Principal component and cluster dendrogram analyses produced results consistent with known genealogical relationships. This study provides a set of SSRs with abundant information for future accession identification, marker-trait association and molecular assisted breeding in P. lactiflora.Entities:
Keywords: herbaceous peony; molecular marker; next-generation sequencing; pedigree
Mesh:
Substances:
Year: 2020 PMID: 32092852 PMCID: PMC7073652 DOI: 10.3390/genes11020214
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Size range of amplification products, sample size (N), the frequency of null allele at locus (Null allele), number of different alleles (Na), number of effective alleles (Ne), Shannon’s information index (I), observed heterozygosity (Ho), expected heterozygosity (He), fixation index (F) and polymorphism information content (PIC)./means we did not calculated the frequency of null allele at the locus because there was no significant difference in the observed and expected value in this locus and it followed Hardy-Weinberg equilibrium (Chi-Square tests, p < 0.05).
| Locus | Repeat Motif | Size Range (bp) | N |
|
|
|
|
|
| F | PIC |
|---|---|---|---|---|---|---|---|---|---|---|---|
| T125 | (TC)9 | 128–170 | 31 | 15 | 7.84 | 2.29 | 0.77 | 0.87 | 0.073 | 0.11 | 0.86 |
| T163 | (TA)8 | 146–160 | 31 | 8 | 3.96 | 1.63 | 0.65 | 0.75 | 0.085 | 0.14 | 0.71 |
| T237 | (CT)10 | 95–109 | 31 | 8 | 4.59 | 1.76 | 0.68 | 0.78 | 0.095 | 0.13 | 0.75 |
| T241 | (GA)15 | 127–157 | 31 | 12 | 6.77 | 2.11 | 0.81 | 0.85 | / | 0.05 | 0.84 |
| C160 | (AT)9ctcctt(CTC)5 | 211–227 | 29 | 8 | 4.58 | 1.79 | 0.66 | 0.78 | 0.117 | 0.16 | 0.76 |
| TA564 | (TA)8 | 172–201 | 26 | 9 | 5.43 | 1.90 | 0.77 | 0.82 | 0.058 | 0.06 | 0.79 |
| T317 | (CT)10 | 157–183 | 30 | 9 | 5.94 | 1.92 | 0.83 | 0.83 | / | 0.00 | 0.81 |
| S024 | (AAT)16 | 161–201 | 31 | 10 | 5.60 | 1.94 | 0.77 | 0.82 | 0.042 | 0.06 | 0.80 |
| T040 | (GA)7 | 228–246 | 31 | 7 | 5.02 | 1.73 | 0.68 | 0.80 | 0.106 | 0.15 | 0.77 |
| T179 | (AT)7 | 235–269 | 31 | 10 | 4.93 | 1.83 | 0.71 | 0.80 | 0.070 | 0.11 | 0.77 |
| T192 | (CT)8 | 212–230 | 31 | 8 | 4.50 | 1.75 | 0.68 | 0.78 | 0.093 | 0.13 | 0.75 |
| T300 | (AT)10 | 218–238 | 31 | 9 | 4.45 | 1.80 | 0.48 | 0.78 | 0.171 | 0.38 | 0.75 |
| TA673 | (CA)18 | 190–288 | 31 | 15 | 6.74 | 2.20 | 0.81 | 0.85 | 0.058 | 0.05 | 0.84 |
| T210 | (GA)10 | 253–285 | 31 | 12 | 4.44 | 1.86 | 0.81 | 0.77 | 0.044 | −0.04 | 0.75 |
| TA038 | (CT)9 | 259–299 | 31 | 13 | 6.74 | 2.18 | 0.13 | 0.85 | 0.391 | 0.85 | 0.84 |
| S033 | (TAT)7 | 137–181 | 31 | 10 | 2.47 | 1.39 | 0.42 | 0.59 | 0.123 | 0.29 | 0.57 |
| T304 | (GA)10 | 147–163 | 30 | 9 | 4.75 | 1.77 | 0.63 | 0.79 | 0.144 | 0.20 | 0.76 |
| T863 | (AG)10 | 140–160 | 31 | 10 | 6.26 | 2.03 | 0.74 | 0.84 | 0.070 | 0.12 | 0.82 |
| TA028 | (AC)6 | 101–123 | 31 | 8 | 3.88 | 1.69 | 0.48 | 0.74 | 0.177 | 0.35 | 0.72 |
| T239 | (CT)9 | 185–221 | 31 | 11 | 2.98 | 1.59 | 0.58 | 0.66 | 0.069 | 0.13 | 0.64 |
| TA144 | (CT)6 | 186–286 | 31 | 9 | 4.00 | 1.64 | 0.68 | 0.75 | / | 0.10 | 0.71 |
| T852 | (AT)8 | 158–367 | 31 | 13 | 5.95 | 2.06 | 0.35 | 0.83 | 0.268 | 0.57 | 0.81 |
| F106 | (TATG)5 | 200–290 | 31 | 13 | 5.02 | 2.03 | 0.55 | 0.80 | 0.145 | 0.32 | 0.78 |
| TA082 | (AG)6 | 242–280 | 30 | 13 | 6.00 | 2.03 | 0.73 | 0.83 | 0.084 | 0.12 | 0.81 |
| TA079 | (AT)8 | 232–264 | 29 | 8 | 2.93 | 1.45 | 0.52 | 0.66 | 0.161 | 0.21 | 0.62 |
| T356 | (AG)9 | 247–269 | 30 | 9 | 4.76 | 1.81 | 0.33 | 0.79 | 0.296 | 0.58 | 0.76 |
| TA133 | (AT)8 | 258–286 | 28 | 9 | 4.28 | 1.72 | 0.75 | 0.77 | 0.095 | 0.02 | 0.73 |
| SA010 | (AAT)5 | 264–294 | 31 | 9 | 4.98 | 1.85 | 0.71 | 0.80 | 0.079 | 0.11 | 0.77 |
| W75 | (CTCAC)5 | 257–293 | 31 | 8 | 4.65 | 1.70 | 0.65 | 0.79 | 0.092 | 0.18 | 0.75 |
| T205 | (TC)9 | 275–301 | 31 | 13 | 6.94 | 2.18 | 0.71 | 0.86 | / | 0.17 | 0.84 |
| S853 | (TTG)5 | 158–186 | 29 | 8 | 4.67 | 1.71 | 0.69 | 0.79 | / | 0.12 | 0.75 |
| T859 | (AG)19 | 166–200 | 29 | 11 | 7.61 | 2.15 | 0.90 | 0.87 | 0.000 | -0.03 | 0.85 |
| TA566 | (GA)6 | 167–185 | 25 | 7 | 4.83 | 1.71 | 0.80 | 0.79 | / | −0.01 | 0.76 |
| TA695 | (AC)15 | 164–180 | 30 | 8 | 3.00 | 1.49 | 0.57 | 0.67 | 0.119 | 0.15 | 0.64 |
| S237 | (CTG)8 | 193–219 | 31 | 13 | 9.96 | 2.41 | 0.74 | 0.90 | 0.081 | 0.18 | 0.89 |
| TA464 | (TC)8 | 200–214 | 30 | 8 | 5.56 | 1.85 | 0.73 | 0.82 | 0.062 | 0.11 | 0.80 |
| T160 | (CT)10 | 205–227 | 30 | 10 | 5.47 | 1.92 | 0.70 | 0.82 | / | 0.14 | 0.80 |
| TA134 | (GA)16 | 142–224 | 29 | 13 | 5.43 | 2.09 | 0.93 | 0.82 | 0.019 | −0.14 | 0.80 |
| SA061 | (GAA)7 | 257–277 | 29 | 6 | 3.43 | 1.46 | 0.38 | 0.71 | 0.281 | 0.46 | 0.67 |
| TA074 | (CT)8 | 260–306 | 30 | 13 | 5.79 | 2.03 | 0.97 | 0.83 | 0.000 | −0.17 | 0.81 |
| TA704 | (TC)17 | 237–277 | 30 | 7 | 5.19 | 1.73 | 0.70 | 0.81 | 0.111 | 0.13 | 0.78 |
| TA610 | (TC)8 | 257–291 | 28 | 11 | 6.25 | 2.05 | 0.68 | 0.84 | 0.117 | 0.19 | 0.82 |
| S025 | (ATT)7 | 268–350 | 29 | 13 | 5.14 | 2.02 | 0.55 | 0.81 | 0.200 | 0.32 | 0.79 |
| TA022 | (TC)6 | 276–318 | 30 | 16 | 6.87 | 2.23 | 0.90 | 0.85 | 0.000 | −0.05 | 0.84 |
| TA086 | (AG)10 | 393–419 | 31 | 10 | 5.62 | 1.94 | 0.74 | 0.82 | 0.077 | 0.10 | 0.80 |
| TA700 | (AG)9 | 278–312 | 29 | 13 | 5.76 | 2.05 | 0.62 | 0.83 | 0.155 | 0.25 | 0.81 |
| Mean | 30.07 | 10.26 | 5.26 | 1.88 | 0.67 | 0.80 | 0.11 | 0.16 | 0.77 | ||
| SE | 0.20 | 0.36 | 0.21 | 0.03 | 0.02 | 0.01 | 0.01 | 0.03 | 0.01 |
Figure 1Distribution of different repeat motifs and positions of simple sequence repeats (SSRs) in unigenes. (A) Proportion and distribution of each type of motif in dinucleotide (Di-), trinucleotide (Tri-), tetranucleotide (Tetra-) and other (i.e., Penta-, Hexa- and compound) repeats. In the legend, ‘other Tri-’ consists of ACG/CGT (0.26%), ACT/AGT (0.26%) and CCG/CGG (0.42%), and ‘Other Tetra-’ consists of 26 types of Tetra- repeats, the most abundant of which are AATC/ATTG (0.22%) and AGGG/CCCT (0.22%). (B) Abundances of six motifs in different unigene positions. Two types of SSRs were located in multiple regions. One of these SSRs was located across two 5′ UTRs, a coding sequences (CDS) and a 3′ UTR; another was located in multiple CDSs in one unigene. The SSR locations differed from each other. Unknown refers to the SSRs without matching locations.
Figure 2Gene Ontology (GO) analysis of unigenes containing SSRs. The lighter color of each bar represents the number of unigenes without matching coding sequences.
Figure 3Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway annotation of unigenes with embedded SSRs. The numbers outside the circle represent the cumulative number of unigenes beginning at zero and moving in a clockwise manner.
Figure 4Principal coordinates analysis (PCoA) of amplified loci (A) and dendrogram generated by Bruvo’s distances (B) of 31 accessions, including seven species and 24 cultivars of Paeonia lactiflora. The cultivar names are abbreviated with capitalized letters in (A), and their full names are shown in (B). The UPGMA tree was produced with 1000 bootstrap replicates, and the node values greater than 50 are shown in the tree.