| Literature DB >> 33193679 |
Ruixiang Xu1, Zhen Wang1, Yingjuan Su1,2, Ting Wang3.
Abstract
Pseudotaxus chienii (Taxaceae) is an endangered conifer species endemic to China. However, a lack of suitable molecular markers hinders the genomic and genetic studies on this species. Here, we characterized and developed the microsatellite markers from a newly sequenced P. chienii transcriptome. A total of 21,835 microsatellite loci were detected from 161,131 non-redundant unigene sequences, and the frequency of SSRs was 13.55%, with an average of one SSR loci per 9.18 kb. Mono-nucleotide, di-nucleotide, and tri-nucleotide were the dominant repeat types, accounting for 50.06, 13.49, and 29.39% of the total SSRs, respectively. In terms of distribution location, the coding regions (CDS) with few microsatellites and mainly consisted of tri-nucleotides. There were significant differences in the length of microsatellite among genic regions and motif types. Functional annotation showed that the unigenes containing microsatellites had a wide range of biological functions, most of which were related to basic metabolism, and a few might be involved in expression regulation of gene and response to environmental stress. In addition, 375 primer pairs were randomly selected and synthesized for the amplification and validation of microsatellite markers. Seventy-seven primer pairs were successfully amplified and 40 primer pairs were found to be polymorphic. Finally, 20 pairs of primers with high polymorphism were selected to assess the genetic diversity in four P. chienii populations. In addition, the newly developed microsatellite markers exhibited high transferability (70%) in Amentotaxus argotaenia. Our study could enable further genetic diversity analysis and functional gene mining on Taxaceae.Entities:
Keywords: Pseudotaxus chienii; genetic diversity; microsatellite markers; transcriptome; transferability
Year: 2020 PMID: 33193679 PMCID: PMC7593448 DOI: 10.3389/fgene.2020.574304
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Sampling location information of four P. chienii populations.
| Population | Location | Sample Size | Latitude (N) | Longitude (E) | Altitude (m a.s.l.) |
| MS | Maoshan, Zhejiang | 31 | 118°58′23′′ | 28°06′08′′ | 1120 |
| YS | Yinshan park, Guangxi | 30 | 110°14′53′′ | 24°09′15′′ | 1050 |
| BJS | Bijiashan, Jiangxi | 30 | 114°09′41′′ | 26°30′35′′ | 1340 |
| ZJJ | Zhangjiajie, Hunan | 19 | 110°28′56′′ | 29°23′12′′ | 1055 |
Summary of de novo transcriptome assembly of four tissues from P. chienii.
| Categories | Items | Root | Stem | Leaf | Strobilus |
| Raw reads | Total raw reads | 52,633,598 | 57,112,536 | 62,840,720 | 42,617,842 |
| Clean reads | Total clean reads | 51,073,864 | 56,419,624 | 61,110,258 | 41,448,574 |
| Total clean bases (Gb) | 7.66 | 8.46 | 9.17 | 6.22 | |
| Q20 (%) | 97.16 | 98.03 | 96.9 | 97.28 | |
| Q30 (%) | 92.16 | 94.03 | 91.55 | 92.34 | |
| GC content (%) | 44.25 | 46.26 | 45.57 | 45.23 | |
| Transcripts | Total number of transcripts | 156,747 | 202,908 | 107,675 | 201,124 |
| Total length of transcripts (bp) | 113,815,935 | 136,683,283 | 89,744,115 | 114,523,821 | |
| Maximum length of transcripts (bp) | 13,579 | 12,558 | 11,562 | 12,275 | |
| Minimum length of transcripts (bp) | 201 | 201 | 201 | 201 | |
| Mean length of transcripts (bp) | 726 | 674 | 833 | 569 | |
| N50 (bp) | 1,395 | 1,224 | 1,621 | 1,011 | |
| Unigenes | Total number of unigenes | 66,126 | 79,842 | 52,207 | 60,391 |
| Total length of unigenes (bp) | 87,790,725 | 101,369,156 | 74,169,328 | 76,205,289 | |
| Maximum length of unigenes (bp) | 13,579 | 12,558 | 11,562 | 12,275 | |
| Minimum length of unigenes (bp) | 201 | 201 | 201 | 201 | |
| Mean length of unigenes (bp) | 1,328 | 1,270 | 1,421 | 1,262 | |
| N50 (bp) | 1,843 | 1,728 | 1,931 | 1,754 |
Analysis results of microsatellite based on the P. chienii transcriptome.
| Items | Number |
| Total number of sequences examined | 161,131 |
| Total size of examined sequences (bp) | 200,495,148 |
| Total number of identified microsatellite loci | 21,835 |
| Number of microsatellites containing sequences | 17,974 |
| Number of sequences containing more than 1 microsatellite loci | 2,969 |
| Number of microsatellites present in compound formation | 1,513 |
| Frequency of microsatellite loci | 13.55 |
| Distribution density of microsatellite loci (kb) | 9.18 |
| Mono-nucleotide | 10,930 |
| Di-nucleotide | 2,946 |
| Tri-nucleotide | 6,418 |
| Tetra-nucleotide | 354 |
| Penta-nucleotide | 343 |
| Hexa-nucleotide | 844 |
FIGURE 1Distribution of microsatellite motif types and tandem repeat numbers in P. chienii transcriptome.
FIGURE 2Distribution of microsatellite repeat motifs in P. chienii transcriptome.
FIGURE 3Distribution of six microsatellite repeat types in different genic regions of P. chienii.
Summary of functional annotation results of unigenes containing microsatellite in P. chienii transcriptome.
| Annotated databases | Number of unigenes | Percentage (%) |
| Nr | 12,652 | 70.39 |
| Nt | 6,660 | 37.05 |
| Swiss-Prot | 10,622 | 59.10 |
| KOG | 4,274 | 23.78 |
| Pfam | 11,267 | 62.68 |
| GO | 11,334 | 63.06 |
| KEGG | 5,417 | 30.14 |
| Annotated in all databases | 2,045 | 11.38 |
| Annotated in at least one database | 14,264 | 79.36 |
| Total | 17,974 | 100 |
FIGURE 4GO functional classification of unigenes containing microsatellite in P. chienii transcriptome.
FIGURE 5KEGG pathway classification of unigenes containing microsatellite in P. chienii transcriptome.
FIGURE 6KOG functional classification of unigenes containing microsatellite in P. chienii transcriptome.
FIGURE 7Significant enrichment KEGG pathways of unigenes containing microsatellite.
Characteristics of 20 newly developed polymorphic microsatellite markers in P. chienii.
| Locus | Primer sequence (5′-3′) | Repeat motif | Size range (bp) | Annealing temperature (°C) | Na | Ho | He | PIC* | Null | GenBank accession no. |
| F: GGTCGAGTACGTGGTGGTTT R: GCCTGCGCTGTCATAAACTG | (AGG)5 | 158–161 | 57 | 3.000 | 0.145 | 0.257 | 0.237** | 0.2703 (Y) | MT563347 | |
| F: TGTGCCAGTACTGCTACTGC R: TGAATGCGTGCGGAAACAAG | (ACC)6 | 183–195 | 57 | 5.000 | 0.209 | 0.585 | 0.517 | −0.0267 (N) | MT563350 | |
| F: GATGCCGCTGGTTTCAATCC R: GCCGTACCGATTGGGATCAT | (GGA)8 | 198–214 | 57 | 5.000 | 0.309 | 0.397 | 0.366* | 0.315 (Y) | MT563351 | |
| F: GAGTGGGAGACGAAGAGTGC R: CGAAGTGGGCTGCAACAATG | (CTC)8 | 261–266 | 57 | 6.000 | 0.600 | 0.670 | 0.626 | 0.0196 (N) | MT563352 | |
| F: AGCTGCAAGGCTACACAGAG R: CAATCCCGGGCCTGTTAGAA | (GAA)5 | 235–239 | 57 | 5.000 | 0.318 | 0.563 | 0.500** | −0.1413 (N) | MT563353 | |
| F: GGGCCATCCTCTTCCTCAAC R: CTCGACACTGCTCCACATCT | (TCC)8 | 240–247 | 57 | 3.000 | 0.055 | 0.129 | 0.125** | 0.0299 (N) | MT563358 | |
| F: CGCTCCAACGAATCCAACC R: TAATGCCATCCGCACAACC | (CAGAAG)5 | 241–277 | 57 | 14.000 | 0.743 | 0.844 | 0.826 | −0.1586 (N) | MT563368 | |
| F: GAATTTGAAGCACGGCCTCA R: GAGTGCCCTGCTTTCTGGAT | (GGCACC)5 | 242–277 | 57 | 7.000 | 0.156 | 0.595 | 0.549** | 0.3533 (Y) | MT563369 | |
| F: ACGCCACGTTAGGACACAAT R: CCTAGATCAAGAGCGGCCTG | (CTT)6 | 238–319 | 57 | 6.000 | 0.227 | 0.389 | 0.364 | 0.0983 (N) | MT563374 | |
| F: CTGTCAACAAGCGGCTTTCC R: AGAGCCGGGGGAAAATTGAG | (CGG)7 | 232–319 | 57 | 8.000 | 0.473 | 0.694 | 0.640** | 0.0296 (N) | MT563377 | |
| F: CCCATCTGAACCCACGCTAA R: AAAGCGCTCATGCCCAAAAC | (GGC)7 | 239–300 | 57 | 16.000 | 1.000 | 0.887 | 0.877** | −0.2134 (N) | MT563378 | |
| F: ACCTATCACCTCCTCGACCC R: CCGTTCCATCACTGTGGACA | (CCACCG)6 | 203–229 | 55 | 13.000 | 0.573 | 0.807 | 0.784** | 0.3871 (Y) | MT563379 | |
| F: GAGGGATACAGAAGCACAG R: TATGACAAACCCAAACGAG | (ATA)5 | 276–288 | 56 | 2.000 | 0.591 | 0.416 | 0.330** | 0 (N) | MT563329 | |
| F: GACAACGGCAAAGGAGGAAT R: GCGATAGCCACCAAAGACAT | (ATA)6 | 319–322 | 58 | 3.000 | 0.000 | 0.329 | 0.290** | 0 (N) | MT563334 | |
| F: TGCGGTTCAGTAACAGTCCTTC R: TCCCCCACCTCTTCCCAG | (CTCCTG)5 | 439–452 | 58 | 6.000 | 0.657 | 0.513 | 0.408 | −0.314 (N) | MT563335 | |
| F: TGTGTGAAAGGACAAGGCGT R: GCACCCTATTCACCCGAGAT | (ATGCAG)7 | 225–259 | 56 | 5.000 | 0.318 | 0.474 | 0.431 | −0.0267 (N) | MT563337 | |
| F: CCCCTCATTGACAGGTTC R: AAGATAGTCGGGACACCAAG | (CTG)5 | 309–320 | 56 | 6.000 | 0.036 | 0.336 | 0.306** | 0.3232 (Y) | MT563381 | |
| F: CACGCCCACCATAGTTGT R: GGAGGAAGATGTCGTTGAAG | (AAGG)5 | 263–270 | 58 | 3.000 | 0.045 | 0.523 | 0.449** | 0.2884 (Y) | MT563386 | |
| F: GACCTCTTACCAGCTGCGAG R: ACCACCGGTTTCAGTTTCGT | (CCT)11 | 208–227 | 62 | 9.000 | 0.200 | 0.765 | 0.726** | 0 (N) | MT563390 | |
| F: TAAGTGGCTGCTGCATCACA R: TACAGCAGCAGCAGAGCTTT | (TCC)5 | 249-251 | 58 | 3.000 | 0.318 | 0.360 | 0.302 | 0.1828 (N) | MT563392 |
Genetic diversity parameters of four populations of P. chienii.
| Population | N | Na | Ne | I | Ho | He | PPB (%) | |
| MS | 31 | 3.450 | 1.997 | 0.702 | 0.384 | 0.389 | 90 | 0.031 |
| YS | 30 | 3.500 | 1.918 | 0.670 | 0.360 | 0.357 | 85 | 0.008 |
| BJS | 30 | 3.650 | 2.033 | 0.716 | 0.350 | 0.381 | 100 | 0.099** |
| ZJJ | 19 | 2.850 | 1.759 | 0.602 | 0.272 | 0.340 | 85 | 0.227** |
| Species level | 110 | 6.400 | 2.814 | 1.071 | 0.349 | 0.527 | 100 | 0.342 |