| Literature DB >> 31681436 |
Xiaoxian Ruan1, Zhen Wang1, Ting Wang2, Yingjuan Su1,3.
Abstract
Amentotaxus argotaenia (Taxaceae) is a vulnerable coniferous species with preference for shade and moist environment. Accurate estimation of genetic variation is crucial for its conservation, especially in the context of global warming. In this study, we acquired a transcriptome from A. argotaenia leaves using Illumina sequencing and de novo assembled 62,896 unigenes, of which 5510 EST-SSRs were detected. Twenty-two polymorphic EST-SSRs were successfully developed and further used to investigate genetic variation, linkage disequilibrium, and bottleneck signatures of A. argotaenia. The results showed that A. argotaenia had moderate genetic variation and high genetic differentiation, which may provide raw material to protect against climatic changes and accelerate local adaptation, respectively. No bottlenecks were found to occur in A. argotaenia. Our study not only showed that these EST markers are very effective in population genetic analysis but also lay a solid foundation for further investigating adaptive evolution and conservation strategies of A. argotaenia.Entities:
Keywords: Amentotaxus argotaenia; EST-SSR; genetic variation; population structure; transcriptome
Year: 2019 PMID: 31681436 PMCID: PMC6813739 DOI: 10.3389/fgene.2019.01014
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Sequencing, assembly, and annotation results of Amentotaxus argotaenia transcriptome.
| Total raw reads | 25,257,542 |
|---|---|
| Total clean reads | 23,553,846 |
| % Q20 | 100 |
| % Q30 | 97 |
| % GC | 47 |
| Number of transcripts | 80,674 |
| Average length of transcripts (bp) | 756 |
| N50 of transcripts (bp) | 1018 |
| Number of unigenes | 62,896 |
| Minimum length of unigenes (bp) | 301 |
| Average length of unigenes (bp) | 721 |
| N50 of unigenes (bp) | 947 |
| Number of unigenes annotated at least one databases | 36,671 |
| Number of unigenes with Nt annotations | 13,140 |
| Number of unigenes with Nr annotations | 32,183 |
| Number of unigenes with KOG annotations | 32,953 |
| Number of unigenes assigned to GO terms | 31,283 |
| Number of unigenes with Swiss-Prot annotations | 26,309 |
| Number of unigenes with Pfam annotations | 21,595 |
| Number of unigenes with KEGG pathways | 15,072 |
| Number of unigenes annotated to seven public databases | 490 |
Nt, NCBI nucleotide sequences; Nr, NCBI non-redundant database; KOG, euKaryotic Ortholog Groups; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.
Figure 1Functional classification of Gene Ontology for Amentotaxus argotaenia unigenes.
The distribution of repeat unit number and motif length of EST-SSRs.
| SSR motif length | Repeat unit number | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 5 | 6 | 7 | 8 | 9 | 10 | 11 | > 11 | Total | Percentage (%) | |
| Mono | 1682 | 787 | 1085 | 3554 | 64.50 | |||||
| Di | 255 | 89 | 60 | 32 | 20 | 16 | 34 | 506 | 9.18 | |
| Tri | 747 | 266 | 112 | 61 | 8 | 15 | 5 | 12 | 1226 | 22.25 |
| Tetra | 65 | 27 | 1 | 4 | 1 | 2 | 100 | 1.82 | ||
| Penta | 42 | 3 | 3 | 3 | 2 | 2 | 55 | 1.00 | ||
| Hexa | 34 | 18 | 5 | 8 | 1 | 2 | 1 | 69 | 1.25 | |
| Total | 888 | 569 | 210 | 136 | 42 | 1719 | 812 | 1134 | 5510 | |
| Percentage (%) | 16.12 | 10.33 | 3.81 | 2.47 | 0.76 | 31.19 | 14.74 | 20.58 | ||
Mono, mono-nucleotide; Di, di-nucleotide; Tri, tri-nucleotide; Tetra, tetra-nucleotide; Penta, penta-nucleotide; Hexa, hexa-nucleotide.
Characterization of 38 EST-SSR primer pairs for A. argotaenia.
| Locus | Primer sequences (5′–3′) | Repeat motif | Allele size(bp) |
| GenBank accession no. | Putative function [Organism] |
|
|---|---|---|---|---|---|---|---|
|
| F: ACCTCGGGTCCTGTTGAAR: GGTTGTGGCGAATGCTG | (TAC)6(TAT)5 | 249 | 54 | MG209531 | Unknown [ | 3e-53 |
| SHS-1845 | F: TCTGAGATAAGGTGCTTGGGTGR: ATTTGAGGGCTACAGCGGTT | (GTC)7 | 126 | 55 | MG209532 | Unknown [ | 3e-48 |
|
| F: AGAGACCACCAACGACGAACR: CAGCGGCAGCATACCATT | (TCC)7 | 298 | 55 | MG209533 | hypothetical protein AMTR_s00032p00067030 [ | 3e-25 |
| SHS-2589 | F: CTAACCCTATCCCTAACCTCTTTCR: GTTTCATTCCAGGCACTCTCA | (GAA)5 | 159 | 57 | MG209534 | Unknown [ | 2e-121 |
| SHS-5811 | F: TAGATTTAGTTCCCAGCGGTGR: GATTGATTTCGGCTCGTGTAT | (AG)8 | 279 | 53 | MG209535 | Not found | — |
|
| F: TTCTACTTCTGCTGCTGGTGTR: GCATTGGTCTTCCTCCTTTAC | (TTC)7 | 214 | 54 | MG209536 | hypothetical protein AMTR_s00003p00222410 [ | 0 |
|
| F: CTCTTTGGGAGAAGTATTAGCR: TGGTCACTCGTGGACATTA | (AAGA)5 | 134 | 53 | MG209537 | PREDICTED: uncharacterized protein LOC103491482 isoform X2 [ | 5e-45 |
| SHS-18170 | F: GAGAGCCCACGGTCCTGTR: AGTCCCATCATCCACCTATCA | (GAA)6 | 300 | 53 | MG209538 | PREDICTED: zinc finger protein 43-like [ | 1e-10 |
|
| F: AAAGTCGGGTGATTACAGAGCR: TCCTTCGTGGAATGTTTATGA | (GA)7 | 397 | 54 | MG209539 | Not found | — |
|
| F: ACCTCCTACACCCCCTTCTR: AACTCCACCATACGCATCTTA | (GAA)6 | 236 | 54 | MG209540 | Unknown [ | 6e-97 |
|
| F: CCCAAAGAAAGGGCAAGAR: CGGCGGATGGTAATGTG | (CGC)7 | 278 | 53 | MG209541 | Unknown [ | 3e-64 |
| SHS-20137 | F: CTGTCAGGCATTTCTGGGTCTR: CGATTTTCATTTTGTTTGGTCTG | (CTT)7 | 257 | 58 | MG209542 | Not found | — |
|
| F: CATTCTCACACCCTTGTATTGCTR: CATCTTCACCATTTCTCTGTAGTCTT | (TA)8 | 259 | 58 | MG209543 | transcription factor AP2 [ | 8e-117 |
| SHS-21264 | F: CTCGTCCAAGAAGAACCATACR: CATCATAAACCACTTAGCAAATAC | (GAG)6 | 400 | 56 | MG209544 | PREDICTED: uncharacterized protein LOC104240103 [ | 7e-41 |
| SHS-21490 | F: GAGGAAGAGGGTTTTGGTCATR: AGTAGGCGTCTTTGGCGTT | (TAA)6 | 190 | 58 | MG209545 | hypothetical protein PRUPE_ppa010075mg [ | 1e-59 |
| SHS-22515 | F: CACATCCTCCGCCGACTR: TTGCTGTTTTACCGAGAAGAAG | (TAC)6(TAT)5 | 266 | 57 | MG209546 | Unknown [ | 2e-53 |
| SHS-23191 | F: ACCCAGTTGTGGTAGGAGCATR: AAAGTGTGAAACATCCCAAAGC | (GAG)6 | 161 | 57 | MG209547 | Not found | — |
| SHS-23195 | F: TGACAACGAGAACGAAGAACATAACR: GTCTGTAAGCCAACGCTGAGG | (AGA)6 | 115 | 57 | MG209548 | hypothetical protein SELMODRAFT_451322 [ | 2e-41 |
|
| F: CCTAATGGTGAATAACTTGTGCTCR: GCGAGTTTCTTGCTAAATGCTT | (TCTT)5 | 330 | 58 | MG209549 | Not found | — |
| SHS-24301 | F: TACCTGACTGGACTGCTGAATR: ATGTTAGAGGAATACGATAGGCT | (CCG)6 | 377 | 57 | MG209550 | Unknown [ | 1e-32 |
|
| F: AGATACTCTTGTTTCAGGAGCATTR: CAACCCAGGACATCACCATAG | (AAG)7 | 228 | 57 | MG209551 | Not found | — |
|
| F: GGGCGGAGGAGAATGGTCR: TGGGCTGCTGAAATAGGAAAC | (GGA)6 | 250 | 57 | MG209552 | PREDICTED: uncharacterized protein LOC104602979, partial [ | 1e-65 |
|
| F: CAATCGGATAAGGTGTTTCTR: CGAATAGTGGTAATCAAATAGG | (GA)16 | 379 | 52 | MG209553 | Unknown [ | 0 |
| SHS-28326 | F: GTATGGAAGGGAGGCGAAATR: GCCGCTGTGGTTGTGAAG | (ATA)6 | 143 | 57 | MG209554 | Unknown [ | 4e-29 |
|
| F: AATAAGAATAGGAGGGGTGAAGACR: GAGACAGAGGATTTGTAACGGAG | (CGG)6 | 218 | 57 | MG209555 | PREDICTED: mediator of RNA polymerase II transcription subunit 30-like isoform X1 [ | 1e-16 |
|
| F: CGTATCCCTGTTGATTCATTTTCR: GGTTGTATCATTCAGTCCCATTG | (GAAA)6 | 240 | 57 | MG209556 | Not found | — |
| SHS-30422 | F: TTCCTTCTACTCCCTCTTCTATGTCR: AACTGCTTACCTAAATGGTGCTG | (CTT)6 | 163 | 57 | MG209557 | PREDICTED: probable cellulose synthase A catalytic subunit 5 [ | 0 |
|
| F: ATGGATGGCAGGATTGGATR: AACAAATAAGGAAGAAGGTGGTAGT | (TTG)6 | 182 | 57 | MG209558 | Not found | — |
| SHS-31820 | F: TTTGGTTCCATACCTGCTCCTR: TTCGTGGTCACTCTTTTCCCT | (GAT)6 | 149 | 57 | MG209559 | Unknown [ | 5e-172 |
|
| F: CCAGACTTGCCACATCAGCR: AACCCACAACCCACCAGAG | (ATG)5(AGG)7 | 395 | 57 | MG209560 | Unknown [ | 4e-11 |
|
| F: AAATGAGGAATAAGTAGGTGAAGTTR: GCACATTAGGGTTCCTGATTAC | (AGA)6 | 254 | 57 | MG209561 | Not found | — |
|
| F: CAACCCGTCCCTTGCTTTAGR: CCTCTGCGTCCTTGTTGTTATC | (GGCAG)5 | 302 | 57 | MG209562 | Unknown [ | 6e-176 |
|
| F: TGGAAAAAACCACAGACGACTCR: GCCCTCAAACACAAAAGCAG | (ATA)7 | 143 | 56 | MG209563 | Unknown [ | 1e-94 |
|
| F: CTGGACAAAGAGAGCAACGGTR: AATGGCGACACAAGTGAGAAGT | (CTC)7 | 222 | 56 | MG209564 | Hypothetical protein AMTR_s00032p00067030 [ | 2e-19 |
| SHS-35222 | F: TGCTGCCTAAACACAATGTCTCTR: CACAAGTCTTCCTTTTCCCTAATG | (TG)9 | 121 | 56 | MG209565 | Not found | — |
|
| F: GTTGAGCATTGATTTAGATGTTCGR: TTTCCCCTCCTCTTTCTTTGAC | (TGA)8 | 189 | 55 | MG209566 | PREDICTED: uncharacterized protein LOC104596414 isoform X2 [ | 2e-68 |
|
| F: TTACCAACGCCAAATGCTGR: ACCCTACTCCCACTCCCTTCT | (GAGATG)7 | 154 | 55 | MG209567 | hypothetical protein AMTR_s00099p00142540 [ | 1e-24 |
| SHS-39519 | F: TTGTGCCTCTTCAAGGAGTAGTR: GAGAATCTTCCCTGTCGGTC | (CTC)6 | 261 | 55 | MG209568 | ACC synthase-like [ | 0 |
Ta, annealing temperature; bold font, polymorphic loci.
Genetic diversity statistics for four A. argotaenia populations based on 22 polymorphic EST-SSR primers.
| JQS ( | CP ( | QNS ( | WGS ( | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Locus |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| PIC |
| SHS-1306 | 2 | 1.923 | 0.000*‡ | 0.480 | 2 | 1.166 | 0.000* | 0.142 | 2 | 1.180 | 0.000* | 0.153 | 2 | 1.600 | 0.000*‡ | 0.375 | 0.395 |
| SHS-2019 | 2 | 1.301 | 0.267 | 0.231 | 3 | 1.807 | 0.462 | 0.447 | 2 | 1.180 | 0.000* | 0.153 | 2 | 1.133 | 0.000* | 0.117 | 0.429 |
| SHS-6181 | 1 | 1.000 | 0.000 | 0.000 | 1 | 1.000 | 0.000 | 0.000 | 2 | 1.087 | 0.083 | 0.080 | 3 | 2.327 | 0.000*‡ | 0.570 | 0.225 |
| SHS-17706 | 3 | 1.822 | 0.600 | 0.451 | 4 | 1.931 | 0.231‡ | 0.482 | 3 | 2.880 | 0.500* | 0.653 | 3 | 2.667 | 0.500* | 0.625 | 0.605 |
| SHS-18213 | 4 | 1.531 | 0.000*‡ | 0.347 | 2 | 1.742 | 0.000*‡ | 0.426 | 4 | 1.714 | 0.000*‡ | 0.417 | 3 | 1.910 | 0.000*‡ | 0.477 | 0.402 |
| SHS-18563 | 1 | 1.000 | 0.000 | 0.000 | 1 | 1.000 | 0.000 | 0.000 | 2 | 1.800 | 0.000*‡ | 0.444 | 3 | 2.844 | 0.000*‡ | 0.648 | 0.371 |
| SHS-19735 | 4 | 2.217 | 0.600 | 0.549 | 2 | 1.352 | 0.000*‡ | 0.260 | 2 | 1.385 | 0.000*‡ | 0.278 | 3 | 1.684 | 0.250* | 0.406 | 0.393 |
| SHS-20198 | 2 | 1.923 | 0.000*‡ | 0.480 | 3 | 1.610 | 0.000*‡ | 0.379 | 4 | 3.236 | 0.250*‡ | 0.691 | 3 | 2.032 | 0.000*‡ | 0.508 | 0.560 |
| SHS-24187 | 4 | 2.284 | 0.467 | 0.562 | 3 | 2.048 | 0.692 | 0.512 | 3 | 2.667 | 0.667 | 0.625 | 4 | 2.090 | 0.375* | 0.521 | 0.528 |
| SHS-26622 | 6 | 4.091 | 0.800 | 0.756 | 7 | 4.694 | 1.000 | 0.787 | 4 | 2.743 | 1.000 | 0.635 | 4 | 2.498 | 1.000 | 0.600 | 0.707 |
| SHS-26840 | 4 | 2.761 | 0.867 | 0.638 | 4 | 2.467 | 0.462 | 0.595 | 3 | 2.268 | 0.583 | 0.559 | 2 | 2.000 | 1.000* | 0.500 | 0.514 |
| SHS-28207 | 4 | 2.663 | 1.000 | 0.624 | 5 | 3.282 | 1.000 | 0.695 | 5 | 3.840 | 0.917* | 0.740 | 3 | 1.684 | 0.500 | 0.406 | 0.701 |
| SHS-28474 | 4 | 1.772 | 0.000*‡ | 0.436 | 3 | 1.610 | 0.000*‡ | 0.379 | 3 | 2.323 | 0.000*‡ | 0.569 | 4 | 1.707 | 0.000*‡ | 0.414 | 0.456 |
| SHS-28717 | 2 | 1.471 | 0.000*‡ | 0.320 | 2 | 1.742 | 0.000*‡ | 0.426 | 3 | 2.323 | 0.000*‡ | 0.569 | 2 | 1.280 | 0.000*‡ | 0.219 | 0.581 |
| SHS-31463 | 1 | 1.000 | 0.000 | 0.000 | 2 | 1.257 | 0.231 | 0.204 | 1 | 1.000 | 0.000 | 0.000 | 2 | 1.133 | 0.125 | 0.117 | 0.084 |
| SHS-31908 | 1 | 1.000 | 0.000 | 0.000 | 3 | 1.857 | 0.000*‡ | 0.462 | 2 | 1.800 | 0.000*‡ | 0.444 | 3 | 1.293 | 0.000*‡ | 0.227 | 0.297 |
| SHS-32587 | 2 | 1.471 | 0.000*‡ | 0.320 | 3 | 1.610 | 0.000*‡ | 0.379 | 4 | 2.595 | 0.083*‡ | 0.615 | 3 | 2.612 | 0.000*‡ | 0.617 | 0.534 |
| SHS-32686 | 2 | 1.800 | 0.267 | 0.444 | 2 | 1.352 | 0.308 | 0.260 | 2 | 1.180 | 0.000* | 0.153 | 2 | 1.969 | 0.125‡ | 0.492 | 0.399 |
| SHS-32939 | 3 | 1.495 | 0.400 | 0.331 | 2 | 1.451 | 0.385 | 0.311 | 2 | 1.180 | 0.167 | 0.153 | 2 | 1.992 | 0.563 | 0.498 | 0.393 |
| SHS-34629 | 2 | 1.301 | 0.267 | 0.231 | 2 | 1.649 | 0.538 | 0.393 | 2 | 1.385 | 0.000*‡ | 0.278 | 2 | 1.600 | 0.000*‡ | 0.375 | 0.473 |
| SHS-35453 | 2 | 1.800 | 0.667 | 0.444 | 2 | 1.988 | 0.000*‡ | 0.497 | 2 | 1.882 | 0.583 | 0.469 | 3 | 2.667 | 0.500* | 0.625 | 0.579 |
| SHS-38297 | 2 | 1.301 | 0.267 | 0.231 | 2 | 1.451 | 0.385 | 0.311 | 2 | 1.087 | 0.083 | 0.080 | 1 | 1.000 | 0.000 | 0.000 | 0.384 |
N, number of individuals of each populations; Na, number of alleles per locus; Ne, number of effective alleles per locus; Ho, observed heterozygosity; He, expected heterozygosity; PIC, polymorphism information content; *significant deviation from the Hardy–Weinberg equilibrium (p < 0.001); ‡significant possibility of the presence of null alleles detected by MICRO-CHECKER.
Figure 2Population genetic structure analysis of A. argotaenia based on EST-SSRs. JQS, Population Jiuqushui; CP, Population Chuanping; WGS, Population Wugongshan; QNS, Population Qiniangshan.