| Literature DB >> 29871589 |
Matthew S Haynsen1,2,3, Mohammad Vatanparast3, Gouri Mahadwar3,4, Dennis Zhu3,5, Roy Z Moger-Reischer3,6, Jeff J Doyle7, Keith A Crandall2,8, Ashley N Egan9,10.
Abstract
BACKGROUND: Kudzu, Pueraria montana var. lobata, is a woody vine native to Southeast Asia that has been introduced globally for cattle forage and erosion control. The vine is highly invasive in its introduced areas, including the southeastern US. Modern molecular marker resources are limited for the species, despite its importance. Transcriptomes for P. montana var. lobata and a second phaseoloid legume taxon previously ascribed to genus Pueraria, Neustanthus phaseoloides, were generated and mined for microsatellites and single nucleotide polymorphisms.Entities:
Keywords: Invasive; Kudzu; Molecular markers; Neustanthus phaseoloides; Pueraria montana var. lobata; Transcriptome
Mesh:
Year: 2018 PMID: 29871589 PMCID: PMC5989403 DOI: 10.1186/s12864-018-4798-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Statistics following ConDeTri cleaning and Trinity assembly
| Accessions | CPP27 | Pmnk6 | CPP02 |
|---|---|---|---|
| Number of raw reads | 279,109 | 396,022 | 423,426 |
| Number of raw bases (bp) | 112,337,841 | 247,596,818 | 158,214,933 |
| Number of clean reads | 257,015 | 381,166 | 348,529 |
| Cleaned reads / Raw reads (%) | 92.1% | 71.0% | 82.3% |
| Number of clean bases (bp) | 75,672,645 | 124,810,371 | 87,666,889 |
| Mean clean read length (bp) | 294 | 444 | 252 |
| Number of aligned reads | 99,248 | 116,524 | 119,452 |
| Aligned read / Cleaned reads (%) | 38.6% | 41.4% | 34.3% |
| Number of contigs | 18,325 | 15,736 | 18,412 |
| Number of bases in contigs (bp) | 11,703,977 | 15,640,762 | 11,892,992 |
| Mean contig length (bp) | 639 | 994 | 646 |
| N50 (bp) | 755 | 1256 | 759 |
| Longest contig (bp) | 4335 | 4815 | 6221 |
| Number of singletons | 60,869 | 45,306 | 73,994 |
| Singletons / Cleaned reads (%) | 23.7% | 16.1% | 21.2% |
| Number of bases in singletons (bp) | 17,591,281 | 20,431,176 | 18,048,611 |
| Mean singleton length (bp) | 289 | 451 | 244 |
| Number of transcripts (contigs + singletons) | 79,194 | 61,042 | 92,406 |
bp base pairs
Fig. 1Contig length distributions of Trinity using the ConDeTri dynamic read trimmer
Fig. 2Transcriptome completeness of transcripts quantified through 956 universal single-copy orthologs using BUSCO
Fig. 3The number of unique and shared, homologous transcripts among kudzu CPP27 and Pmnk6 and tropical kudzu CPP02 transcriptome assemblies as ascertained via reciprocal best BLAST hit analyses
Summary of gene ontology analysis
| Accessions | Transcripts | Orfs | Predictions | BLAST Hits | Annotated GO IDs | ECs |
|---|---|---|---|---|---|---|
| CPP27 | 79,194 | 37,741 | 30,716 | 28,795 | 18,446 | 8039 |
| (18,325/60869) | (13,534/17182) | (12,583/16212) | (7958/10488) | |||
| Pmnk6 | 61,042 | 50,320 | 42,386 | 39,366 | 24,447 | 6337 |
| (15,736/45306) | (14,821/27565) | (12,705/26661) | (8079/16368) | |||
| CPP02 | 92,406 | 34,223 | 27,661 | 22,472 | 13,230 | 4064 |
| (18,412/73994) | (14,677/12984) | (10,407/12065) | (6085/7145) |
Orfs open reading frames, GO gene ontology, ECs enzyme codes. Parentheses: (contigs/singletons)
Fig. 4Gene ontology classifications of kudzu and tropical kudzu annotated transcripts. Numbers indicate the number of sequences associated with the particular GO term in each category
Single nucleotide polymorphism detection among kudzu and tropical kudzu genotypes
| Comparison | HC SNPs | SNPs > 95%a | SNPs >20xb | Total SNPsc | Ts/Tv |
|---|---|---|---|---|---|
| Pmnk6 vs CPP27 | 10,417 | 6016 | 426 | 358 | 1.41 |
| (7494/2923) | (4125/1891) | (252/174) | |||
| CPP02 vs CPP27 | 99,584 | 86,626 | 5831 | 5185 | 1.60 |
| (81,276/18308) | (70,638/15988) | (5091/740) | |||
| CPP02 vs Pmnk6 | 220,739 | 174,884 | 21,258 | 19,028 | 1.73 |
| (164,118/56621) | (127,311/47573) | (19,255/2003) | |||
| CPP02 vs Pmnk6, CPP27 | 314,416 | 248,719 | 33,603 | 30,143 | 1.71 |
| (229,163/85251) | (178,102/70617) | (29,812/3791) | |||
| Japan vs Pmnk6, CPP27, China | 494,234 | 79,088 | 27,108 | 24,475 | 1.47 |
| (494,234/0) | (79,088/0) | (27,108/0) |
aSNPs with the > 95% frequency
bSNPs with > 95% frequency and > 20x coverage
cOne-to-one point mutations after exclusion of indels and length variants; HC: high confidence; parentheses: (contigs/singletons)
Transcriptome eSSRs
| CPP27 | CPP02 | |
|---|---|---|
| Transcripts | 79,194 | 92,406 |
| Raw eSSRs | 5255 | 4586 |
| Dinucleotide | 770 | 670 |
| Trinucleotide | 4032 | 3524 |
| Tetranucleotide | 180 | 137 |
| Pentanucleotide | 106 | 79 |
| Hexanucleotide | 167 | 176 |
| Primered eSSRs | 1646 | 1459 |
| Dinucleotide | 14 | 28 |
| Trinucleotide | 1458 | 1273 |
| Tetranucleotide | 62 | 54 |
| Pentanucleotide | 41 | 25 |
| Hexanucleotide | 71 | 79 |
Seven eSSR primers optimized and used to assess population genetics in kudzu accessions
| Locus | Sequence | Dye/Tail | SSR | Length (bp) |
|---|---|---|---|---|
| PP2 | F: 5′-TAG GAG TGC AGC AAG CAT ATG CCG CGG ATC TTT GAA AG-3’ | VIC /M13A | AAC | 100–130 |
| R: 5’-CAA ATT GGC CCT GTC CCA AT-3’ | n/a | |||
| PP4 | F: 5′-TGT AAA ACG ACG GCC AGT CAT GCC CAC GTG CTT CAT AG-3’ | 6FAM/M13 | GCT | 100–140 |
| R: 5’-CTC TCA GAT CCA GGC CCA AA-3’ | n/a | |||
| PP10 | F: 5′-TAG GAG TGC AGC AAG CAT GGC ATG TAG ATC CAG CTA AA-3’ | VIC/M13A | GGT | 310–330 |
| R: 5′-TTG ACA GAT TTC TGA TTC TTG G-3’ | n/a | |||
| PP13 | F: 5′-TAG GAG TGC AGC AAG CAT GAT TGA GCA GGC ACG AGA AC-3’ | VIC/M13A | GCT | 270–300 |
| R: 5’-CAG TAG CAG GCA TGT GTT GG-3’ | n/a | |||
| PL1 | F: 5’-CAC TGC TTA GAG CGA TGC TGT AAG CGT TCG TTC GTT GG-3’ | PET/M13B | CTT | 400–440 |
| R: 5’-TCA ACC TGG TGC TCT CTG AC-3’ | n/a | |||
| PL7 | F: 5′-TGT AAA ACG ACG GCC AGT AGT GGC CTT GCT CTT CTT CC-3’ | 6FAM/M13 | CTT | 80–140 |
| R: 5′-GTG TCA TCT CAG CAC GTT GG-3’ | n/a | |||
| PL11 | F: 5′-TGT AAA ACG ACG GCC AGT TGG CAT CAT CCT TCA ACC AC-3’ | 6FAM/M13 | ACC | 300–330 |
| R: 5′-ATT CGG GAA TAG TGG GTG GG-3’ | n/a |
F forward primer, R reverse primer. Dyes VIC: 2′-chloro-7′phenyl-1,4-dichloro-6-carboxy-fluorescein; 6FAM: 6-carboxyfluorescein; PET: chemical structure currently unpublished as proporietary to Lifetech. Tail: see Culley et al. [19] for information about M13, M13A, and M13B
Fig. 5STRUCTURE diagram of 75 P. montana accessions across four nations (K = 3)
Allelic frequency for Pueraria national populations
| Locus | USA | China | Japan | Thailand | Mean | SD | Total |
|---|---|---|---|---|---|---|---|
| PP2 | 8 | 7 | 6 | 4 | 6.25 | 1.71 | 9 |
| PP4 | 4 | 5 | 7 | 3 | 4.75 | 1.71 | 9 |
| PP10 | 5 | 5 | 6 | 3 | 4.75 | 1.26 | 8 |
| PP13 | 3 | 7 | 4 | 2 | 4.00 | 2.16 | 7 |
| PL1 | 4 | 4 | 2 | 4 | 3.50 | 1.00 | 9 |
| PL7 | 8 | 8 | 11 | 3 | 7.50 | 3.32 | 15 |
| PL11 | 5 | 7 | 3 | 1 | 4.00 | 2.58 | 7 |
| Mean | 5.29 | 6.14 | 5.57 | 2.86 | 4.96 | 1.96 | 9.14 |
| SD | 1.98 | 1.46 | 2.99 | 1.07 | 1.42 | 0.80 | 2.73 |
| Total | 37 | 43 | 39 | 20 | 34.75 | 13.73 | 64 |
N number of accessions, SD standard deviation
Observed and expected heterozygosities for Pueraria subpopulations
| US 1 | US 2 | US 3 | CN 1 | CN 2 | CN 3 | JP 1 | JP 2 | JP 3 | TH | |
|---|---|---|---|---|---|---|---|---|---|---|
| # Individuals | 8 | 10 | 7 | 5 | 8 | 8 | 7 | 8 | 7 | 7 |
| Obs. Het. | 0.717 | 0.552 | 0.472 | 0.611 | 0.378 | 0.632 | 0.396 | 0.506 | 0.656 | 0.594 |
| Exp. Het. | 0.643 | 0.503 | 0.547 | 0.648 | 0.589 | 0.763 | 0.579 | 0.572 | 0.661 | 0.583 |
| HWE p-value | 0.766 | 0.251 | 0.611 | 0.765 | 0.079 | 0.392 | 0.013 | 0.429 | 0.869 | 0.442 |
US United States, CN China, JP Japan, TH Thailand, Obs: Observed, Exp Expected, Het Heterozygosity, HWE Hardy-Weinberg Equilibrium
Subpopulation pairwise Fst
| US 1 | US 2 | US 3 | CN 1 | CN 2 | CN 3 | JP 1 | JP 2 | JP 3 | TH | |
|---|---|---|---|---|---|---|---|---|---|---|
| US 1 | – | 0.811 | 0.541 | 0.441 | 0.009 | 0.000* | 0.297 | 0.126 | 0.099 | 0.000* |
| US 2 | −0.023 | – | 0.378 | 0.730 | 0.009 | 0.000* | 0.432 | 0.108 | 0.072 | 0.000* |
| US 3 | − 0.011 | − 0.008 | – | 0.306 | 0.009 | 0.000* | 0.360 | 0.946 | 0.153 | 0.000* |
| CN 1 | − 0.009 | − 0.022 | 0.024 | – | 0.108 | 0.009 | 0.901 | 0.162 | 0.108 | 0.000* |
| CN 2 | 0.075 | 0.098 | 0.099 | 0.075 | – | 0.297 | 0.207 | 0.081 | 0.739 | 0.000* |
| CN 3 | 0.077 | 0.107 | 0.120 | 0.073 | 0.022 | – | 0.063 | 0.000* | 0.324 | 0.000* |
| JP 1 | 0.015 | −0.002 | 0.022 | −0.035 | 0.051 | 0.064 | – | 0.207 | 0.486 | 0.000* |
| JP 2 | 0.016 | 0.025 | −0.030 | 0.049 | 0.078 | 0.085 | 0.042 | – | 0.135 | 0.000* |
| JP 3 | 0.029 | 0.028 | 0.042 | 0.037 | −0.014 | 0.006 | 0.002 | 0.036 | – | 0.000* |
| TH | 0.315 | 0.370 | 0.377 | 0.330 | 0.310 | 0.244 | 0.347 | 0.352 | 0.322 | – |
Below diagonal pairwise Fst values, above diagonal p-values
US United States, CN China, JP Japan, TH Thailand
* = significant under Bonferroni correction (p < 0.001)
Fig. 6Neighbor joining distance tree based on Fst values and 10,000 bootstraps. US = United States; CN = China; JP = Japan; and TH = Thailand
Fig. 7Sampling sites: (a) United States: US 1, US 2, US 3 (25); (b) Japan: JP 1, JP 2, JP 3 (22); and (c) China: CN 1, CN 2, CN 3 (21) and Thailand: TH (7)