| Literature DB >> 25653903 |
Kevin Weitemier1, Shannon C K Straub2, Mark Fishbein3, Aaron Liston1.
Abstract
Despite knowledge that concerted evolution of high-copy loci is often imperfect, studies that investigate the extent of intragenomic polymorphisms and comparisons across a large number of species are rarely made. We present a bioinformatic pipeline for characterizing polymorphisms within an individual among copies of a high-copy locus. Results are presented for nuclear ribosomal DNA (nrDNA) across the milkweed genus, Asclepias. The 18S-26S portion of the nrDNA cistron of Asclepias syriaca served as a reference for assembly of the region from 124 samples representing 90 species of Asclepias. Reads were mapped back to each individual's consensus and at each position reads differing from the consensus were tallied using a custom perl script. Low frequency polymorphisms existed in all individuals (mean = 5.8%). Most nrDNA positions (91%) were polymorphic in at least one individual, with polymorphic sites being less frequent in subunit regions and loops. Highly polymorphic sites existed in each individual, with highest abundance in the "noncoding" ITS regions. Phylogenetic signal was present in the distribution of intragenomic polymorphisms across the genus. Intragenomic polymorphisms in nrDNA are common in Asclepias, being found at higher frequency than any other study to date. The high and variable frequency of polymorphisms across species highlights concerns that phylogenetic applications of nrDNA may be error-prone. The new analytical approach provided here is applicable to other taxa and other high-copy regions characterized by low coverage genome sequencing (genome skimming).Entities:
Keywords: 2ISP; Asclepias; Concerted evolution; Genome skimming; High-copy; ITS; Intra-individual site polymorphism; Intragenomic polymorphism; Nuclear ribosomal DNA (nrDNA); Partial SNP (pSNP)
Year: 2015 PMID: 25653903 PMCID: PMC4304868 DOI: 10.7717/peerj.718
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Polymorphic site abundance in Asclepias.
Polymorphic site abundance in Asclepias taxa.
| Voucher | Poly # | Poly % | High # | High % | SRA | |
|---|---|---|---|---|---|---|
| Fishbein 3146 [WS] | 174 | 3.12 | 16 | 0.29 |
| |
|
| Fishbein 6463 [OKLA] | 273 | 4.7 | 11 | 0.19 |
|
| Steinmann 5243 [IEB] | 668 | 11.45 | 31 | 0.53 |
| |
| Lynch 12652 [OKLA] | 248 | 4.25 | 25 | 0.43 |
| |
| Reina 2004-1315 [ARIZ] | 55 | 1.04 | 36 | 0.68 |
| |
|
| Reina 2008-203 [OKLA] | 169 | 2.99 | 26 | 0.46 |
|
| Lynch 11495 [OKLA] | 174 | 2.98 | 2 | 0.03 |
| |
| Lynch 12037 [OKLA] | 342 | 5.87 | 7 | 0.12 |
| |
| Fishbein 6536 [OKLA] | 391 | 6.73 | 24 | 0.41 |
| |
| Lynch 13314 [OKLA] | 474 | 8.15 | 38 | 0.65 |
| |
| Fishbein 6486 [OKLA] | 347 | 5.95 | 6 | 0.1 |
| |
| Fishbein 3612 [ARIZ] | 451 | 7.73 | 26 | 0.45 |
| |
| Lynch 1694 [OKLA] | 513 | 8.85 | 33 | 0.57 |
| |
|
| Fishbein 5833 [OKLA] | 378 | 6.54 | 39 | 0.68 |
|
| Fishbein 6072 [OKLA] | 875 | 15.08 | 111 | 1.91 |
| |
| Lynch 10642 [OKLA] | 309 | 5.32 | 22 | 0.38 |
| |
| Lynch 10779 [OKLA] | 472 | 8.09 | 24 | 0.41 |
| |
| Fishbein 6347 [OKLA] | 245 | 4.2 | 14 | 0.24 |
| |
| Fishbein 4793 [OKLA] | 297 | 5.1 | 27 | 0.46 |
| |
| Webster 17186 [OKLA] | 464 | 7.95 | 59 | 1.01 |
| |
| Lynch 12336 [OKLA] | 394 | 6.75 | 16 | 0.27 |
| |
| Lynch 10942 [OKLA] | 344 | 5.96 | 30 | 0.52 |
| |
|
| Fishbein 5772 [OKLA] | 308 | 5.35 | 13 | 0.23 |
|
| Ventura & Lopez 7986 [TEX] | 471 | 8.09 | 27 | 0.46 |
| |
| Fishbein 6504 [OKLA] | 230 | 4 | 35 | 0.61 |
| |
| Fishbein 5723 [OKLA] | 350 | 6 | 13 | 0.22 |
| |
| Zuloaga & Morrone 7087 [OKLA] | 258 | 4.42 | 24 | 0.41 |
| |
| Fishbein 6511 [OKLA] | 167 | 2.9 | 18 | 0.31 |
| |
|
| Fishbein 6500 [OKLA] | 157 | 2.69 | 8 | 0.14 |
|
| Carr 12032 [TEX] | 124 | 2.29 | 44 | 0.81 |
| |
| Lynch 11224 [OKLA] | 141 | 2.54 | 33 | 0.59 |
| |
|
| Lynch 11029 [OKLA] | 192 | 3.3 | 7 | 0.12 |
|
| Lynch 10923 [OKLA] | 610 | 10.49 | 32 | 0.55 |
| |
|
| Lynch 10799 [OKLA] | 492 | 8.43 | 17 | 0.29 |
|
| Fishbein 5586 [OKLA] | 329 | 5.71 | 42 | 0.73 |
| |
| Fishbein 3660 [ARIZ] | 556 | 9.58 | 47 | 0.81 |
| |
|
| Lynch 1655 [OKLA] | 711 | 12.25 | 49 | 0.84 |
|
| Lynch 14142 [OKLA] | 190 | 3.33 | 19 | 0.33 |
| |
|
| Lynch 1623 [OKLA] | 90 | 1.69 | 85 | 1.6 |
|
|
| Fishbein 5097 [OKLA] | 215 | 3.72 | 18 | 0.31 |
|
|
| Fishbein 3671 [ARIZ] | 225 | 3.87 | 25 | 0.43 |
|
| Lynch 11299 [OKLA] | 600 | 10.36 | 54 | 0.93 |
| |
| Fishbein 5596 [OKLA] | 331 | 5.67 | 13 | 0.22 |
| |
| Lynch 11374 [OKLA] | 670 | 11.51 | 44 | 0.76 |
| |
| Lynch 12567 [OKLA] | 434 | 7.45 | 28 | 0.48 |
| |
| Lynch 12050 [OKLA] | 326 | 5.65 | 35 | 0.61 |
| |
|
| Fishbein 6531 [OKLA] | 217 | 3.72 | 13 | 0.22 |
|
| Fishbein 2493 [ARIZ] | 140 | 2.97 | 74 | 1.57 |
| |
|
| Fishbein 3657 [WS] | 165 | 3 | 65 | 1.18 |
|
| Vásquez & Alvarez 4905 [IEB] | 160 | 2.76 | 12 | 0.21 |
| |
| Fishbein 5605 [MISSA] | 249 | 4.27 | 8 | 0.14 |
| |
| Lynch 12661 [OKLA] | 343 | 5.92 | 11 | 0.19 |
| |
| Lynch 11018 [OKLA] | 236 | 4.05 | 11 | 0.19 |
| |
| Lynch 11453 [OKLA] | 620 | 10.7 | 23 | 0.4 |
| |
| Fishbein 6263 [OKLA] | 282 | 4.84 | 9 | 0.15 |
| |
| Lynch 12447 [OKLA] | 427 | 7.34 | 24 | 0.41 |
| |
| Venable & Becerra s.n. [ARIZ] | 129 | 2.39 | 31 | 0.57 |
| |
| Gierisch 4191 [ARIZ] | 76 | 1.42 | 30 | 0.56 |
| |
|
| Fishbein 6518 [OKLA] | 391 | 6.72 | 21 | 0.36 |
|
| Lynch 11260 [OKLA] | 364 | 6.27 | 11 | 0.19 |
| |
|
| Lynch 11263 [OKLA] | 260 | 4.46 | 5 | 0.09 |
|
| Fishbein 3101 [OKLA] | 151 | 2.59 | 7 | 0.12 |
| |
| Freeman 9106 [KANU] | 208 | 3.62 | 20 | 0.35 |
| |
| Zuloaga & Morrone 7168 [OKLA] | 377 | 6.47 | 18 | 0.31 |
| |
| Fishbein 3009 [ARIZ] | 186 | 3.22 | 18 | 0.31 |
| |
| Lynch 12316 [OKLA] | 458 | 7.87 | 39 | 0.67 |
| |
| Lynch 14113 [OKLA] | 375 | 6.45 | 44 | 0.76 |
| |
|
| Fishbein 5389 [OKLA] | 249 | 4.31 | 41 | 0.71 |
|
|
| Nee 32966 [NY] | 432 | 7.47 | 17 | 0.29 |
|
|
| Fishbein 5816 [OKLA] | 376 | 6.45 | 16 | 0.27 |
|
| Fishbein 6268 [OKLA] | 109 | 1.92 | 23 | 0.4 |
| |
| Lynch 11543 [OKLA] | 708 | 12.75 | 87 | 1.57 |
| |
| Fishbein 5819 [OKLA] | 230 | 3.95 | 15 | 0.26 |
| |
|
| Lynch 13339 [OKLA] | 100 | 1.74 | 16 | 0.28 |
|
|
| Lynch 11477 [OKLA] | 134 | 2.3 | 7 | 0.12 |
|
| Bellsey 97-5 [ARIZ] | 745 | 12.8 | 35 | 0.6 |
| |
|
| Lynch 1533 [OKLA] | 659 | 11.32 | 20 | 0.34 |
|
|
| Fishbein 5857 [OKLA] | 697 | 11.95 | 48 | 0.82 |
|
| Lynch 13546 [OKLA] | 333 | 5.91 | 42 | 0.75 |
| |
| Laferrière 1478 [MO] | 324 | 5.59 | 39 | 0.67 |
| |
| Fishbein 5136 [OKLA] | 439 | 7.53 | 9 | 0.15 |
| |
| Lynch 12408 [OKLA] | 356 | 6.11 | 15 | 0.26 |
| |
| Zuloaga & Morrone 7069 [OKLA] | 458 | 7.85 | 23 | 0.39 |
| |
| Fishbein 5143 [OKLA] | 410 | 7.11 | 29 | 0.5 |
| |
|
| Pérez 1850 [MO] | 609 | 10.49 | 34 | 0.59 |
|
| Fishbein 2432 [ARIZ] | 465 | 7.99 | 41 | 0.7 |
| |
| Lynch 12847 [OKLA] | 233 | 4.05 | 22 | 0.38 |
| |
|
| Fishbein 5654 [MISSA] | 216 | 3.73 | 7 | 0.12 |
|
| Webb s.n. [ARIZ] | 116 | 2.33 | 41 | 0.82 |
| |
|
| Fishbein 6545 [OKLA] | 204 | 3.55 | 31 | 0.54 |
|
| Lynch 1656 [OKLA] | 747 | 12.86 | 33 | 0.57 |
| |
| Riser 329 [WS] | 31 | 0.61 | 23 | 0.45 |
| |
| Ellison s.n. [HPSU] | 158 | 2.96 | 33 | 0.62 |
| |
|
| Fishbein 6525 [OKLA] | 237 | 4.11 | 38 | 0.66 |
|
|
| Riser 335 [WS] | 68 | 1.18 | 26 | 0.45 |
|
| Fishbein 2951 [ARIZ] | 500 | 8.61 | 35 | 0.6 |
| |
| Fishbein 5846 [OKLA] | 327 | 5.65 | 36 | 0.62 |
| |
| Fishbein 5137 [OKLA] | 586 | 10.05 | 15 | 0.26 |
| |
|
| Zamudio 5234 [MEXU] | 272 | 4.88 | 35 | 0.63 |
|
| Fishbein 3000 [ARIZ] | 294 | 5.05 | 8 | 0.14 |
| |
|
| Fishbein 5148 [MISSA] | 386 | 6.71 | 45 | 0.78 |
|
| Lynch 10884 [OKLA] | 882 | 15.13 | 35 | 0.6 |
| |
| Lynch 10981 [OKLA] | 242 | 4.19 | 23 | 0.4 |
| |
| Reina 98-579 [WS] | 150 | 2.57 | 19 | 0.33 |
| |
| Fishbein 3518 [WS] | 182 | 3.31 | 23 | 0.42 |
| |
|
| Lynch 1008 [OKLA] | 383 | 6.86 | 77 | 1.38 |
|
| Fishbein 6434 [OKLA] | 269 | 4.64 | 7 | 0.12 |
| |
|
| Fishbein 6446 [OKLA] | 370 | 6.36 | 18 | 0.31 |
|
| Fishbein 3142 [WS] | 299 | 5.14 | 12 | 0.21 |
| |
| Fishbein 2948 [ARIZ] | 160 | 3.09 | 92 | 1.77 |
| |
| Lynch 11138 [OKLA] | 204 | 3.51 | 11 | 0.19 |
| |
|
| Fishbein 4885 [OKLA] | 298 | 5.1 | 7 | 0.12 |
|
| Fishbein 2810 [ARIZ] | 161 | 2.89 | 37 | 0.66 |
| |
| Fishbein 5608 [MISSA] | 198 | 3.39 | 11 | 0.19 |
| |
| Fishbein 2816 [ARIZ] | 685 | 11.84 | 29 | 0.5 |
| |
| Fishbein 4825 [MISSA] | 297 | 5.1 | 31 | 0.53 |
| |
| Lynch 12526 [OKLA] | 251 | 4.33 | 11 | 0.19 |
| |
| Fishbein 6494 [OKLA] | 282 | 4.86 | 19 | 0.33 |
| |
| Lynch 12787 [OKLA] | 375 | 6.5 | 29 | 0.5 |
| |
| Lynch 11102 [OKLA] | 23 | 0.41 | 21 | 0.37 |
| |
| Lynch 10735 [OKLA] | 506 | 8.68 | 14 | 0.24 |
| |
| Lynch 12955 [OKLA] | 261 | 4.55 | 40 | 0.7 |
| |
| Fishbein 4806 [MISSA] | 425 | 7.37 | 46 | 0.8 |
| |
| Lynch 11369 [OKLA] | 364 | 6.53 | 73 | 1.31 |
| |
| D. A. Neil 242 [MO] | 122 | 2.28 | 30 | 0.56 |
|
Notes.
Voucher: Collector, collection #, [herbarium]; Poly #: Number of polymorphic positions; Poly %: Percentage of assembled positions that are polymorphic; High #, High %: Number and percentage of highly polymorphic positions; SRA: NCBI Sequence Read Archive accession number
Figure 1Polymorphic site frequency among species of Asclepias.
Histogram of polymorphic site frequency among species of Asclepias. Individuals contained from 0.4% to 15.1% polymorphic sites.
Figure 2Polymorphic sites across the nrDNA cistron of Asclepias.
Number of individuals that are (A) polymorphic and (B) highly polymorphic at each position. Polymorphic positions are those with ≥2% of reads differing from the consensus; highly polymorphic positions are those with ≥10% differing reads. Subunit regions, white background; spacer regions, shaded background. Numbers in each region are the percentage of sites polymorphic or highly polymorphic in at least one individual.
Figure 3Polymorphism probability by region and structure.
Probability that at least one individual is (A) polymorphic or (B) highly polymorphic at a position that is either within a spacer (ITS1, ITS2) or subunit region (18S, 5.8S, 26S), and either paired (stems) or unpaired (loops). Error bars indicate 95% confidence intervals. Values derived from two-factor multiple logistic regressions (Table 3).
ANOVA of the number of polymorphic individuals at nrDNA positions, by position type.
Two-way ANOVA of the number of polymorphic individuals at nrDNA positions categorized as either subunit (18S, 5.8S, 26S) or spacer regions (ITS1, ITS2), and as either paired (stems) or unpaired (loops). More individuals are likely to be polymorphic at sites that are in spacer regions over subunit regions, and that are paired over unpaired. Bold values indicate categories that significantly affect polymorphism abundance (P < 0.05).
| Source of variation | Sum of squares |
|
| |
|---|---|---|---|---|
| (Intercept) | 8,310.6 | 1 | ||
|
| 109.1 | 1 |
|
|
|
| 16.4 | 1 |
|
|
| Paired* Subunit | 0.4 | 1 | 0.1859 | 0.6663 |
| Residuals | 11,878.9 | 5,835 |
Notes.
df, degrees of freedom.
Multiple logistic regression of the likelihood of nrDNA position polymorphism, by position type.
Two-factor multiple logistic regression of the likelihood of nrDNA positions being (A) polymorphic or (B) highly polymorphic in at least one individual. Positions are categorized as either within a subunit (18S, 5.8S, 26S) or spacer region (ITS1, ITS2), and as either paired (stem) or unpaired (loop). Odds ratios indicate whether a category decreases (<1) or increases (>1) the likelihood a position is polymorphic or highly polymorphic. The intercept represents paired, spacer positions. Categories that significantly affect polymorphism likelihood are indicated by italics (P < 0.1) or boldface (P < 0.05). Polymorphism probabilities for each category are presented in Fig. 3.
|
| ||||||
|---|---|---|---|---|---|---|
| Source | Odds ratio | 95% CI | Coefficient estimate | Std. error |
| |
| (Intercept) | 3.7136 | 0.3825 | ||||
|
|
|
| − |
| − |
|
|
|
|
| − |
| − |
|
|
|
|
|
|
|
| |
|
| ||||||
| (Intercept) | 0.48551 | 0.1201 | ||||
|
|
|
| −1.35881 | 0.12564 | −10.815 | < |
|
|
|
|
|
|
|
|
| Subunit * Unpaired | 0.970 | 0.651–1.444 | −0.03061 | 0.20315 | −0.151 | 0.8802 |
Notes.
confidence interval
standard error
Phylogenetic signal tests across Asclepias for the number of polymorphic or highly polymorphic sites.
Tests for phylogenetic signal across Asclepias for the number of polymorphic or highly polymorphic positions across the entire nrDNA cistron (Subunits + spacers) or just the subunits (Subunits only).
| Parsimony permutations | Lambda | log |
| ||
|---|---|---|---|---|---|
| Polymorphic sites | Subunits + spacers | <0.0012 | 0.51 |
| 0.0067 |
| Subunits only | <0.0200 | 0.45 |
| 0.0112 | |
| Highly polymorphic sites | Subunits + spacers | >0.6050 | <0.0001 |
| 1 |
| Subunits only | >0.5300 | <0.0001 |
| 1 |
Notes.
the proportion of permutations with a shorter tree length than the true data
the maximum likelihood estimate of lambda
the log-likelihood ratio of the unconstrained model including the estimated lambda over the constrained model with lambda = 0
the probability of obtaining a likelihood ratio this small or smaller by chance alone
Figure 4Ancestral state reconstruction of polymorphic site abundance.
Ancestral state reconstruction of the number of polymorphic positions in nrDNA in Asclepias obtained with squared-change parsimony. The tree topology is that pruned from Fig. 2 of Fishbein et al. (2011) with clades indicated by letters, following that study.
Figure 5Ancestral state reconstruction of highly polymorphic site abundance.
Ancestral state reconstruction of the number of highly polymorphic positions in nrDNA in Asclepias obtained with squared-change parsimony. The tree topology is that pruned from Fig. 2 of Fishbein et al. (2011) with clades indicated by letters, following that study.