| Literature DB >> 31058965 |
Craig F Barrett1, Brandon T Sinn1, Aaron H Kennedy2.
Abstract
Heterotrophic plants are evolutionary experiments in genomic, morphological, and physiological change. Yet, genomic sampling gaps exist among independently derived heterotrophic lineages, leaving unanswered questions about the process of genome modification. Here, we have sequenced complete plastid genomes for all species of the leafless orchid genus Hexalectris, including multiple individuals for most, and leafy relatives Basiphyllaea and Bletia. Our objectives are to determine the number of independent losses of photosynthesis and to test hypotheses on the process of genome degradation as a result of relaxed selection. We demonstrate four to five independent losses of photosynthesis in Hexalectris based on degradation of the photosynthetic apparatus, with all but two species displaying evidence of losses, and variation in gene loss extending below the species level. Degradation in the atp complex is advanced in Hexalectris warnockii, whereas only minimal degradation (i.e., physical loss) has occurred among some "housekeeping" genes. We find genomic rearrangements, shifts in Inverted Repeat boundaries including complete loss in one accession of H. arizonica, and correlations among substitutional and genomic attributes. Our unprecedented finding of multiple, independent transitions to a fully mycoheterotrophic lifestyle in a single genus reveals that the number of such transitions among land plants is likely underestimated. This study underscores the importance of dense taxon sampling, which is highly informative for advancing models of genome evolution in heterotrophs. Mycoheterotrophs such as Hexalectris provide forward-genetic opportunities to study the consequences of radical genome evolution beyond what is possible with mutational studies in model organisms alone.Entities:
Keywords: chloroplast; deletion; evolution; heterotroph; phylogenomics; plastome; pseudogene; selection
Mesh:
Year: 2019 PMID: 31058965 PMCID: PMC6736286 DOI: 10.1093/molbev/msz111
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Details of Accessions Included in This Study.
| Coll.# | Genus | Species | Loc | Length | IR | LSC | SSC | GC | x-cov | #Reads | Mapped | %Plastid Reads |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Baco1 |
|
| FL | 158,368 | 26,910 | 86,711 | 17,873 | 37.3 | 60.8 | 13,219,124 | 139,287 | 1.05 |
| 32 |
|
| JA | 158,720 | 26,860 | 87,177 | 17,823 | 37.2 | 598.0 | 14,792,462 | 940,676 | 6.36 |
| 47 |
|
| TX | 109,543 | 4,475 | 90,507 | 10,086 | 35.9 | 169.8 | 11,912,444 | 184,256 | 1.55 |
| 271 |
|
| TX | 105,192 | n/a | n/a | n/a | 35.5 | 485.3 | 46,290,820 | 507,602 | 1.10 |
| 377 |
|
| AZ | 109,461 | 4,306 | 90,957 | 9,892 | 35.9 | 415.7 | 87,738,174 | 451,735 | 0.52 |
| 36 |
|
| JA | 138,101 | 31,225 | 68,147 | 7,504 | 37.1 | 443.0 | 69,828,048 | 608,415 | 0.87 |
| 41 |
|
| JA | 137,846 | 31,263 | 67,707 | 7,399 | 37.1 | 193.1 | 26,559,816 | 170,483 | 0.64 |
| 166 |
|
| AZ | 117,252 | 4,865 | 97,471 | 10,051 | 36.2 | 36.6 | 6,253,740 | 37,771 | 0.60 |
| 64 |
|
| TX | 152,460 | 27,090 | 83,614 | 14,669 | 36.8 | 551.0 | 49,121,014 | 834,057 | 1.70 |
| 48 |
|
| TX | 111,720 | 15,147 | 71,963 | 9,063 | 37.2 | 72.6 | 12,620,482 | 88,569 | 0.70 |
| 79 |
|
| TX | 111,592 | 15,075 | 72,400 | 9,042 | 37.1 | 100.4 | 12,461,696 | 110,985 | 0.89 |
| 270 |
|
| TX | 111,519 | 15,148 | 72,181 | 9,042 | 37.1 | 146.7 | 13,268,698 | 161,991 | 1.22 |
| 85 |
|
| JA | 136,661 | 27,129 | 71,757 | 10,508 | 37 | 398.8 | 46,935,030 | 540,450 | 1.15 |
| 500 |
|
| CH | 134,807 | 26,814 | 70,342 | 10,837 | 37.1 | 112.1 | 12,089,618 | 149,663 | 1.24 |
| 320 |
|
| TX | 145,345 | 25,555 | 83,813 | 10,422 | 36.9 | 206.3 | 14,764,714 | 297,143 | 2.01 |
| 44 |
|
| NC | 120,704 | 23,681 | 63,566 | 9,776 | 37.2 | 761.3 | 35,558,544 | 911,263 | 2.56 |
| 50 |
|
| TX | 120,185 | 23,582 | 62,242 | 10,779 | 37.2 | 1137.5 | 47,730,484 | 1,348,183 | 2.83 |
| 267 |
|
| OK | 121,646 | 23,681 | 63,674 | 10,610 | 37.2 | 932.7 | 61,605,466 | 1,125,190 | 1.83 |
| 276 |
|
| IN | 120,027 | 23,564 | 62,140 | 10,759 | 37.3 | 45.6 | 19,701,640 | 55,164 | 0.28 |
| 49 |
|
| TX | 119,057 | 17,332 | 66,903 | 17,490 | 36.9 | 703.5 | 38,633,900 | 829,431 | 2.15 |
| 69 |
|
| TX | 119,058 | 17,339 | 66,938 | 17,442 | 36.9 | 115.3 | 14,329,294 | 135,738 | 0.95 |
Note.—Coll.#, collection number; Loc, locality (US or Mexican state); length, plastome size in bp; IR, length of the inverted repeat; LSC, large single copy region; SSC, small single copy region; GC, GC content of the plastome excluding one IR copy; x-cov, mean coverage depth of the plastome; #reads, total number of reads generated; mapped, the number of reads mapping to the completed plastome; %plastid reads, percentage of the total read pool mapping to the plastome; and GenBank, NCBI accession number.
. 1.Phylogenetic trees for LCB (above) and CDS matrices (below). * = pp of 1, ML bootstrap value of 100%, and Parsimony jackknife value of 100%; otherwise, values are indicated respectively. Scale bar and branch lengths = substitutions per site from Bayesian trees.
. 2.Scaled representation of plastid genomic regions in Bletia, Basiphyllaea, and Hexalectris, ranked by decreasing total plastome length. Black arrows = IR copies and gray arrows = genomic inversions relative to Bletia and other angiosperm plastomes. Thin black lines on the left = LSC region and to the right = small single copy region. Note that Hexalectris arizonica accession 271 lacks an IR.
. 3.Functional gene content of Bletia, Basiphyllaea, and Hexalectris based on ancestral state reconstruction of each gene under a loss-only (Dollo) model of transition probabilities, using the “CDS” matrix. Filled rectangles = pseudogenes and open rectangles = physical gene losses. “?” for trnL = evidence of length variation in the “variable loop” region, which may be a precursor to pseudogene formation.
. 4.Trees and ancestral state reconstructions of photosynthetic capability based on (A) the “CDS” matrix and (B) the “LCB” matrix. Photosynthetic capability is represented as a binary character, under a model allowing only losses of photosynthesis and no gains. Green circles/pie charts represent the probability of a photosynthetic ancestor; red represents a nonphotosynthetic ancestor. Numbers in black next to nodes represent the likelihood fraction of the most likely ancestral state; all other nodes have likelihoods >0.95. Numbers in red indicate the inferred instances of photosynthetic loss. Arrows indicate the alternative position of Hexalectris revoluta based on each matrix.
Tests of Alternative Models of Transition Probabilities Between Photosynthetic Character States (0 = Photosynthetic and 1 = Nonphotosynthetic).
| Model |
| ln | fp | AICc | AICweight |
|
|---|---|---|---|---|---|---|
| ER, equal rates |
| −6.78896 | 1 | 15.58 | 0.42 | 0.71; 0.71 |
| ARD, different rates |
| −6.78896 | 2 | 17.58 | 0.15 | 0.71; 0.72 |
| Gain-only |
| −6.82156 | 1 | 15.64 | 0.41 | 0; 0.68 |
| Loss-only |
| −9.70410 | 1 | 21.41 | 0.02 | 1.14; 0 |
Note.—“q-matrix” specifies the specific state transition constraints for each model; “ln L” is the likelihood of the model; fp, the number of free model parameters; “AICc” is the corrected Akaike information criterion; “AICweight” is the Akaike weight for a particular model in favor of the alternatives; and “q01; q10” represent the estimated transition probabilities.
Results of RELAX Analyses Based on Genes Concatenated By Functional Class (atp, rps, and rpl; except in the case of accD, clpP, and matK Which Do Not Share Similar Functions).
| Model | log |
| Selection | Branch |
|
|
|
|---|---|---|---|---|---|---|---|
| All | |||||||
| Alternative | −8,293.4 | 1.96, 0.031 |
| Test | 0.01 (57.93%) | 0.06 (37.31%) | 10.17 (4.76%) |
| Reference | 0.07 (57.93%) | 0.25 (37.31%) | 3.26 (4.76%) | ||||
| Null | −8,295.7 | Test | 0.00 (58.21%) | 0.02 (35.58%) | 6.06 (6.21%) | ||
| Reference | 0.00 (58.21%) | 0.02 (35.58%) | 6.06 (6.21%) | ||||
|
| |||||||
| Alternative | −6,861.3 | 0.85, 0.180 | Relaxation | Test | 0.00 (3.08%) | 0.43 (96.78%) | 90.82 (0.14%) |
| Reference | 0.00 (3.08%) | 0.37 (96.78%) | 206.53 (0.14%) | ||||
| Null | −6,862.2 | Test | 0.00 (1.66%) | 0.40 (98.23%) | 172.60 (0.10%) | ||
| Reference | 0.00 (1.66%) | 0.40 (98.23%) | 172.60 (0.10%) | ||||
| All | |||||||
| Alternative | −8,319.9 | 2.54, 0.097 | Intensification | Test | 0.00 (72.92%) | 0.22 (12.65%) | 2.74 (14.43%) |
| Reference | 0.01 (72.92%) | 0.55 (12.65%) | 1.49 (14.43%) | ||||
| Null | −8,321.2 | Test | 0.00 (72.30%) | 0.61 (11.21%) | 1.78 (16.49%) | ||
| Reference | 0.00 (72.30%) | 0.61 (11.21%) | 1.78 (16.49%) | ||||
| All | |||||||
| Alternative | −5,321.3 | 1.13, 0.340 | Intensification | Test | 0.00 (17.27%) | 0.00 (79.02%) | 23.74 (3.71%) |
| Reference | 0.00 (17.27%) | 0.00 (79.02%) | 16.41 (3.71%) | ||||
| Null | −5,321.8 | Test | 0.01 (80.86%) | 0.04 (15.67%) | 21.22 (3.47%) | ||
| Reference | 0.01 (80.86%) | 0.04 (15.67%) | 21.22 (3.47%) |
Note.—“Model” refers to the null versus alternative models; “log L” is the log-likelihood for a particular model; K = the value of the “K” parameter, where K > 1 indicates intensified selection and K < 1 indicates relaxed selection, and P-val = the significance of K; “Selection” indicates the predominant mode of selection for the test branches; “Branch” distinguishes parameter estimates for test versus reference branches; ω1–ω3 are the three estimates of dN/dS for the three site classes in the model, for test and reference branches, with the proportion of sites under each value given in parentheses. Analyses of single genes can be found in supplementary table S3, Supplementary Material online.