| Literature DB >> 24832669 |
Tamaki Yuri1, Rebecca T Kimball2, John Harshman3, Rauri C K Bowie4, Michael J Braun5, Jena L Chojnowski6, Kin-Lan Han7, Shannon J Hackett8, Christopher J Huddleston9, William S Moore10, Sushma Reddy11, Frederick H Sheldon12, David W Steadman12, Christopher C Witt12, Edward L Braun13.
Abstract
Insertion/deletion (indel) mutations, which are represented by gaps in multiple sequence alignments, have been used to examine phylogenetic hypotheses for some time. However, most analyses combine gap data with the nucleotide sequences in which they are embedded, probably because most phylogenetic datasets include few gap characters. Here, we report analyses of 12,030 gap characters from an alignment of avian nuclear genes using maximum parsimony (MP) and a simple maximum likelihood (ML) framework. Both trees were similar, and they exhibited almost all of the strongly supported relationships in the nucleotide tree, although neither gap tree supported many relationships that have proven difficult to recover in previous studies. Moreover, independent lines of evidence typically corroborated the nucleotide topology instead of the gap topology when they disagreed, although the number of conflicting nodes with high bootstrap support was limited. Filtering to remove short indels did not substantially reduce homoplasy or reduce conflict. Combined analyses of nucleotides and gaps resulted in the nucleotide topology, but with increased support, suggesting that gap data may prove most useful when analyzed in combination with nucleotide substitutions.Entities:
Year: 2013 PMID: 24832669 PMCID: PMC4009869 DOI: 10.3390/biology2010419
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Figure 1Estimate of avian phylogeny based upon nucleotide sequence data (maximum likelihood [ML] tree using the GTR+I+Γ model) and the higher-level classification described in the text. Nodes with 100% support are indicated with an asterisk. Red asterisks indicate nodes with 100% support that define supra-ordinal clades with extensive independent corroboration (see below). Coloring conventions here will be used in all trees, and named supra-ordinal clades are indicated using letters below branches (see Table 1 for details).
Supra-ordinal clades in the Early Bird tree and their corroboration by independent evidence (from mitogenomics [28,59], analyses of nuclear regions [30,31,63,64] that were not used by Hackett et al. [13] and transposable element (TE) insertions [60,61,62]). Strong corroboration (bootstrap support ≥70% or ≥3 TE insertions) was indicated using “++” and moderate corroboration (presence of the clade in with bootstrap support <70% or 1–2 TE insertions) was indicated using “+”. Blank cells indicate that the available independent evidence could not address the presence or absence of the clade, whereas “—” indicates evidence contradicting the clade. Citations for the introduction of clade names are included; names without citations were introduced here (supporting information, file 2).
| Support from Independent Evidence | ||||
|---|---|---|---|---|
| Clade | Name | Mitochondrial | Other Nuclear | TE insertions |
| A | Psittacopasserae [ | — | + | ++ |
| B | Eufalconimorphae [ | — | — | ++ |
| C | Australaves [ | — | + | + |
| D | Picodynastornithes | — | ++ | |
| E | Picocoraciae [ | — | ++ | |
| F | Eucavitaves (CPBT clade [ | ++ | ++ | |
| G | Cavitaves | |||
| H | Telluraves (“Landbirds” [ | — | ++ | + |
| I | Litoritelluraves | — | + | + |
| J | Austrodyptornithes | + | ||
| K | Aequornithes [ | ++ | ||
| L | Insolitaves | — | — | |
| M | Strisores [ | + | ||
| N | Novaeratitae | ++ | ++ | + |
| O | Notopalaeognathae | ++ | ++ | ++ |
Figure 2Branch lengths estimated from gap data (using the CFNv+Γ model) plotted against branch lengths from all nucleotide data (estimated using the GTR+I+Γ model). Branch length estimates for specific nucleotide partitions (introns, coding exons and 3' untranslated regions [UTRs]) are presented for comparison of relative rates (next page).
Figure 3Estimates of avian phylogeny obtained using 12,030 gap characters obtained using (a) ML analyses with the CFNv+Γ model and (b) the maximum parsimony (MP) criterion. Orders were collapsed when monophyletic to simplify the trees. Bootstrap support on terminal branches reflects the support of those orders; orders represented by a single taxon are indicated using “(1)”. There were a limited number of rearrangements relative to the nucleotide topology within orders, most without bootstrap support. We highlighted the topology for the order Galliformes, because the gap topology included a clade with bootstrap support that conflicts with multiple nuclear gene regions [8,83] and morphology [84].
Figure 4Branch length heterogeneity evident in the (a) optimal nucleotide tree (based upon the GTR+I+Γ model) and (b) the optimal gap tree (based upon the CFNv+Γ model).
Retention indices [88] for gap characters and nucleotide data. Retention indices were calculated using the ML topologies for nucleotides (Figure 1) or gaps (Figure 3a).
| Topology | |||
|---|---|---|---|
| Data Matrix | Nucleotide tree | Gap tree | |
| Gaps | |||
| All | 0.7154 | 0.7209 | |
| >1-bp (excluding 1-bp gaps) | 0.7141 | 0.7190 | |
| >2-bp (excluding 1- and 2-bp gaps) | 0.7238 | 0.7288 | |
| Nucleotides | |||
| All | 0.5231 | 0.5188 | |
| Introns | 0.5206 | 0.5167 | |
| Coding exons | 0.5315 | 0.5251 | |
| 3' untranslated regions | 0.5632 | 0.5597 | |
Figure 5(a) Comparison of bootstrap support in trees based on all gap characters and gap characters >1-bp in length. Bipartitions that appeared well supported (≥70% bootstrap) by one analysis and poorly supported (<50% bootstrap) in the other are shaded. Numbers correspond to the following bipartitions: 1. Ardea-Cochlearius-Eudocimus; 2. Alisterus-Psittacula; 3. Chalcopsitta-Platycercus; and 4. Picodynastornites. (b) Comparison of bootstrap support for analyses using all gap characters and RY-coded nucleotide data. The same numbers of informative characters were used in each of these analyses (next page).
Figure 6Combined evidence estimate of the avian tree of life. A partitioned ML analysis was conducted using the GTR+I+Γ model for the nucleotide partition and the CFNv+Γ model for the gap partition. Arrows indicate nodes defining supra-ordinal clades where bootstrap support increased or decreased by more than 10% relative to the nucleotide analysis (Figure 1). The combined evidence topology for Columbiformes was congruent with the gap topology instead of the nucleotide topology (inset; bootstrap values are reported for combined analysis [above branches] and for gap characters [below branches]).