| Literature DB >> 23554903 |
Jerome C Regier1, Charles Mitter, Andreas Zwick, Adam L Bazinet, Michael P Cummings, Akito Y Kawahara, Jae-Cheon Sohn, Derrick J Zwickl, Soowon Cho, Donald R Davis, Joaquin Baixeras, John Brown, Cynthia Parr, Susan Weller, David C Lees, Kim T Mitter.
Abstract
BACKGROUND: Higher-level relationships within the Lepidoptera, and particularly within the species-rich subclade Ditrysia, are generally not well understood, although recent studies have yielded progress. We present the most comprehensive molecular analysis of lepidopteran phylogeny to date, focusing on relationships among superfamilies. METHODOLOGY PRINCIPALEntities:
Mesh:
Year: 2013 PMID: 23554903 PMCID: PMC3595289 DOI: 10.1371/journal.pone.0058568
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Previous hypotheses of deep-level relationships in Lepidoptera.
A. Composite working hypothesis based on morphology [7]. B. Ditrysian-only relationships (rooted on Tineoidea) inferred from degen1 ML analysis of 123 taxa sequenced either for 5 or 26 gene segments, with bootstrap values ≥50% displayed for nodes at the superfamily level and above [6]. C. Lepidopteran relationships (rooted on Micropterigoidea) inferred from ML analysis of 350 taxa, using nucleotides from the first and second codon positions (+ third codon position for EF-1α only) of 8 gene segments, with bootstrap values >50% displayed for nodes at the superfamily level and above [5]. Numbers in parentheses after taxon names are numbers of exemplars sampled.
Figure 2Assessing the effectiveness of the GARLI heuristic ML search through an analysis of 4608 search replicates as derived from the full 483-taxon, 19-gene, nt123_degen1 data set.
Out of 4608 search replicates, the single fully-resolved topology of highest likelihood is displayed (lnL = −583,900.053394). Terminal taxa, not shown in this figure in order to save space, are displayed in Figure S1. Dichotomous nodes that are not present in one or more strict consensuses of subsets of the 4608 topologies are identified by having numbers with blue coloration above subtending branches. The three subsets are as follows: 4, all topologies with lnL scores that are within 0.0001% (10-4 %) of that of the best ML topology (2 topologies total, including the best ML topology); 3, all topologies within 0.001% (10-3 %, 19 topologies total); 2, all topologies within 0.01% (10-2 %, 1827 topologies total). Selected bootstrap percentages based on 15 heuristic search replicates and 500 bootstrap pseudoreplicates that are ≥50% are displayed below branches (see Figure S1 for all bootstrap percentages). An orange-colored bar is placed beside each node that has bootstrap support ≥50% and that is missing in one or more of the subset consensuses. The dashed arrow identifies the altered placement of one (and only one) taxon that was found in a new and improved topology (lnL = −583,898.838616), when the dichotomous topology displayed in this figure was used as a starting tree in a second round of 561 GARLI ML heuristic searches. This new topology was recovered in 248 of the 561 search replicates. Higher-level taxon names, some of which are abbreviated, are displayed. All abbreviations follow: Anth+Phidit+Carth+Endrom., Anthelidae + Phiditidae + Carthaeidae + Endromidae; Saturn+Sphing+Bomb., Saturniidae + Sphingidae + Bombycidae; Eupterot+Brahm+Apat., Eupterotidae + Brahmaeidae + Apatelodidae; Nolidae+Stictopt., Nolidae + Stictopterinae; Notodont+Oenosand., Notodontidae + Oenosandridae; Uraniid.+Epiplemidae, Uraniidae + Epiplemidae; Sematur.+Epicopeiidae, Sematuridae + Epicopeiidae; Papilion., Papilionidae; Pterophorid+Copromorph., Pterophoridae (part) + Copromorphidae (part); Copromorph., Copromorphidae (part); Callidul., Callidulidae; “small ditrysian families”, Copromorphidae + Carposinidae + Epermeniidae + Alucitidae + Hyblaeidae + Pterophoridae (part) + Thyrididae + Pseudurgis (unplaced); Dalcer.+Limacodidae, Dalceridae + Limacodidae; Megalopyg+Aidos+Himant., Megalopygidae + Aidos + Himantopteridae; Zygaen.+Lacturidae, Zygaenidae + Lacturidae; Choreutid+Schreckenstein.+Douglasiidae, Choreutidae + Schreckensteiniidae + Douglasiidae.
Assessing the effectiveness of the GARLI heuristic bootstrap search by varying the number of search replicates performed per individual bootstrap pseudoreplicate in an analysis of 500 483-taxon, 19-gene, nt123_degen1, bootstrapped data sets.*
| Number of search replicates per bootstrap pseudoreplicate | |||||
| Node number | Taxonomic group |
| 5 | 10 | 15 |
| 75 | "butterflies" |
| 81 | 82 | 83 |
| 76 | "butterflies" − Papilionidae |
| 93 | 94 | 94 |
| 53 | Zygaenoidea subgroup |
| 62 | 61 | 61 |
| 50 | Zygaenoidea subgroup |
|
| 77 | 77 |
| Zygaenoidea subgroup |
| 87 | 87 | 87 | |
| Zygaenoidea subgroup |
| 78 | 77 | 78 | |
| 47 | Zygaenoidea sensu stricto |
|
| 95 | 96 |
| Pyraloidea |
| 73 | 74 | 74 | |
| Gelechioidea |
| 55 | 59 | 59 | |
| Gelechioidea subgroup (7 taxa) |
| 99 | 99 | 100 | |
| Pterophoridae (4 taxa) |
| 94 | 93 | 94 | |
| Epermeniidae (3 taxa) |
|
| 90 | 95 | |
| Cossidae subgroup (3 taxa) |
| 100 | 100 | 100 | |
| Brachodidae subgroup (2 taxa) |
|
| 98 | 99 | |
| 15 | Ditrysia − (Tineoidea, Gracillarioidea, Yponomeutoidea) |
| 96 | 97 | 98 |
Bootstrap percentages of all taxonomic groups in Figures 3 and S1 that are at least 5% lower than the value for 15 search replicates are displayed in this table in boldfaced, italicized font (columns 3–6). In no case was the value for 1 search replicate higher than that for 15 by 5% or more. Only bootstrap percentages close to or over 60% at 15 search replicates, and which differ by 5% or more from corresponding values at 1 search replicate, are shown in this table. Node numbers (column 1) refer to correspondingly numbered nodes in Figure 3, while un-numbered taxonomic groups correspond to terminal taxa in that same figure.
Assessing the effectiveness of the GARLI heuristic bootstrap search by varying the number of search replicates performed per individual bootstrap pseudoreplicate in an analysis of 500 483-taxon, 19-gene, nt123, bootstrapped data sets.*
| Number of search replicates per bootstrap pseudoreplicate | ||||||
| Node number | Taxonomic group | 1 | 5 | 10 | 15 | 25 |
| 14 | Ditrysia − Tineoidea |
| 100 | 100 | 100 | 100 |
| Tineoidea |
| 98 | 98 | 98 | 97 | |
| 15 | Ditrysia − (Tineoidea, Gracillarioidea, Yponomeutoidea) |
| 97 | 98 | 99 | 98 |
| Gracillarioidea + Yponomeutoidea |
| 97 | 98 | 98 | 97 | |
| Dudgeoneidae (2 taxa) |
| 93 | 94 | 94 | 95 | |
| Epermeniidae |
|
| 93 | 94 | 97 | |
| Sesiidae |
| 62 | 63 | 62 | 64 | |
| Pterophoridae subgroup (4 taxa) |
|
| 68 | 70 | 72 | |
| Choreutidae |
|
| 99 | 100 | 100 | |
| Mimallonidae + |
| 69 | 69 | 70 | 71 | |
| Drepanidae |
| 90 | 91 | 91 | 92 | |
| Gelechioidea subgroup |
| 96 | 98 | 98 | 98 | |
| Gelechioidea subgroup |
| 96 | 98 | 98 | 98 | |
| Gelechioidea subgroup |
| 83 | 87 | 86 | 86 | |
| Gelechioidea subgroup |
| 79 | 82 | 82 | 81 | |
| Cosmopterigidae subgroup (2 taxa) |
| 78 | 79 | 79 | 80 | |
| Pyraloidea |
|
| 86 | 87 | 89 | |
| Pyralidae |
| 98 | 100 | 100 | 100 | |
| 75 | "butterflies" |
|
| 68 | 69 | 72 |
| Geometridae + Uraniidae |
|
| 88 | 90 | 91 | |
| Uraniidae |
|
| 96 | 97 | 98 | |
| Notodontidae + Oenosandridae |
| 77 | 78 | 77 | 78 | |
Bootstrap percentages of all taxonomic groups in Figures 3 and S2 that are at least 5% lower than the value for 15 search replicates are displayed in this table in boldfaced, italicized font (columns 3–7). In no case was the value for 1 search replicate higher than that for 15 by 5% or more. Only bootstrap percentages close to or over 60% at 25 search replicates, and which differ by 5% or more from corresponding values at 1 search replicate, are shown in this table. Node numbers (column 1) refer to correspondingly numbered nodes in Figure 3, while un-numbered taxonomic groups correspond to terminal taxa in that same figure.
Figure 3Summary of three phylogenetic analyses of 483 taxa and 19 genes.
Bootstrap percentages derived from GARLI analysis of three data sets -- nt123_degen1, nt123, and nt123_partition -- are displayed in that order above internal branches of a condensed, higher-level-only portion of the nt123_degen1 ML topology (see numbers in black). Selected nodes are arbitrarily numbered for convenient reference (see numbers in blue). The full nt123_degen1 and nt123 topologies are shown in Figure S1 and S2, respectively. A bracket indicates that the node displayed was not recovered in the ML analysis of that data set. A dash indicates that the bootstrap value is <50%. The number of exemplars is listed in parentheses after the family or subfamily name. The region of the topology that includes Tineoidea has blue-colored branches, and its favored alternative topology, based on analysis of nt123, is also displayed (see lower boxed area). Throughout this report, we have subsumed all tineoid taxa traditionally identified as Acrolophidae under Tineidae, all tineoid taxa traditionally identified as Arrhenophanidae under Psychidae, and Crinopterygidae under Incurvariidae, following van Nieukerken et al. [1]. BP, bootstrap percentage.
A further assessment of the effectiveness of the GARLI heuristic bootstrap search by instituting a huge increase in the number of search replicates performed per individual bootstrap pseudoreplicate in an analysis of 505 483-taxon, 19-gene, nt123_degen1, bootstrapped data sets.*
| Numbers of search replicates /bootstrap pseudoreplicate | ||||
| Node number | Taxonomic group | 15 | 1000 | % points difference |
| Lasiocampidae | 95 | 100 | + 5 | |
|
| 31 | 71 | + 40 | |
| 75 | butterflies | 83 | 88 | + 5 |
| Nymphalidae | 93 | 98 | + 5 | |
| Epermeniidae | 95 | 100 | + 5 | |
|
| 36 | 66 | + 30 | |
|
| 76 | 95 | + 19 | |
|
| 66 | 89 | + 23 | |
| 50 |
| 77 | 88 | + 11 |
| Zygaenidae + Lacturidae | 87 | 93 | + 6 | |
|
| 77 | 89 | + 12 | |
| 16 |
| 40 | 57 | + 17 |
| 13 | Apoditrysia + Yponomeutoidea + Gracillarioidea + Tineidae (no | 64 | 70 | + 6 |
| 12 |
| 68 | 79 | + 11 |
| 11 | Apoditrysia + Yponomeutoidea + Gracillarioidea + Tineidae (no | 87 | 92 | + 5 |
| 'Adelidae − | 92 | 99 | + 7 | |
| Micropterigidae + Agathiphagidae | 70 | 65 | − 5 | |
Node numbers (column 1) refer to correspondingly numbered nodes in Figure 3, while un-numbered taxonomic groups either correspond to terminal taxa in that same figure or to groups not recovered. Numbers in columns 3 and 4 are the resulting bootstrap percentages. Taxonomic groups whose bootstrap percentage increases by >10% are in boldfaced, italicized font (column 2).
Selected bootstrap results based on analysis of taxon-depleted nt123_degen1 data sets.*
| Node number | Taxonomic Group | 483 taxa | 453 taxa, no ACrogue | 436 taxa, no RNRrogue | 434 taxa, no RNRrogue, − Acan, − Neop: | 344 taxa, APODIT | 314 taxa, APODIT, no ACrogue | 133 taxa, MACRO | 129 taxa, MACRO, no ACrogue |
| Bomb + Lasi | 36 | 52 | 25 | 24 | 40 | 54 | x | 38 | |
| Geom:Sema + Drep:Epic | 68 |
|
|
|
| 61 | 66 | 59 | |
| 22 | Bomb + Lasi + Noct + Drep + Geom + Mima + Cime ( = | 39 | 63 [no |
|
| 60 |
| ||
| Mima + | 33 | [no | [no | [no | 44 | [no | 57 | [no | |
| Pyra |
|
|
|
|
|
| |||
| 21 |
| 23 | 38 | 36 | 37 | 41 | 40 | ||
|
| 31 | 27 | 39 | 26 | 34 | 24 | |||
| Gele | 59 |
|
|
| 68 |
| |||
| 19 |
| 14 | 65 | 58 | 57 | 16 | 34 | ||
| 71 | Eper + Copr − | 35 | 69 (no |
|
| 37 | 54 (no | ||
| Tort + Immo | x | 44 | 32 | x | x | x | |||
| 46 | Zyga + Sesi + Coss | x | 25 [no Zyga:Cycl, Zyga: Epip] | x | 3 | 2 [no Zyga:Cycl, Zyga:Epip] | 23 [no Zyga:Cycl, Zyga:Epip] | ||
| Sesi + Coss | x | 67 | x | x | x | 64 | |||
| Schr + Grac:Doug | 37 | [no Grac:Doug] |
|
| 39 | [no Grac:Doug] | |||
| 15 | Ditrysia − (Urod, Ypon, Grac, Tine) ( = | 40 | 56 | 57 | |||||
| 32 | Adel + Ande | 67 | 66 | 68 | |||||
| Acan + Neop | 35 |
| [no Acan or Neop] | ||||||
| Acan + Neop + Erio | x | 45 [no Neop: | [no Acan or Neop] | ||||||
| Eulep + Neop + Acan + Erio | 26 | 33 [no Neop: | 52 [no Acan or Neop] | ||||||
| 27 | Hepi + Mnes + Loph | 58 |
|
| |||||
| 4 | Ditrysia + Tisc + Pala + Adel + Ande + Nept + Acan + Neop + Hepi + Mnes + Loph | 28 | x | x | |||||
| Agat + Microp |
|
|
|
Bootstrap results in PAUP* are those shown under the "le = yes" option. x, Not present in the bootstrap table under le option, so value <50%; "Strong" bootstrap values, i.e., ≥80%, and "moderate" bootstrap values, i.e., 70–79%, are in boldfaced, italicized font (columns 3–10). Node numbers (column 1) refer to correspondingly numbered nodes in Figure 3. Abbreviated taxonomic group names are in column 2 and throughout the table (see below for abbreviations). Columns 3–10 show the bootstrap values based on analysis of eight different nt123_degen1 data sets. Descriptors of these data sets (see labels at top) include number of taxa (129 – 483), whether rogue taxa were excluded, and whether analysis was restricted to a subset of total Lepidoptera (i.e., APODIT, restricted to Apoditrysia; MACRO, restricted to Macroheterocera). ACrogue, Adams-consensus rogue; RNRrogue, RNR rogue; Bomb, Bombycoidea; Lasi, Lasiocampidae; Geom:Sema, Geometroidea:Sematuridae; Drep:Epic, Drepanoidea:Epicopeiidae; Noct, Noctuoidea; Drep, Drepanoidea; Geom, Geometroidea; Mima, Mimallonidae; Cime, Cimeliidae; MACRO, Macroheterocera; Doa, Noctuoidea: Doa sp.; Pyra, Pyraloidea; Hybl, Hyblaeidae; Gele, Gelechioidea; Copr, Copromorphoidea; Eper, Epermeniidae; Thyr, Thyrididae; Call, Callidulidae; "butterflies", Nymphalidae + Lycaenidae + Pieridae + Hedylidae + Hesperiidae + Papilionidae; Pter, Pterophoridae; Aluc, Alucitidae; Gele, Gelechioidea; OBTECT, Obtectomera; Copromorpha, Copromorpha sp.; Tort, Tortricoidea; Immo, Immoidea; Zyga, Zygaenoidea; Sesi, Sesioidea; Coss, Cossoidea; Zyga:Cycl, Zygaenoidea:Cyclotornidae; Zyga:Epip, Zygaenoidea:Epipyropidae; Schr, Schreckensteiniidae; Grac:Doug, Gracillarioidea:Douglasiidae; Ditrysia, Ditrysia (as defined in Figure S1); Urod, Urodidae; Ypon, Yponomeutoidea; Grac, Gracillarioidea; Tine, Tineoidea; APODIT, Apoditrysia; Adel, Adeloidea; Ande, Andesianidae; Acan, Acanthopteroctetidae; Neop, Neopseustidae; Erio, Eriocraniidae; Neop:Apoplania, Neopseustidae:Apoplania sp.; Tisc, Tischeriidae; Pala, Palaephatidae; Eulep: Ditrysia + Palaephatidae + Tischeriidae + Adeloidea + Andesianidae; Erio, Eriocraniidae; Hepi, Hepialidae; Mnes, Mnesarchaeidae; Loph, Lophocoronidae; Nept, Nepticuloidea; Agat, Agathiphagidae; Micr, Micropterigidae.
Selected bootstrap results based on analysis of taxon-depleted nt123 data sets.*
| Node number | Taxonomic Group | 483 taxa | 455 taxa, no ACrogue | 432 taxa, no RNRrogue | 356 taxa, no RNRrogue, − heterog | 344 taxa, APODIT | 316 taxa, APODIT no ACrogue | 133 taxa, MACRO | 129 tx, MACRO, no ACrogue |
| Bomb + Lasi | 63 |
|
|
| 54 |
|
|
| |
| Mima + | 26 | x | x | x | 38 | 62 | x | 65 [no | |
| Geom:Sema + Drep:Epic |
|
|
|
|
|
|
| 68 | |
| 22 | Bomb + Lasi + Noct + Drep + Geom + Mima + Cime ( = | 55 | 36 [no |
|
|
|
| ||
| Mima + |
| [no |
|
|
| [no |
| [no | |
| Pyra |
|
|
|
|
|
| |||
| 21 |
| 37 | 23 |
| 68 | 58 | 47 | ||
| Gele | 52 | 29 | 68 |
| 52 | 54 | |||
| 19 | MACRO + Pyra + Hybl + Copr + Eper+ Thyr + Call + butterflies + Pter + Aluc + Gele ( = OBTECT + Gele) | x | x | x | x | x | x | ||
| Call + Hybl + Thyr | x | 59 |
|
| x | 40 | |||
| Call + Hybl | 24 |
|
| 68 | x | 69 | |||
| 71 | Eper + Copr − | x | 17 (no | x | 21 (no | 14 | 13 (no | ||
| Tort + Immo | 53 |
|
|
| 60 |
| |||
| 46 | Zyga + Sesi + Coss | 2 |
| 32 | 41 | x |
| ||
| 32 | Adel + Ande |
|
|
| |||||
| Acan + Neop | x | [no Neop] | [no Neop] | ||||||
| Acan + Neop + Erio | x |
|
| ||||||
| Eulep + Nept + Acan + Neop + Erio | x |
| 49 [no Neop] | ||||||
| 27 | Hepi + Mnes + Loph |
|
| [no Loph] | |||||
| 4 | Eulep + Nept + Acan + Neop + Hepi + Mnes + Loph | x | x | x | |||||
| Agat + Microp |
|
|
|
Bootstrap results in PAUP* are those shown under the "le = yes" option. Node numbers (column 1) refer to correspondingly numbered nodes in Figure 3. "Strong" bootstrap values, i.e., ≥80%, and "moderate" bootstrap values, i.e., 70–79%, are in boldfaced, italicized font (columns 3–10). See footnote to Table 4 for definitions of abbreviations.
Figure 4Base-composition distance diagrams derived from analysis of the nt123 and nt123_degen1 data sets for 483 taxa.
Branching structure obtained by neighbor-join / minimum evolution analysis of Euclidean distances calculated on the proportions of each of the four nucleotide types in each species. All diagrams are drawn to the same scale, and units are 'per cent ÷ 100'. The blue shaded portions identify taxa deleted from nt 123 data subsets to explore the effect of decreased nucleotide heterogeneity on bootstrap percentages.
Figure 5Summary of phylogenetic analyses based on taxon (sub)sampling of Tineoidea.
Summary phylogenetic trees are displayed with corresponding bootstrap percentages for analysis of nt123 and nt123_degen1 data sets based on different taxon subsamples for Tineoidea. For ease and focus of presentation, only relationships among strongly supported, higher-level groupings are shown (see Figure 3). These groupings are: Tine: Tineidae – Eudarcia (20 taxa total); Psych = Psychidae (9 taxa total); Eudarcia (currently classified within Tineidae, 1 taxon); Compsoctena (currently classified within Eriocottidae; 1 taxon); NTD: non-tineoid Ditrysia (27 taxa total); and the non-ditrysian outgroup (not shown), which consist of Palaephatidae + Tischeriidae (5 taxa total).
Figure 6Base-composition distance diagrams of nt123_degen1 and nt123 data sets for the 63 taxa in the Tineoidea test set.
Both diagrams are drawn to the same scale, and units are 'per cent ÷ 100'. Bootstrap percentages ≥50% are displayed. Bootstrap percentages are based on analysis of total taxon-specific nucleotide compositions, as described in Materials and Methods. All terminal taxa are identified to genus for nt123 but not for nt123_degen1, due to the reduced compositional heterogeneity in the latter data set. The vertical bars identify those taxa used in a phylogenetic analysis (Figure 5) to test the effect of reduced compositional heterogeneity on the analysis of nt123. The five sets of taxa whose inter-relationships are analyzed in Figure 5 are color- and/or symbol-coded (see key).