Literature DB >> 29134101

Aligning evidence: concerns regarding multiple sequence alignments in estimating the phylogeny of the Nudibranchia suborder Doridina.

Joshua M Hallas1,2, Anton Chichvarkhin3,4, Terrence M Gosliner2.   

Abstract

Molecular estimates of phylogenetic relationships rely heavily on multiple sequence alignment construction. There has been little consensus, however, on how to properly address issues pertaining to the alignment of variable regions. Here, we construct alignments from four commonly sequenced molecular markers (16S, 18S, 28S and cytochrome c oxidase subunit I) for the Nudibranchia using three different methodologies: (i) strict mathematical algorithm; (ii) exclusion of variable or divergent regions and (iii) manually curated, and examine how different alignment construction methods can affect phylogenetic signal and phylogenetic estimates for the suborder Doridina. Phylogenetic informativeness (PI) profiles suggest that the molecular markers tested lack the power to resolve relationships at the base of the Doridina, while being more robust at family-level classifications. This supports the lack of consistent resolution between the 19 families within the Doridina across all three alignments. Most of the 19 families were recovered as monophyletic, and instances of non-monophyletic families were consistently recovered between analyses. We conclude that the alignment of variable regions has some effect on phylogenetic estimates of the Doridina, but these effects can vary depending on the size and scope of the phylogenetic query and PI of molecular markers.

Entities:  

Keywords:  Cryptobranchia; Dorid systematics; Phanerobranchia; multiple sequence alignment; sequence homology

Year:  2017        PMID: 29134101      PMCID: PMC5666284          DOI: 10.1098/rsos.171095

Source DB:  PubMed          Journal:  R Soc Open Sci        ISSN: 2054-5703            Impact factor:   2.963


Introduction

The debate regarding multiple sequence alignment (MSA) construction [1-8] and molecular marker selection [9] for use in phylogenetic estimates has been well established in the literature. There is little consensus, however, on the most accurate and replicable approach when considering MSA assemblies [10]. The crux of the matter is how best to align and represent homology in highly variable regions of sequence data. Most approaches employ a combination of a mathematical algorithm and manual curation of variable regions. Unfortunately, in most instances manual curation is loosely defined—which makes subsequent studies difficult to replicate—and even though mathematical algorithms allow for studies to be easily replicated, they do not take into account evolutionary history and consider homology exclusively as similarity [3]. In addition, they often fail to accurately align variable regions [11]. The estimation of evolutionary relationships relies heavily on the ability to determine homology between any given set of characters. The wrongful designation of characters as homologous can confound interpretations of evolutionary relationships by creating homoplasies, which results in the loss of phylogenetic signal [12]. Thus, the determination of homology in either morphological character matrices or MSAs (computational or manually curated) has a direct impact on the accuracy and ability to replicate phylogenetic estimates. This has led to differing positions concerning variable regions and their importance in increasing [13,14] or decreasing [15-17] phylogenetic signal. These highly variable loop regions become increasingly more difficult to align as sequences become more divergent. This confounds the ability to determine accuracy in MSAs and may result in contradictory estimates of deeper evolutionary relationships. It has been proposed that the large sections created by highly variable loop regions in MSAs should be excluded because of the uncertainty in determining reliable estimates of positional homology [18]. Conversely, it has also been suggested that effects from MSAs should be explored in determining phylogenetic estimates due to their likelihood of containing phylogenetic signal [19,20]. Highly divergent sequences and a general lack of diverse molecular markers have made MSAs and resulting phylogenetic estimates for the Nudibranchia problematic [21-26]. For these reasons, we examine how MSA construction of common nDNA and mtDNA markers affect phylogenetic estimates of the diverse Nudibranchia clade Doridina. Dorid nudibranchs have been a robust model for investigating biochemical diversity [27-33], morphological evolution [34-42], colour evolution [43-49], population structure [50-52] and development [53-57]. Even though there have been previous phylogenetic estimates that included dorid nudibranchs (e.g. [35,36]), they did not include representatives of all 19 families within the Doridina, and thus, a comprehensive phylogenetic context for the evolution of these traits is deficient. The Doridina suborder currently consists of 19 families and more than 2000 described species. The classification has been divided into five superfamilies (Bathydoridoidea, Doridoidea, Onchidoridoidea, Phyllidioidea and Polyceroidea) defined by morphological variation in gill and feeding structures [21,36,58,59]. Species that possess the ability to retract their gills into fully formed gill pockets, Eudoridoidea (=Cryptobranchia), are divided into two clades based on the presence or absence of radula, Labiostomata (=Doridoidea) and Porostomata (=Phyllidiodea), respectively [36]. Dorid nudibranchs that lack a fully formed gill pocket represent the Anthobranchia (=Phanerobranchia), which is subdivided into the Suctoria (=Onchidoridoidea) and the Non-Suctoria (=Polyceroidea) based on the presence or absence of a buccal pump. Even though morphological [36,59] and molecular [22,23,60,61] analyses have shown the Doridina to be monophyletic, no study has focused on a complete sampling of all currently recognized families. Thus, relationships between families are poorly understood due to conflicting phylogenies and inadequate sampling. In this study, we use three common MSA methodologies: (i) strict mathematical algorithm; (ii) exclusion of variable or divergent regions and (iii) manually curated. We then discuss issues pertaining to the lack of resolution in phylogenetic estimates and limitations in resolving the base of the Doridina. We also address the possible effects alignment construction may have on phylogenetic informativeness (PI), and how phylogenetic signal may shift given alignment methodology. Lastly, we re-evaluate taxonomic classifications that were consistent between all three analyses.

Material and methods

Taxon sampling

We sampled (121 taxa) representatives from the five superfamilies and all 19 families that currently comprise the Doridina, and mined GenBank for specimens that had at least two sequences. To limit the amount of missing data, we obtained extractions or tissue samples from published specimens to sequence molecular markers that were previously unattained. Pleurobranchaea meckeli and Berthella martensi were chosen as outgroups because the Pleurobranchomorpha has been suggested to be sister to the Nudibranchia [23]. Additional members from the Aeolidina and Dendronotina were also used in testing the monophyly of the Doridina. Specimens, voucher numbers, collection sites and GenBank accession numbers are listed in table 1. Voucher specimens are located in the collections at California Academy of Sciences (CASIZ) and the Museum of National Scientific Center of Marine Biology, Vladivostok (MIMB).
Table 1.

Specimens successfully sequenced and used for molecular analyses. Voucher numbers, GenBank accession numbers and collection localities.

GenBank accession number
specimenvoucher16S18S28SCOIlocality
PLEUROBRANCHOIDEA
Pleurobranchaea meckeli (Blainville, 1825)FJ917439FJ917449FJ917481FJ917499Mediterranean Sea, Spain
Pleurobranchidae
 Berthella martensi (Pilsbry, 1896)MZUCR 6982HM162592MF958319MF958363HM162683Las Secas, Islas sin nombre, Panama
AEOLIDINA
Flabellinidae
 Flabellina pedata (Montagu, 1815)AF249247AJ224788AF249817North Sea, Helgoland
DENDRONOTINA
Arminidae
 Armina loveni (Bergh, 1866)AF249243AF249196AF249781North Sea, Kattegat
DORIDINA
Bathydoridoidea
Bathydorididae
 Prodoris clavigera (Thiele, 1912)CASIZ 167553JX274067MF958320MF958364JX274106Elephant Island, Antarctica
Doridoidea
Actinocyclidae
 Actinocyclus verrucosus Ehrenberg, 1831CASIZ 189448MF958311MF958352MF958397MF958438Kauai, Hawaii
 Hallaxa indecora (Bergh, 1905)CASIZ 179600MF958302MF958340MF958386South Loi Island, Marshall Islands
 Hallaxa translucens Gosliner & S. Johnson, 1994CASIZ 173447EU982814MF958341MF958387EU982760Iles Radama, Madagascar
 Hallaxa iju Gosliner & S. Johnson, 1994CASIZ 175559EU982813EU982759Maui, Hawaii
Cadlinidae
 Aldisa sanguinea (J. G. Cooper, 1863)CASIZ 182031MF958309MF958350MF958394MF958435Marin Co., California
 Aldisa sp.CASIZ 175733EU982818MF958351MF958395MF958436Tiger Reef, Malaysia
 Cadlina cf. luteomarginata (MacFarland, 1905)CASIZ 188599AKJ653679KP340317KP340350KM219678Parksville, Vancouver Island, British Columbia
 Cadlina laevis (Linnaeus, 1767)CASIZ 175444MF958359MF958406Scotland
 Cadlina luarna (Marcus & Marcus, 1967)CASIZ 175437EU982768EU982718Guanacaste, Costa Rica
 Cadlina modesta MacFarland, 1966CASIZ 223286MF958310MF958437Vista del Mar, San Luis Obispo Co., California
 Cadlina pellucida (Risso, 1826)CASIZ 175448EU982774MF958396EU982724Ilha de Pessegueiro, Baixo Alentejo Prov., Portugal
 Cadlina sparsa (Odhner, 1921)CASIZ 182932EU982776EU982726La Jolla, San Diego Co., California
Chromodorididae
 Cadlinella ornatissima (Risbec, 1928)CASIZ 177420MF958284MF958325MF958371MF958415Maricaban Island, Luzon, Philippines
 Cadlinella ornatissima (Risbec, 1928)CASIZ 175452EU982779EU982728Mooloolaba, Queensland, Australia
 Chromodoris alternata (Burn, 1957)SAM D19281AY458800EF534031EF535120Port Phillip Bay, Victoria, Australia
 Chromodoris ambigua Rudman, 1987SAM D19260AY458801EF534038EF535119Port Phillip Bay, Victoria, Australia
 Chromodoris quadricolor (Rüppell & Leuckart, 1830)AF249241AJ224773AF249802Red Sea, Egypt
 Doriprismatica atromarginata (Cuvier, 1804)AF249211AF249789Great Barrier Reef, Australia
  'Felimare' elegans (Cantraine, 1835)AF249238AJ224779AF249787NE Atlantic, Spain
  'Felimare' midatlantica (Gosliner, 1990)CASIZ 175443JQ727789JQ727898Islotes do Martinhal, Algarve, Portugal
  'Felimare' picta verdensis Ortea, Valds & Garca-Gmez, 1996CASIZ 179384HM162594MF958346MF958389HM162685Gulf of Guinea, Ilha do Principe, Sao Tome & Principe
  'Felimare' villafranca (Riaao, 1818)AF249237AJ224780NE Atlantic, Spain
  'Felimida' edmundsi Cervera, Garcia-Gmez & Ortea, 1989CASIZ 179385HM162595MF958347MF958390HM162686Gulf of Guinea, Ilha do Principe, Sao Tome & Principe
  'Felimida' krohni (Verany, 1846)AF249239AJ224774AY427445AF249805NE Atlantic, Spain
 Goniobranchus geometricus (Risbec, 1928)CASIZ 175549JQ727717JQ727842Nosy Valiha, Iles Radama, Madagascar
 Hypselodoris imperialis (Pease, 1860)CASIZ 142952EU982807EU982754Mala Wharf, Maui, Hawaii
 Hypselodoris infucata (Rüppell & Leuckart, 1830)FJ917427FJ917442FJ917467FJ917485NSW, Australia
 Miamira magnifica Eliot, 1904CASIZ 169951EU982781EU982731Old Woman Island, Queensland, Australia
 Thorunna daniellae (Kay and Young, 1969)CASIZ 170055EU982809EU982756Maalaea Marina, Maui, Hawaii
 Tyrinna evelinae (Marcus, 1958)CASIZ 175440EU982811MF958391EU982757Playa Ventana, Guanacaste, Costa Rica
 Tyrinna nobilis Bergh, 1898ZSM M20050508EF534054EF534035EF535127
Dorididae
 Aphelodoris sp. 1CASIZ 176920MF958293MF958332MF958379MF958424Oudekraal, Cape Prov., South Africa
 Aphelodoris sp. 1GQ326866GQ292033Auckland, New Zealand
 Aphelodoris luctuosa (Cheeseman, 1882)GQ326867GQ292042Auckland, New Zealand
 Doris montereyensis (Cooper, 1862)CASIZ 174493MF958294MF958333MF958425Battery Point, Crescent City, Del Norte Co., California
 Doris odhneri MacFarland, 1966CASIZ 188014MF958295MF958334MF958380Duxbury Reef, Marin Co., California
 Doris sp. 8CASIZ 192348MF958306MF958345Red Sea, Saudi Arabia
Discodorididae
 Asteronotus cespitosus (van Hasselt, 1824)CASIZ 191096MF958288MF958328MF958375MF958419Kranket Island, Papua New Guinea
 Atagema cf osseosa (Kelaart, 1859)CASIZ 185142MF958296MF958335MF958426Maui, Hawaii
 Discodoris coerulescens Bergh, 1888CASIZ 182850MF958290MF958330MF958377MF958421Maricaban Island, Luzon, Philippines
 Halgerda dalanghita Fahey & Gosliner, 1999CASIZ 181264MF958289MF958329MF958376MF958420Maricaban Island, Luzon, Philippines
Discodorididae
 Peltodoris nobilis (MacFarland, 1905)CASIZ 182223EU982816EU982761Pillar Point, San Mateo Co., California
 Platydoris sanguinea Bergh, 1905CASIZ 177762MF958285MF958326MF958372MF958416Maricaban Island, Batangaas Prov., Philippines
 Rostanga calumus Rudman & Avern, 1989FJ917427FJ917468FJ917485NSW, Australia
 Sclerodoris tuberculata Eliot, 1904CASIZ 190788MF958286MF958327MF958373MF958417Madang Prov., Papua New Guinea
 Thordisa albomacula Chan & Gosliner, 2006CASIZ 179590MF958287MF958374MF958418Kwajalein Atoll, Marshall Islands
ONCHIDORIDOIDEA
Akiodorididae
 Armodoris anudeorum Valdés, Moran & Woods, 2011LACM 3118KP340290GQ326879KP340355KP340387McMurdo Sound, Ross Sea, Antarctica
Calycidorididae
 Calycidoris guentheri Abraham, 1876CASIZ 190966AKP340301KP340338KP340371KP340397Chukchi Sea, Alaska
 Diaphorodoris lirulatocauda Millen, 1985CASIZ 184341KP340307KP340344KP340377KP340403Duxbury Reef, Marin Co., California
 Diaphorodoris luteocincta (Sars, 1870)LACM 8.7AKP340308KP340378KP340404Bahia de Algeciras, Cadiz Prov., Spain
 Diaphorodoris cf mitsuiiCASIZ 185986KP340310KP340345KP340379KP340406Sepok Point, Philippines
 Diaphorodoris papillata Portmann & Sandmeier, 1960LACM 8.6AKP340311KP340407Bahia de Algeciras, Cadiz Prov., Spain Peninsula
Corambidae
 Corambe obscura (Verrill, 1870)CASIZ 183942KP340303KP340340KP340373KP340399New Castle Portsmouth Bay, New Hampshire
 Corambe pacifica MacFarland & O'Donoghue, 1929LACM 2007-2.6CKP340305KP340342KP340375KP340401Long Beach Marina, Los Angeles Co., California
 Corambe steinbergae (Lance, 1962)CASIZ 190508KP340306KP340402Pillar Point, San Mateo Co., California
Goniodorididae
 Ancula gibbsoa (Risso, 1818)CASIZ 182028KP340291KP340322KP340356KP340388Cumberland Co., Maine
 Ancula gibbsoa (Risso, 1818)CASIZ 181211MF958291MF958422Duxbury Reef, Marin Co., California
 Goniodoris nodosa (Montagu, 1808)AF249226AJ224783AF249788NE Atlantic, Spain
 Okenia kendi Gosliner, 2004CASIZ 186125AMF958297MF958381MF958427Luzon, Philippines
 Trapania reticulata Rudman, 1987CASIZ 191431MF958303MF958342MF958432Tab Island, Papua New Guinea
Onchidorididae
 Acanthodoris atrogriseata O'Donoghue, 1927CASIZ 186000KJ653646KP340323KP340357KM219646Puget Sound, Kitsap Co., Washington
 Acanthodoris hudsoni MacFarland, 1905CASIZ 179480KJ653652KP340324KP340359KM219650Asilomar, Monterey Co., California
 Acanthodoris nanaimoensis O'Donoghue, 1921CASIZ 181569AKJ653656KP340325KP340360KM219657Pillar Point, San Mateo Co., California
 Acanthodoris pilosa (Abildgaard, 1789)CASIZ 183941AKJ653659KP340326KP340361Cobscook Bay, Washington Co., Maine
 Acanthodoris planca Fahey & Valdés, 2005CASIZ 176116KJ653669KP340327KP340362KM219671Table Bay, Western Cape Prov., South Africa
 Acanthodoris rhodoceras Cockerell & Eliot, 1905CASIZ 181572KJ653671KP340328KP340363KM219673Pillar Point, San Mateo Co., California
 Knoutsodonta brasiliensis (Alvim, Padula & Pimenta, 2011)BNHS-Opistho-336KC255225KC255226
 Knoutsodonta depressa (Alder & Hancock, 1842)CASIZ 186769AKP340315KP340347KP340409Huelva, Spain
 Knoutsodonta jannae (Millen, 1987)CASIZ 175578KP340296KP340331KP340366KP340392Pillar Point, San Mateo Co., California
 Knoutsodonta oblonga (Alder & Hancock, 1845)MN 3010AKP340349KP340385KP340410Mewstone, Skomer, United Kingdom
 Onchidoris bilamellata (Linnaeus, 1767)CASIZ 188593KP340314KP340346KP340382Puget Sound, Kitsap Co., Washington
 Onchidoris muricata Muller, 1776CASIZ 181312KJ653676KP340348KP340383KM219680Asilomar, Monterey Co., California
 Onchidoris evincta (Millen, 2006)CASIZ 187758AKP340293KP340329KP340364KP340390Puget Sound, Kitsap Co., Washington
 Onchidoris macropompa Martynov, Korshunova, Sanamyan & Sanamyan, 2009MIMB 34210MF958292MF958331MF958378MF958423Avacha Bay, Kamchatka
 Onchidoris proxima (Alder & Hancock, 1854)CASIZ 183921AKJ653673KP340336KP340369KM219676Passamaquody Bay Eastport, Washington Co., Maine
 Onchidoris slavi Martynov, Korshunova, Sanamyan & Sanamyan, 2009MIMB 34211MF958409MF958446Avacha Bay, Kamchatka
 Onchimira cavifer Martynov, Korshunova, Sanamyan & Sanamyan, 2009MIMB 34209MF958298MF958336MF958382MF958428Avacha Bay, Kamchatka
 Onchimira cavifer Martynov, Korshunova, Sanamyan & Sanamyan, 2009MF958360MF958407MF958445Avacha Bay, Kamchatka
PHYLLIDIOIDEA
Dendrodorididae
 Dendrodoris arborescens (Collingwood, 1881)CMNH-ZM08965AB917459AB917436Oka, Tateyama
 Dendrodoris atromaculata (Alder & Hancock, 1864)CASIZ 181231MF958307MF958348MF958392MF958434Janao Bay, Luzon, Philippines
 Dendrodoris denisoni (Angas, 1864)CASIZ 177702MF958308MF958349MF958393Janao Bay, Luzon, Philippines
 Dendrodoris fumata (Rüppell & Leuckart, 1830)CASIZ 192304MF958358MF958405MF958444Red Sea, Saudi Arabia
 Dendrodoris fumata (Rüppell & Leuckart, 1830)AF249216FJ917470AF249799Great Barrier Reef, Australia
 Dendrodoris guttata (Odhner, 1917)CMNH-ZM 08967AB917461AB917446Okinoshima, Tateyama
 Dendrodoris nigra (Stimpson, 1855)CASIZ 182821MF958318MF958357MF958404MF958443Maricaban Island, Luzon, Philippines
 Dendrodoris nigra (Stimpson, 1855)AF249242AF249215AF249795Great Barrier Reef, Australia
 Doriopsilla albopunctata (J. G. Cooper, 1863)CPIC 00909KR002428KR002480Long Beach, California
 Doriopsilla bertschi Hoover, Lindsay, Goddard & Valdés, 2015CPIC 01058KR002462KR002517Bahía de los Ángeles, Baja California, Mexico
 Doriopsilla davebehrensi Hoover, Lindsay, Goddard & Valdés, 2015LACM 3419KR002476Bahía de los Ángeles, Baja California, Mexico
 Doriopsilla fulva (MacFarland, 1905)CPIC 00933KR002444KR002498Malibu, California
 Doriopsilla gemela Gosliner, Schaefer & Millen, 1999CPIC 00938KR002453KR002506Malibu, California
 Doriopsilla janaina Marcus &. Marcus, 1967CASIZ 173618MF958312MF958353MF958398Galápagos Islands, Ecuador
 Doriopsilla miniata (Alder & Hancock, 1864)CMNH ZM08970AB917464AB917457Yoshio, Katsuura
 Doriopsilla spaldingi Valdés & Behrens, 1998CPIC 00908KR002427KR002479San Pedro, Los Angele Co., California
Mandeliidae
 Mandelia mirocornata Valdés & Gosliner, 1999CASIZ 176266MF958278MF958321MF958365MF958411Oudekraal, Cape Prov., South Africa
Phyllidiidae
 Ceratophyllidia sp.CASIZ 181247MF958281MF958323MF958368MF958413Beatrice, Philippines
 Phyllidia coelestis Bergh, 1905CASIZ 190982MF958279MF958366MF958412Kranket Island, Madang Prov., Papua New Guinea
 Phyllidiella nigra (van Hasselt, 1824)CASIZ 186196AMF958280MF958322MF958367Maricaban Strait, Batangas Prov., Luzon, Philippines
 Phyllidiella pustulosa (Cuvier, 1804)AF249232AF249208Great Barrier Reef, Australia
 Phyllidiopsis annae Brunckhorst, 1993CASIZ 186138MF958283MF958324MF958370Philippines
 Reticulidia halgerda Brunckhorst & Burn in Brunckhorst, 1990CASIZ 186491MF958282MF958369MF958414Maricaban Island, Luzon, Philippines
POLYCEROIDEA
Aegiridae
 Aegires albopunctatus MacFarland, 1905CASIZ 182213MF958313MF958354MF958399MF958439Marin Co., California
 Aegires citrinus Pruvot-Fol, 1930CASIZ 144027MF958314MF958355MF958400MF958440Mooloolaba, Queensland, Australia
 Aegires flores Fahey & Gosliner, 2004CASIZ 191244MF958316MF958402MF958442Papua New Guinea
 Aegires serenae (Gosliner and Behrens, 1997)CASIZ 191285MF958315MF958401MF958441Papua New Guinea
 Aegires villosus Farran, 1905CASIZ 177563MF958317MF958356MF958403Maricaban Island, Luzon, Philippines
Gymnodorididae
 Gymnodoris sp.CASIZ 176781MF958361Pulau Penang, Malaysia
Hexabranchiidae
 Hexabranchus sanguineus (Ruppell & Leuckart, 1828)CASIZ 142942MF958304MF958343Kapalua Bay, Maui, Hawaii
 Hexabranchus sanguineus (Ruppell & Leuckart, 1828)CAZIS 193381MF958305MF958344MF958388MF958433Papua New Guinea
Okadaiidae
 Vayssierea sp.CASIZ 190731MF958362MF958408Sunshine Coast, Kings Beach, Australia
Polyceridae
 Kaloplocamus sp. 1CASIZ 194412MF958299MF958337MF958383MF958429South Madagascar, Madagascar
 Limacia sp. 1CASIZ 176312HM162602KP340320KP340353HM162692False Bay, Western Cape Prov., South Africa
 Limacia sp. 2CASIZ 176276HM162603HM162693Oudekraal, Cape Prov., South Africa
 Nembrotha cristata Bergh, 1877CASIZ 191428MF958301MF958339MF958385MF958431Madang Prov., Papua New Guinea
 Plocamopherus pecoso Valls and Gosliner, 2006CASIZ 191587MF958300MF958338MF958384MF958430Madang Prov., Papua New Guinea
 Polycera quadrilineata (Müller, 1776)AF249229AJ224777North Sea, Kattegat
 Roboastra ricei Pola, Cervera & Gosliner, 2008CASIZ 173900HM162598HM162688Florida, 5 mi offshore of Loran Tower
 Tambja marbellensis Schick & Cervera, 1998CASIZ 180379HM162599HM162689Setubal District, Outao, Portugal
 Triopha catalinae (Cooper, 1863)CASIZ 170648HM162600KP340321KP340354HM162690Yacht Harbor, San Francisco, California
 Triopha maculata MacFarland, 1905CASIZ 181556HM162601HM181556Duxbury Reef, Marin Co., California
Specimens successfully sequenced and used for molecular analyses. Voucher numbers, GenBank accession numbers and collection localities.

Extraction, PCR protocols and sequencing

Genomic DNA was extracted using the Qiagen DNeasy Blood & Tissue extraction kit (Valencia, CA). Amplification and sequencing for targeted gene fragments 16S, 18S, 28S and cytochrome c oxidase subunit I (COI) followed protocols used by Hallas & Gosliner [62]. Amplified fragments were sequenced at the Center for Comparative Genomics located at the California Academy of Sciences and National Scientific Center of Marine Biology.

Multiple sequence alignments

We assembled and edited sequences in Geneious Pro v. 9.1.7 [63] and BioEdit [64]. We aligned rDNA fragments (16S, 18S and 28S) using three different methodologies to examine conflicts regarding estimated phylogenies. For our first method, we used a computer algorithm E-INS-i in MAFFT (MA) [65]. The E-INS-i algorithm is designed to handle sequence data with several conserved regions embedded in long unalignable gapped regions. Our second method excluded all variable regions from the initial MAFFT alignment of rDNA using Gblocks (GA) [11]. We determined blocks using a less stringent selection by allowing for smaller final blocks, gap positions within the final blocks and less strict flank positions. For our third method, we used the initial MAFFT alignment from method one, but manually curated variable regions (CA). This was done by hand to correct for possible inappropriate alignment of sequence regions. For each alignment method, we concatenated all four targeted molecular markers into single MSAs that resulted in three separate concatenated datasets.

Phylogenetic informativeness

We estimated PI for each molecular marker per alignment [9] through the PhyDesign web interface [66], and estimated each PI profile using the site-rates model HyPhy [67]. First, we generated ultrametric trees by converting our concatenated Bayesian inference (BI) phylogenetic estimates in Mesquite [68]. Owing to the lack of a fossil record for the Doridina, our ultrametric trees are not in known time units. However, it has been shown that PI profiles can be used effectively even if divergence time estimates are absent [69]. Then, we analysed alignments in relation to each of their resulting estimated phylogeny to gain a greater understanding of how PI can change based on the alignment approach.

Phylogenetic analyses

We determined evolutionary models using PartitionFinder v.1.1.1 [70], and partitioned our concatenated datasets by rDNA fragment as well as codon position for COI. We also analysed our nDNA and mtDNA fragments separately to investigate possible conflicting evolutionary histories. We did this by aligning both nDNA and mtDNA datasets using the algorithm E-INS-i in MAFFT, and used the same partitioning scheme as in our concatenated datasets. We analysed all our datasets using BI and maximum-likelihood (ML). BI searchers were run using MrBayes v. 3.2.1 [71], and convergence was checked in TRACER v.1.5 [72]. The datasets were run for 5 × 107 generations with Markov chains sampled every 1000 generations. The standard 25% burn-in was calculated and remaining tree estimates were used to create a 50% majority rule consensus tree of calculated posterior probabilities. Posterior probabilities (pp) that exceeded 0.95 were considered strongly supported, and values 0.94 and below were interpreted as having low support. For our ML analyses, we calculated non-parametric bootstrapping (bs) values and the ML tree simultaneously in RAxML v. 7.2.6 [73]. We used the same partitioning scheme as in our BI search, but used the evolution model GTR+I− and executed fast bootstrapping runs for 5 × 104 iterations. Bootstrap values 70 or higher were considered strongly supported, while all other values were evaluated as weakly supported [74].

Results

Molecular data

Sequences obtained for phylogenetic analyses and PI profiles are labelled in table 1, and all alignments have been deposited in TreeBASE (http://purl.org/phylo/treebase/phylows/study/TB2:S20396). As expected, our concatenated alignments (MA, GA and CA) varied in length as well as number of parsimony informative characters (table 2). Surprisingly, there were more parsimony informative characters in the MA than the CA. The GA had far fewer parsimony informative characters than the other two alignments generated. Evolutionary model GTR+I+I− was selected by PartitionFinder v. 1.1.1 for each partition based on the Akaike information criterion. Interestingly, specific species of Dendrodoris (D. nigra, D. arborescens, D. guttata and D. fumata) had mtDNA regions that were highly divergent.
Table 2.

Summary of multiple sequence alignment variation between alignment methodologies for 16S, 18S, 28S and COI.

length (base pairs)
parsimony informative characters
alignment16S18S28SCOI16S18S28SCOI
MAFFT4982578960661315485388370
Gblocks3901546585661236351174370
Curated58427221077661308396324370
Summary of multiple sequence alignment variation between alignment methodologies for 16S, 18S, 28S and COI. The net PI profiles depict the amount of signal through time of each molecular marker in relation to the respective estimated phylogeny (figure 1a–c). Regardless of the alignment, net PI profiles show most phylogenetic signal resides towards the tips of the trees. The amount of overall phylogenetic signal was lowest in our GA estimate, which suggests that information was lost when variable regions were excluded. Lastly, our MA appeared to have slightly more PI than our CA.
Figure 1.

Phylogenetic informativeness profiles estimated by PhyDesign. (a) MAFFT alignment. (b) Gblocks alignment. (c) Curated alignment.

Phylogenetic informativeness profiles estimated by PhyDesign. (a) MAFFT alignment. (b) Gblocks alignment. (c) Curated alignment.

Phylogeny

BI posterior probability estimates generally resulted in higher overall support of relationships than ML non-parametric bootstrap values. Branch lengths for basal nodes were extremely short (figure 2a–c), especially in regions of the tree that resulted in low resolution for relationships between superfamilies (figure 3a–c). There were, however, some general similarities between topologies of both phylogenetic searches and among alignments.
Figure 2.

Phylograms of Doridina phylogenetic estimates from all three different alignment constructions. Topologies represent Bayesian estimates. (a) MAFFT alignment. (b) Gblocks alignment. (c) Curated alignment.

Figure 3.

Cladogram of Doridina phylogenetic estimates. Topology represents Bayesian estimate. Branches are coloured based on superfamily designations, which are pictured adjacent to the phylogenies. Only four of the five families are depicted due to the inability to obtain an image of Bathydoridoidea. Circles represent posterior probabilities (top) and non-parametric bootstrap support values (bottom). Closed circles indicate high Bayesian and ML support (pp ≥ 0.95; bs ≥ 75). Red circles indicate moderate support values (pp: 0.95–0.90; bs: 75–70). Open circles indicate no support (pp < 0.90; bs < 70). Relationships that were not recovered by ML analysis are represented by dashes. (a) MAFFT alignment. (b) Gblocks alignment. (c) Curated alignment.

Phylograms of Doridina phylogenetic estimates from all three different alignment constructions. Topologies represent Bayesian estimates. (a) MAFFT alignment. (b) Gblocks alignment. (c) Curated alignment. Cladogram of Doridina phylogenetic estimates. Topology represents Bayesian estimate. Branches are coloured based on superfamily designations, which are pictured adjacent to the phylogenies. Only four of the five families are depicted due to the inability to obtain an image of Bathydoridoidea. Circles represent posterior probabilities (top) and non-parametric bootstrap support values (bottom). Closed circles indicate high Bayesian and ML support (pp ≥ 0.95; bs ≥ 75). Red circles indicate moderate support values (pp: 0.95–0.90; bs: 75–70). Open circles indicate no support (pp < 0.90; bs < 70). Relationships that were not recovered by ML analysis are represented by dashes. (a) MAFFT alignment. (b) Gblocks alignment. (c) Curated alignment. The Doridina suborder was recovered as monophyletic with high support in all analyses estimates (CA: pp = 1.00, bs = 91; GA: pp = 1.00, bs = 86; MA: pp = 1.00, bs = 86; figures 4–6). GA was the only analysis that recovered the Aegiridae sister to all other members of the Doridina, but there was no support. Both MA and CA recovered Prodoris sister to the rest of the Doridina.
Figure 4.

Phylogenetic estimate from MAFFT Alignment. Topology represents Bayesian estimate, with posterior probabilities (pp) and non-parametric bootstrap (bs) support values depicted above and below each branch, respectively. Relationships that were not recovered by ML analysis are represented by dashes. Branches are coloured based on family designations and represented on the exterior of the phylogeny.

Figure 6.

Phylogenetic estimate from Curated alignment. Topology represents Bayesian estimate, with posterior probabilities (pp) and non-parametric bootstrap (bs) support values depicted above and below each branch, respectively. Relationships that were not recovered by ML analysis are represented by dashes. Branches are coloured based on family designations and represented on the exterior of the phylogeny.

Phylogenetic estimate from MAFFT Alignment. Topology represents Bayesian estimate, with posterior probabilities (pp) and non-parametric bootstrap (bs) support values depicted above and below each branch, respectively. Relationships that were not recovered by ML analysis are represented by dashes. Branches are coloured based on family designations and represented on the exterior of the phylogeny. Phylogenetic estimate from Gblocks alignment. Topology represents Bayesian estimate, with posterior probabilities (pp) and non-parametric bootstrap (bs) support values depicted above and below each branch, respectively. Relationships that were not recovered by ML analysis are represented by dashes. Branches are coloured based on family designations and represented on the exterior of the phylogeny. Phylogenetic estimate from Curated alignment. Topology represents Bayesian estimate, with posterior probabilities (pp) and non-parametric bootstrap (bs) support values depicted above and below each branch, respectively. Relationships that were not recovered by ML analysis are represented by dashes. Branches are coloured based on family designations and represented on the exterior of the phylogeny. Onchidoridoidea and Phyllidioidea were the only two superfamilies recovered as monophyletic in any of the three analyses (figure 3a–c). Onchidoridoidea was recovered as monophyletic only in the CA (pp = 0.99, bs = 46). Phyllidioidea was monophyletic with little to no support in CA (pp = 0.94; bs = 47) and MA (pp = 0.75, bs = 34) analyses. Most families, however, were recovered as monophyletic with high pp and bs support across all three alignments analysed. The Phyllidiidae and Calycidorididae were the only two families that were not supported in all three analyses. In addition, Chromodorididae, Dendrodorididae, Dorididae and Polyceridae were consistently recovered as not monophyletic. The Doriopsilla clade was recovered more closely related (GA: pp = 1.00, bs = 50; MA: pp = 1.00, bs = 65) to a monophyletic Phyllidiidae than to the Dendrodoris clade. This strongly suggests the Dendrodorididae is not monophyletic. Interestingly, Cadlinidae was recovered as sister to the Phyllidioidea in MA (pp = 0.67, bs = 13) and CA (pp = 0.64), but in our GA estimate Cadlinidae was sister (pp = 1.00, bs = 17) also to a clade formed by Phyllidiidae and Doriopsilla. Aphelodoris was recovered nested within Discodorididae and sister to Atagema cf. osseosa (CA: pp = 0.63, bs = 52; GA: pp = 0.98, bs = 73; MA: pp = 0.95, bs = 67). Furthermore, Vayssierea sp. and Gymnodoris sp. were recovered nested in the Polyceridae and sister to Polycera quadrilineata (CA: pp = 1.00, bs = 64; GA: pp = 0.98, bs = 77; MA: pp = 0.53, bs = 86). The Polyceroidea, excluding Aegiridae and Hexabranchidae, was consistently recovered closely related to the Chromodorididae (CA: pp = 0.69; GA: pp = 0.99, bs = 36; MA: pp = 0.97, bs = 53). In addition, all three analyses recovered similar topologies that suggest Cadlinella orniatisma to be sister to Hexabranchus sanguineus. MA is the only phylogenetic estimate that recovered significant support for this relationship (pp = 0.96, bs = 82). Onchimira cavifera was recovered nested in the Onchidorididae and sister (CA: pp = 0.98, bs = 82; GA: pp = 0.77, bs = 51; MA: pp = 0.99, bs = 72) to a clade formed by Knoutsodonta and Onchidoris (CA: pp = 1.00, bs = 86; GA: pp = 1.00, bs = 87; MA: pp = 1.00, bs = 91). There were only two instances where long branches were evident in the Doridina; Vayssierea sp. and a clade formed by Dendrodoris arborescens, D. fumata, D. guttata and D. nigra. Analysis of nDNA (electronic supplementary material, figure S1) and mtDNA data (electronic supplementary material, figure S2) suggests that the long branches may be an artefact of mtDNA data. The nDNA tree depicts no significantly long branches within the Doridina and recovers a monophyletic Dendrodoris clade, which contradicts our mtDNA tree that recovered a polyphyletic Dendrodoris.

Discussion

Resolving the Doridina

Disparity between phylogenetic estimates can confound interpretations and conclusions regarding processes and patterns of lineage diversification. Conflicting estimates are usually consequences of opposing methodologies, which have centred on taxonomic sampling [75-77], molecular markers [12,78], phylogenetic estimates [79-81] and alignment construction [2,3,14,79]. We examined molecular markers, which have been used in varying arrangements, that are most commonly used in nudibranch phylogenetics [21,23,24,49,60-62,82-92]. The large spikes observed in rDNA fragments for our MA and CA are probably a result of highly variable regions or ambiguous sequence calls, which ML is poor at estimating [66], thus overestimating PI towards the tips of their respective tree. Furthermore, PI profiles suggest that information was lost when these regions were excluded from our analyses, and resulted in surprisingly high loss of parsimony informative characters (25%–55%) for rDNA markers. The removal of these variable regions has been shown to negatively affect phylogenetic estimates [9] and may explain why some relationships were not consistently recovered. This supports the position that highly variable loop regions can be vital in resolving some phylogenetic relationships [8]. Unfortunately, we were unable to increase the resolution of the dorid tree by any of our three MSA construction methods. All three PI profiles were fairly consistent in depicting similar curves, and as we have been able to show, these markers are more appropriate for phylogenetic estimates at family- or higher-level classifications. Another issue we encountered was the potential noise that was incorporated into our estimates by additional taxon sampling. Even though PI profiles suggest these markers were informative at family level, our increased taxonomic sampling may have hindered our ability to recover consistency across analyses. For example, we only recovered a monophyletic Onchidoridoidea in our CA phylogenetic estimate. By contrast, Hallas & Gosliner [62] recovered a mostly resolved monophyletic Onchidoridoidea with significant pp and bs support. Their taxonomic sampling, however, was much more focused and included histone 3 as an additional molecular marker. These contradictory estimates illustrate issues that can result from inappropriate taxonomic sampling [93] and noise incorporated into analyses with inclusion of highly divergent taxa. In a few instances, however, our expansive taxonomic sampling has illuminated the relationships of some problematic groups, specifically Aphelodoris, Cadlinidae, Cadlinella, Hexabranchidae and Polyceridae, but in relation to the Onchidoridoidea our estimates contradicted previous highly supported hypotheses. Even though we were able to include Onchimira cavifera, it is relevant to state that we were unable to procure other morphologically unique species that may have affected our ability to resolve some family relationships in the Doridina (e.g. Colga, Goslineria, Hoplodoris, Kalinga, Murphydoris, Otinodoris). These findings illustrate that each phylogenetic query has its own set of challenges and optimal sampling strategy [94], and that the focus for each investigation should be carefully calculated.

Doridina relationships

We were not able to confidently investigate patterns of biogeographical, morphological or chemical evolution due to the lack of resolution at the base of our phylogenetic estimates. The present study, however, offers some consistent new insights into Doridina relationships, in part due to our increased taxonomic sampling. This work reinforces the conclusion from previous studies that traditional phanerobranch and cryptobranch groupings are not monophyletic [36,59,62]. Even though there was only moderate support in our GA, Cadlinidae does appear to be closely related to at least some members of the Porostomata, despite the ambiguous position of some porostomes such as Dendrodoris and Mandelia. In addition, we consistently recovered Gymnodorididae and Okadaiidae nested within the Polyceridae, which together are closely related to the Chromodorididae and Hexabranchidae. Unexpectedly, Cadlinella was recovered sister to the Hexabranchidae in our phylogenetic estimates. Cadlinella was originally included into the Chromodorididae based on morphological similarities [95] and further supported by molecular studies [84,93]. Our broader taxon sampling, however, consistently recovered Cadlinella sister to Hexabranchus and that both of these taxa, together with the Polyceridae, are closely related to the Chromodorididae. This also is supported by the sperm ultrastructure of Cadlinella, which has been shown to be divergent from members of the Chromodorididae [96]. In addition, the yellowish northeastern Pacific species of Doriopsilla were thought to represent a species complex of closely related taxa, but this assumption was not tested by including any species from outside the complex, other than the outgroup taxon D. spauldingi [97]. In our analysis, we included D. miniata from Japan and D. janaina, another eastern Pacific species that has divergent colouration. In all three of our analyses, the ‘species complex’ suggested by Hoover et al. [97] includes members of two separate lineages, rather than a single radiation. This suggests that these species with yellowish colouration and white spots evolved similar colouration convergently rather than by means of radiation from a single common ancestor. Lastly, the evolution of the gill pocket is further confounded by the recovery of Onchimira cavifera nested within the Onchidorididae. Onchimira cavifera was described as having both cryptobranch and phanerobranch characteristics, and hypothesized as a missing link in the current understanding of gill reduction [37,62,98,99]. Onchimira possess all the characteristics of a phanerobranch: buccal pump, rectangular rachidian tooth and hooked shaped first lateral tooth, but also possesses a fully formed gill pocket and retractable gill, which is typical of cryptobranch dorids. Surprisingly, Onchimira is not closely related to the only other two members of the Onchidorididae that possess similar gill structure to the Cryptobranchia, Calycidoris and Diaphorodoris [62], but instead nested within the Onchidorididae. Based on our estimates, it is unclear how or under what conditions the gill pocket might have evolved or was lost throughout the Doridina because of the lack of resolution at the base of the tree.

Molecular evolution

It is unclear why there are such large inconsistencies between mtDNA and nDNA phylogenies regarding Dendrodoris. To confirm if there was sequencing error, we examined additional specimens of D. fumata and D. nigra to compare to those on GenBank [61,100]. Surprisingly, all sequences collected were identical. Our inclusion of D. atromaculata and D. denisoni, however, suggested that there are possible highly divergent regions among mtDNA sequences. We were unable to compare other species of Dendrodoris from Hirose et al. [100] because they only analysed COI. A complete sampling of Dendrodoris is needed to fully comprehend the discrepancies between mtDNA and nDNA sequences. Furthermore, there appears to be no molecular distinction between Aegires citrinus and A. serenae. Both species are clearly defined by morphological characteristics [101], but both the mtDNA and the nDNA suggest they are in fact the same species. Much like in the Dendrodorididae, it is unclear what molecular mechanisms might have influenced our observations. Further investigations are needed, but are beyond the scope of our data.

Conclusion

We decided to take an approach that used three common methods used in MSA construction for Nudibranchia phylogenetics. As expected, our findings suggested that MSA methodology affected phylogenetic estimates of the Doridina, especially regarding how we decided to align highly variable rDNA regions. We were able to show that the most commonly sequenced molecular markers for the Nudibranchia lacked the robustness to resolve the base of the dorid tree, and manipulation of highly variable regions affected our ability to recover consistent phylogenetic estimates. This effect, however, is most probably dependent upon the size and scope of the phylogenetic query and amount of missing data. These markers are better suited for higher-level classifications as suggested by our PI profiles. Even though the base of the Doridina was unresolved, family-level classifications were mostly supported across our three analyses, and families that were recovered as non-monophyletic were consistent between alignments. Our analyses suggest that the exclusion of variable regions may have weakened our ability to resolve the base of the Doridina, but previous studies that used much larger datasets have benefited from removing these regions (e.g. [109,110]). Even though the focus of the present study was to understand MSA construction, our estimates of the Doridina also give a frame of reference for allowing more intensive queries into specific family evolutions. For example, the evolution of caryophyllidia in the Discodorididae, molecular evolution in Aegiridae and Dendrodorididae, or the relationships pertaining to the Polyceridae, Okakaiidae, Gymnodorididae, Hexabranchidae and Chromodorididae, which are some of the most morphologically unique and chemically distinct families. Nudibranch studies unquestionably suffer from a lack of abundant and diverse molecular markers. Studies have argued that increasing molecular markers could resolve problematic relationships [94,102], but an increase in molecular markers does not resolve issues regarding homology and variable region alignments. Automated filtering protocols allow for MSAs to be easily replicated and eliminate the uncertainty of manual curation of alignments; however, these methods are not without error. In addition, there has been little consensus on the soundest method of increasing signal to resolve phylogenies [6]. Genomic tools, which only have recently been used to investigate nudibranch [103] and larger opisthobranch phylogenetics [104,105], have potential of resolving dorid relationships. However, genomic applications also suffer from alignment and homology issues [106,107]. Phylogenetic resolution of the Doridina can greatly benefit from a genomic approach, but it is important to emphasize the critical role MSAs and homology have on phylogenetic studies. Owing to the varying size and scope of molecular and taxonomic sampling, we strongly recommend the exploration of multiple MSA construction methods that can aid in the selection of an approach that best suits the data.
  58 in total

1.  Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis.

Authors:  J Castresana
Journal:  Mol Biol Evol       Date:  2000-04       Impact factor: 16.240

Review 2.  Potential applications and pitfalls of Bayesian inference of phylogeny.

Authors:  John P Huelsenbeck; Bret Larget; Richard E Miller; Fredrik Ronquist
Journal:  Syst Biol       Date:  2002-10       Impact factor: 15.683

3.  Phylogenetic relationships among Opisthobranchia (Mollusca: Gastropoda) based on mitochondrial cox 1, trnV, and rrnL genes.

Authors:  Cristina Grande; Josè Templado; J Lucas Cervera; Rafael Zardoya
Journal:  Mol Phylogenet Evol       Date:  2004-11       Impact factor: 4.286

4.  Why would phylogeneticists ignore computerized sequence alignment?

Authors:  David A Morrison
Journal:  Syst Biol       Date:  2009-03-25       Impact factor: 15.683

5.  More genes or more taxa? The relative contribution of gene number and taxon number to phylogenetic accuracy.

Authors:  Antonis Rokas; Sean B Carroll
Journal:  Mol Biol Evol       Date:  2005-03-02       Impact factor: 16.240

6.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2006-08-23       Impact factor: 6.937

7.  A test of color-based taxonomy in nudibranchs: Molecular phylogeny and species delimitation of the Felimida clenchi (Mollusca: Chromodorididae) species complex.

Authors:  Vinicius Padula; Juliana Bahia; Isabella Stöger; Yolanda Camacho-García; Manuel António E Malaquias; Juan Lucas Cervera; Michael Schrödl
Journal:  Mol Phylogenet Evol       Date:  2016-07-18       Impact factor: 4.286

8.  A tale that morphology fails to tell: a molecular phylogeny of Aeolidiidae (Aeolidida, Nudibranchia, Gastropoda).

Authors:  Leila Carmona; Marta Pola; Terrence M Gosliner; Juan Lucas Cervera
Journal:  PLoS One       Date:  2013-05-02       Impact factor: 3.240

9.  PhyDesign: an online application for profiling phylogenetic informativeness.

Authors:  Francesc López-Giráldez; Jeffrey P Townsend
Journal:  BMC Evol Biol       Date:  2011-05-31       Impact factor: 3.260

10.  Current Methods for Automated Filtering of Multiple Sequence Alignments Frequently Worsen Single-Gene Phylogenetic Inference.

Authors:  Ge Tan; Matthieu Muffato; Christian Ledergerber; Javier Herrero; Nick Goldman; Manuel Gil; Christophe Dessimoz
Journal:  Syst Biol       Date:  2015-06-01       Impact factor: 15.683

View more
  2 in total

1.  Toxicity and taste: unequal chemical defences in a mimicry ring.

Authors:  Anne E Winters; Nerida G Wilson; Cedric P van den Berg; Martin J How; John A Endler; N Justin Marshall; Andrew M White; Mary J Garson; Karen L Cheney
Journal:  Proc Biol Sci       Date:  2018-06-13       Impact factor: 5.349

2.  Thirteen New Plastid Genomes from Mixotrophic and Autotrophic Species Provide Insights into Heterotrophy Evolution in Neottieae Orchids.

Authors:  Félix Lallemand; Maria Logacheva; Isabelle Le Clainche; Aurélie Bérard; Ekaterina Zheleznaia; Michał May; Marcin Jakalski; Étienne Delannoy; Marie-Christine Le Paslier; Marc-André Selosse
Journal:  Genome Biol Evol       Date:  2019-09-01       Impact factor: 3.416

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.