Literature DB >> 24453557

DNA barcodes identify Central Asian Colias butterflies (Lepidoptera, Pieridae).

Juha Laiho1, Gunilla Ståhls2.   

Abstract

A majority of the known Colias species (Lepidoptera: Pieridae, Coliadinae) occur in the mountainous regions of Central-Asia, vast areas that are hard to access, rendering the knowledge of many species limited due to the lack of extensive sampling. Two gene regions, the mitochondrial COI 'barcode' region and the nuclear ribosomal protein RpS2 gene region were used for exploring the utility of these DNA markers for species identification. A comprehensive sampling of COI barcodes for Central Asian Colias butterflies showed that the barcodes facilitated identification of most of the included species. Phylogenetic reconstruction based on parsimony and Neighbour-Joining recovered most species as monophyletic entities. For the RpS2 gene region species-specific sequences were registered for some of the included Colias spp. Nevertheless, this gene region was not deemed useful as additional molecular 'barcode'. A parsimony analysis of the combined COI and RpS2 data did not support the current subgeneric classification based on morphological characteristics.

Entities:  

Keywords:  Barcoding; COI; Central-Asia; Colias; RpS2

Year:  2013        PMID: 24453557      PMCID: PMC3890677          DOI: 10.3897/zookeys.365.5879

Source DB:  PubMed          Journal:  Zookeys        ISSN: 1313-2970            Impact factor:   1.546


Introduction

The use of a standardized gene region, i.e. a 650 bp fragment of the 5’-region of the mitochondrial cytochrome c oxidase subunit I (hereafter COI), as a DNA barcode (Hebert et al. 2003), to facilitate identification of biological specimens, as well as for calling attention to possible new species has generated a steadily increasing number of DNA barcoding studies of invertebrates (Taylor and Harris 2012), and particularly of Lepidoptera (see www.lepbarcoding.org). While the utility of DNA barcoding as an investigative tool has gained much support, there still remain a number of problems related to the use of a single DNA sequence as a taxon barcode. Several studies on Lepidoptera have shown that species may be polymorphic and/or share haplotypes (Nice et al. 2002, Wahlberg et al. 2003, Elias et al. 2007, Schmidt and Sperling 2008), so that identifications may become less reliable. Additionally, it has been shown that incomplete lineage sorting or mitochondrial introgression could obscure the delimitation of closely related taxa (Tautz et al. 2003, Zakharov et al. 2009). Using one or a few specimens as representatives of a species indeed provides us with little information about their intraspecific variation, particularly for widely distributed species (e.g. Funk and Omland 2003, Seberg et al. 2003, Sperling 2003).

The genus

The butterfly genus Fabricius, 1807 is a genus of the family Pieridae (subfamily Coliadinae), comprising about 85 species. Most of its species have a limited distribution in the Arctic and Alpine regions of the Holarctic realm, but two species occur in the Afrotropical and seven are known from the Neotropical regions (Verhulst 2000). A few species are widely distributed and common, such as the Palaearctic (Esper, 1805) and (Geoffroy, 1785), and the Nearctic Boisduval, 1852 and Godart, 1819. As a consequence, these taxa are frequently used in ethological, ecological and genetic research (e.g. Pollock et al. 1998, Wang and Porter 2004, Porter and Levin 2010). and are a species pair where only typical specimens can be reliably distinguished morphologically, and members of these species are known to frequently hybridize (e.g. Dinca et al. 2011 and references therein). Lukhtanov et al. (2009) indicated that mitochondrial introgression was a likely explanation for the shared barcodes they registered between these sympatric taxa. The Nearctic taxa and are broadly sympatric sister species that hybridize frequently and that likely share a significant portion of their genomes through introgression (e.g. Wang and Porter 2004, Porter and Levin 2010). Verhulst (2000) illustrated hybrid individuals of six species of from the Palaearctic region, including . The Central Asian mountainous regions harbour nearly half of all PageBreak species. The distribution, ecology and taxonomy are still incompletely documented for most of these species, mainly due to their remote occurrences (Verhulst 2000). Central Asian species occurring in remote mountainous areas that are hard to access have been far less studied than their North American or European congeners. An important part of the older material that exists in museum collections worldwide (e.g. from Tibet) originates from early collecting expeditions in the late 19th and early 20th centuries. Important material was, however, also collected within the former Soviet Union during 20th century. Fieldwork in Central-Asia has subsequently become less complicated, and thus new material is again available for research. As a result of this, new species such as Verhulst, 1990 and Verhulst, 1991, have been described, as well as a number of new subspecies. Despite an increasing research effort on Central Asian species there are as yet no published studies on their phylogenetic relationships. The first contribution to the species classification of was given by Berger (1986), who used a few morphological characters to establish a comprehensive subgeneric classification, comprising the subgenera Fabricius, 1807, Berger, 1986, Berger, 1986, Watson, 1895, Berger, 1986, Berger, 1986, Butler, 1869 and Berger, 1986. Later, Ferris (1993) used 84, mainly morphological, characters to reconstruct a phylogeny of all North American species known at that time, which was the first species phylogeny within the genus . The first contribution to the knowledge of the molecular phylogenetic relationships of the North American species was made by Pollock et al. (1998), who studied a number of species using a 333 bp sequence fragment of the mtDNA COI gene. They found some small differences between species classified in the subgenera and , thus supporting Berger’s (1986) separation of from . Pollock et al. (1998) also noted that even though is a speciose genus, this was not mirrored in the COI sequence diversity. Wheat and Watt (2008) studied the molecular phylogenetic relationships of North American taxa using mitochondrial gene sequences (ribosomal 12S and 16S rRNA, Leu2 and Val tRNA and COI + II). Their results showed that the COI sequences only allowed identification of some of the taxa supported by the full data set used in their study. The results of their study further suggested that species radiations within are comparatively young as compared with those of related pierid butterflies, since molecular divergences among species were small. Based on molecular data Brunton (1998) studied the phylogenetic relationships of the 12 species occurring in Europe. He recovered three monophyletic groups largely corresponding to geographical distributions. He concluded that the Scandinavian species appeared to be the oldest in Europe, sharing a common ancestor with species from the USA. According to Brunton (1998) the European species radiated from Scandinavia to the rest of Europe forming an eastern clade and a western clade. As with Pollock et al. (1998), the results did not agree with Berger’s (1986) subgeneric classification. The aim of the present study was to test the usefulness of COI barcodes for species identification of a broad representation of Central Asian PageBreak species, including nine species overlapping with Lukhtanov et al.’s (2009) study, and 19 species not previously barcoded. In addition, we wanted to elucidate the informativeness of the RpS2 gene region that Wahlberg and Wheat (2008) found informative for lepidopteran phylogenetic relationships. We tested the nuclear ribosomal protein gene RpS2 as a potential complementary barcode region for and for use in a combined analysis with COI for testing the current subgeneric classification of the species in the present study. We also contrasted our COI barcodes against a larger set of COI barcodes of taxa available from GenBank (GB).

Materials and methods

Study area and taxon sampling

This study includes material from the mountain regions of Kirgizistan, Tadzhikistan, northern Afghanistan, northern Pakistan and India (e.g. mountain ranges Tian Shan, Hindu Kush, Karakorum, Himalaya) and the mountain regions in the Chinese provinces Qinghai, Gansu, Sichuan, Yunnan and the autonomous regions Tibet and Xinjiang Uygur. The fauna of these Central Asian regions comprises about 34 species (Verhulst 2000) while the species number for Central Asia in broad sense is over 40 species. The taxon sampling aimed to cover as many of the species from this area as possible. Additionally, a few species occurring in adjacent territories (e.g. Buryatia) were also available for molecular study. Whenever possible, several individuals of each species were analysed to assess intraspecific variation. The available specimens used for molecular study consisted of a total of 56 adult specimens covering 27 species of Central Asian and two species from adjacent territories (Table 1). The specimens are preserved as DNA voucher specimens and labelled accordingly, to be deposited in the collections of the Zoological Museum of Finnish Museum of Natural History, Helsinki, Finland (MZH) (DNA voucher specimens MZH_JL1-JL71). Species identifications were verified by JL based on easily recognizable diagnostic characters using the monograph by Verhulst (2000), while the taxonomy is according to Grieshuber and Lamas (2007). Additionally, we used 35 COI barcode sequences (17 species) of Palaearctic species obtained from GB, as listed in Table 2.
Table 1.

List of specimens used for molecular analyses including GenBank accession numbers.

SpeciesSexLocality and dateLab codeCOI accession numberRpS2 accession number
subgenus Colias Fabricius, 1807
Colias hyale (Linnaeus, 1758) irkutskana Stauder, 1923maleRussia, SW Transbaikalia, Buryatia, Selenga river district, Gusinoye Ozero village env., steppe rivulet valley, 7.6.2003MZH_JL35HE775142HE775198
Colias hyale (Linnaeus, 1758) irkutskana Stauder, 1923maleRussia, SW Transbaikalia, Buryatia, Selenga river district, Gusinoye Ozero village env., steppe rivulet valley, 7.6.2003MZH_JL44HE775143HE775199
subgenus Eriocolias Berger, 1986
Colias adelaidae adelaidae Verhulst, 1991maleChina, Gansu, Xia-He, 3400 m, 35°11'N, 102°31'E, 25.6.2004MZH_JL61HE775187HE775243
Colias alpherakii alpherakii Staudinger, 1882femaleKyrgyzstan, Alai mts., 4 km SE Tengizbai pass, 3400 m, 3.7.2001MZH_JL37HE775169HE775225
Colias alpherakii alpherakii Staudinger, 1882femaleKyrgyzstan, Alai mts., 4 km SE Tengizbai pass, 3400 m, 3.7.2001MZH_JL51HE775180HE775236
Colias berylla berylla Fawcett, 1904maleChina, S Tibet, Himalaya Mts., Lablungla pass, 4800 m, 18–22.7.2001MZH_JL48HE775178HE775234
Colias berylla berylla Fawcett, 1904maleChina, Tibet, Lhodak, 4600 m, 15.7.2002MZH_JL55HE775182HE775238
Colias christophi christophi Grum Grshimailo, 1885femaleTadjikistan, Turkestanskyi Mts., Kumbel pass, 3000 m, July 2002MZH_JL45HE775175HE775231
Colias christophi helialaica Schulte, 1988maleKyrgyzstan, Alai Mts., W end of Tengizbai pass, 3700 m, 5–6.7.2001MZH_JL67HE775192HE775246
Colias cocandica cocandica Erschoff, 1874maleKyrgyzstan, Suusamyr Mt. r., Alabel pass, 3200 m, 10.7.2002MZH_JL43HE775174HE775230
Colias cocandica hinducucica Verity, 1911maleTajikistan, E Pamir, Ak-Buura Mts., 4250 m, 14–15.7.2003MZH_JL34HE775168HE775224
Colias cocandica pljushtchi Verhulst, 2000maleKyrgyzstan, Sary Dzhaz riv. bas., Kaindy-Ketta mts., Tashkoro village, 3000 m 10.7.2003MZH_JL19HE775160HE775216
Colias eogene C. et R. Felder, [1865] elissa Grum Grshimailo, 1890maleKyrgyzstan, W end of Tengizbai pass, 3700 m, 5–6.7.2001MZH_JL1HE775144HE775200
Colias eogene C. et R. Felder, [1865] elissa Grum Grshimailo, 1890maleKyrgyzstan, W end of Tengizbai pass, 3700 m, 5–6.7.2001MZH_JL40HE775171HE775227
Colias fieldii Ménétriés, 1855 chinensis Verity, 1909maleChina, Sichuan, Zhangia, 3000 m, 32°47'N, 103°36'E, 6.6.2002MZH_JL50HE775179HE775235
Colias fieldii Ménétriés, 1855 chinensis Verity, 1909femaleChina, Gansu, Shin-Long-Shan, 2800 m, 35°48'N, 103°59'E, 29.6.2004MZH_JL60HE775186HE775242
Colias grumi grumi Alphéraky, 1897femaleChina, Gansu, Altun Shan, road from Aksay to Danjing pass, 2500–2800 m, 22–23.7.2002MZH_JL54HE775197-
Colias heos heos (Herbst, 1792)maleRussia, SW Transbaikalia, Buryatia, Selenga river district, Gusinoye Ozero village env., steppe rivulet valley, 1.7.2003MZH_JL39HE775170HE775226
Colias heos heos (Herbst, 1792)maleRussia, SW Transbaikalia, Buryatia, Selenga river district, Gusinoye Ozero village env., steppe rivulet valley, 1.7.2003MZH_JL46HE775176HE775232
Colias lada lada Grum Grshimailo, 1891maleChina, Sichuan, Maningano surr., 31°56'N, 99°12'E, 4500 m, 15.6.2002MZH_JL7HE775150HE775206
Colias lada lada Grum Grshimailo, 1891maleChina, Sichuan, Maningano surr., 31°56'N, 99°12'E, 4500 m, 15.6.2002MZH_JL27HE775165HE775221
Colias ladakensis Felder, 1865 seitzi Bollow, 1939maleChina, SW Tibet, Himalaya Mts., 100km W Paryang, 4650–5000 m, 13.6.2004MZH_JL4HE775147HE775203
Colias ladakensis Felder, 1865 seitzi Bollow, 1939maleChina, SW Tibet, Himalaya Mts., 100km W Paryang, 4650–5000 m, 13.6.2004MZH_JL57HE775183HE775239
Colias marcopolo marcopolo Grum Grshimailo, 1888maleTadjikistan, E Pamir, Dunkeldyk Lake, 4400 m, 25.7.2003MZH_JL30HE775166HE775222
Colias marcopolo marcopolo Grum Grshimailo, 1888maleTadjikistan, E Pamir, Dunkeldyk Lake, 4400 m, 25.7.2003MZH_JL33HE775167HE775223
Colias marcopolo marcopolo Grum Grshimailo, 1888maleTadjikistan, E Pamir, Dunkeldyk Lake, 4400 m, 25.7.2003MZH_JL41HE775172HE775228
Colias montium montium Oberthür, 1886maleChina, Sichuan, Maningano surr., 31°55'N, 99°12'E, 4000 m, 9–18.6.2004MZH_JL59HE775185HE775241
Colias nebulosa Oberthür, 1894 sungpani Bang-Haas, 1927maleChina, Sichuan, Maningano surr., 31°56'N, 99°12'E, 4500 m, 15.6.2002MZH_JL9HE775152HE775208
Colias nebulosa Oberthür, 1894 sungpani Bang-Haas, 1927maleChina, Sichuan, Maningano surr., 31°56'N, 99°12'E, 4500 m, 15.6.2002MZH_JL24HE775162HE775218
Colias nebulosa Oberthür, 1894 sungpani Bang-Haas, 1927maleChina, Sichuan, Maningano surr., 31°56'N, 99°12'E, 4500 m, 15.6.2002MZH_JL26HE775164HE775220
Colias nina Fawcett, 1904 hingstoni Riley, 1923maleChina, SW Tibet, Himalaya Mts., 60 km S Saga, 4600–5000 m, 7–8.6.2004MZH_JL53HE775181HE775237
Colias nina Fawcett, 1904 hingstoni Riley, 1923maleChina, SW Tibet, Himalaya Mts., Lablongla pass, 4800 m, 5.6.2004MZH_JL58HE775184HE775240
Colias regia regia Grum Grshimailo, 1887maleKyrgyzstan, Kaindy-Ketta Mt. r., Kumar pass, 3200 m, 12.7.2003MZH_JL8HE775151HE775207
Colias regia regia Grum Grshimailo, 1887maleKyrgyzstan, Kaindy-Ketta Mt. r., Kumar pass, 3200 m, 12.7.2003MZH_JL42HE775173HE775229
Colias romanovi romanovi Grum Grshimailo, 1885maleKyrgyzstan, Alai mts., 4 km SE Tengizbai pass, 3400 m, 7–8.7.2001MZH_JL3HE775146HE775202
Colias romanovi romanovi Grum Grshimailo, 1885maleKyrgyzstan, Alai mts., 4 km SE Tengizbai pass, 3400 m, 7–8.7.2001MZH_JL47HE775177HE775233
Colias sieversi sieversi Grum Grshimailo, 1887maleTadjikistan, Peter I Mts., Ganishob, 2400 m, 17.6.2004MZH_JL70HE775195-
Colias sifanica sifanica Grum Grshimailo, 1891maleChina, Gansu, Xia-He, 3400 m, 35°11'N, 102°31'E, 25.6.2004MZH_JL11HE775154HE775210
Colias sifanica sifanica Grum Grshimailo, 1891maleChina, Gansu, Xia-He, 3400 m, 35°11'N, 102°31'E, 25.6.2004MZH_JL64HE775189HE775245
Colias staudingeri Alphéraky, 1881 pamira Grum Grshimailo, 1890maleKyrgyzstan, Zaalaisky (Transalai) Mts., Altyn Dara river, 3000 m, 25.7.2000MZH_JL2HE775145HE775201
Colias staudingeri Alphéraky, 1881 pamira Grum Grshimailo, 1890maleKyrgyzstan, Zaalaisky (Transalai) Mts., Altyn Dara river, 3000 m, 25.7.2000MZH_JL13HE775156HE775212
Colias staudingeri Alphéraky, 1881 pamira Grum Grshimailo, 1890maleKyrgyzstan, Zaalaisky (Transalai) Mts., Altyn Dara river, 3000 m, 25.7.2000MZH_JL23HE775161HE775217
Colias stoliczkana stoliczkana Moore, 1882maleIndia, Jammu Kashmir, Ladakh Range, Markha Valley, Ganda Pass, 4600 m, 12.7.2001MZH_JL15HE775158HE775214
Colias thisoa Ménétriés, 1832 aeolides Grum Grshimailo, 1890maleKyrgyzstan, Sary Dzhaz riv. bas., Kaindy-Ketta mts., Tashkoro village, 3000 m, 10.7.2003MZH_JL10HE775153HE775209
Colias thisoa Ménétriés, 1832 aeolides Grum Grshimailo, 1890femaleKyrgyzstan, Sary Dzhaz riv. bas., Kaindy-Ketta mts., Tashkoro village, 3000 m, 10.7.2003MZH_JL17HE775159HE775215
Colias thisoa Ménétriés, 1832 aeolides Grum Grshimailo, 1890femaleKyrgyzstan, Sary Dzhaz riv. bas., Kaindy-Ketta mts., Tashkoro village, 3000 m, 10.7.2003MZH_JL25HE775163HE775219
Colias thrasibulus thrasibulus Fruhstorfer, 1910maleChina, W Tibet, Mandhata Mt., 4900 m, 15–16.7.2003MZH_JL14HE775157HE775213
Colias tibetana tibetana Riley, 1922maleChina, Tibet, Himalaya Mts., Nyalam, 4200 m, 8.7.2003MZH_JL6HE775149HE775205
Colias tibetana tibetana Riley, 1922maleChina, SW Tibet, Himalaya Mts., Nyalam, 3700–4200 m, 28–30.6.2004MZH_JL63HE775188HE775244
Colias wanda wanda Grum Grshimailo, 1907maleChina, Qinghai, 20km NW of Zhidoi City, 4700–5000 m, 16.7.2000MZH_JL66HE775191-
Colias wanda wanda Grum Grshimailo, 1907maleChina, S. Tibet, Cona, 4500–4700 m, 24–25.6.2004MZH_JL69HE775194-
Colias wiskotti Staudinger, 1882 draconis Grum Grshimailo, 1891maleUzbekistan, Chandalas Mts., Chakmksh village, 2600 m, 27.6.2004MZH_JL71HE775196-
Colias wiskotti Staudinger, 1882 hofmannorum Eckweiler, 2000maleIran, Khorasan, 75km SE of Birjand, 2200 m, 18–20.5.2002MZH_JL68HE775193-
Colias wiskotti Staudinger, 1882 separata Grum Grshimailo, 1888maleKyrgyzstan, Alai mts., 4km SE Tengizbai pass, 3400 m, 3.7.2001MZH_JL65HE775190-
subgenus Eucolias Berger, 1986
Colias tyche tyche (de Boeber, 1812)maleRussia, East Siberia, Lake Baikal, Khamar-Daban Mts., Slyudyanka river, taiga, 800 m, 14.6.2003MZH_JL5HE775148HE775204
Colias tyche tyche (de Boeber, 1812)maleRussia, East Sayan, Buryatia, Mondy env., Huruma river, 1500 m, 6.6.2002MZH_JL12HE775155HE775211
Table 2.

List of GenBank samples of the COI barcode used in this study.

SpeciesGenBank accession number
Colias alpherakiiFJ663407
Colias christophiFJ663409
Colias chrysotheme elenaFJ663410
Colias chrysotheme elenaFJ663411
Colias croceusEF457737
Colias croceusFJ663412
Colias croceusGU688507
Colias croceusHQ004279
Colias croceusHQ004282
Colias eogeneFJ663415
Colias eogeneFJ663416
Colias erate amdensisEF457736
Colias erate poliographusEF457735
Colias erate poliographusEU583852
Colias erate poliographusGU372561
Colias fieldiiEF584859
Colias hyaleFJ663418
Colias hyaleFJ663421
Colias hyaleHQ004297
Colias hyperboreaEF457739
Colias marcopoloFJ663422
Colias marcopoloFJ663423
Colias myrmidoneHQ004303
Colias phicomoneHM393178
Colias regiaFJ663427
Colias tamerlana mongolaFJ663424
Colias tamerlana mongolaFJ663425
Colias tamerlana mongolaFJ663426
Colias thisoa thisoaFJ663429
Colias tycheFJ663430
Colias wiskotti chrysopteraFJ663431
Colias wiskotti chrysopteraFJ663432
Colias wiskotti chrysopteraFJ663433
Colias wiskotti wiskottiFJ663435
Colias wiskotti wiskottiFJ663436
List of specimens used for molecular analyses including GenBank accession numbers. List of GenBank samples of the COI barcode used in this study.

Laboratory methods

Total genomic DNA was extracted form 2-5 legs of dried, pinned butterfly specimens using NucleoSpin® Tissue Kit (Machery-Nagel), according to manufacturer’s protocols, and resuspended in 50 µl ultrapure water. The primer pair LCO-1490 (5’-GGTCAACAAATCATAAAGATATTGG-3’) and HCO-2198 (5’-TAAACTTCAGGGTGACCAAAAAATCA-3’) (Folmer et al. 1994) was used to amplify a ca. 650 bp fragment of the mitochondrial COI gene. The polymerase chain reactions (PCR) were done under the following parameters: initial heating 95 °C for 2 min, following 30 cycles of 94 °C for 30 s, 49 °C for 30 s and 72 °C for 2 min, followed by a final extension of 72 °C for 7 min. The primer pair RpS2 PageBreakPageBreakPageBreakPageBreakPageBreaknF (5’-ATCWCGYGGTGGYGATAGAG-3’) and RpS2 nR (5’-ATGRGGCTTKCCRATCTTGT-3’) (Wahlberg and Wheat 2008) was used to amplify a ca. 400 bp fragment of the nuclear RpS2 gene. The PCR were carried out following the PCR cycling profile described in Wahlberg and Wheat (2008): initial heating 95 °C for 7 min, 40 cycles of 95 °C for 30 s, 50 °C for 30 s, 72 °C for 2 min, and a final extension period of 72 °C for 10 min. Sequencing of the double-stranded PCR product was carried out on an ABI PRISM® 377 Automated Sequencer (Applied Biosystems) following manufacturer’s recommendations. All PCR primers were used for sequencing. Sequences were inspected and edited using Sequence Navigator® (Applied Biosystems).

Sequence analysis

We analysed and clustered our sequence data using parsimony and Neighbour-Joining (NJ) of K2P-distances. We used parsimony and NJ for our newly generated COI sequence dataset, NJ for RpS2 sequences, parsimony for the concatenated COI and RpS2 sequences, and, finally, NJ for the combined COI sequences generated in this study and those in GB. All trees were rooted using (family Papilionidae) and (Pieridae, subfamily Pierinae) as outgroup taxa. Parsimony analysis was performed using NONA (Goloboff 1999) and spawn with the aid of Winclada (Nixon 2002), using a heuristic search algorithm with 1000 random addition replicates (mult*1000), holding 10 trees per round (hold/10), max trees set to 10 000 and applying TBR branch swapping. All base positions were treated as equally weighted characters. Nodal support was assessed with bootstrap resampling (1000 replicates) using Winclada (Nixon 2002). MEGA5 (Tamura et al. 2011) was used for NJ clustering using 1000 bootstrap replicates. The Kimura 2-parameter model was used for NJ clustering of the COI sequences, while the Tamura-Nei model with gamma distributed rates was chosen for the RpS2 sequences.

Results

Sequences

We obtained a 643 bp COI barcode for 56 specimens, and a 409 bp fragment of RpS2 was obtained for 49 specimens (Table 1). A+T content of the COI sequences was 69.22%, and of the RpS2 45.0%. There were 115 parsimony informative sites for COI and 39 for RpS2. Uncorrected pairwise divergences between ingroup taxa ranged between 1.09 and 4.09% (mean 2.77%) for COI and 0.0–1.7% (mean 1.0%) for RpS2. GenBank accession numbers are given in Table 1. Intraspecific uncorrected distances were up to 1.09% (in ) for COI, with specimens of most species differing by less than 4 nucleotide changes.

Identification: COI vs. RpS2

The parsimony analysis of the new COI sequences yielded four equally parsimonious trees (CI = 0.59, RI = 0.75) the strict consensus tree of which is presented in Figure 1. The NJ tree is presented in Figure 2.
Figure 1.

Strict consensus cladogram of COI sequences obtained in this study.

Figure 2.

Neighbour-Joining tree using the K2P-model for the COI sequences obtained in this study.

The majority of the species could be identified with COI alone, as no COI haplotypes were shared between species. Both parsimony and NJ trees recovered 25 (out of 28) species as monophyletic groups (Figures 1–2). Neither , nor formed monophyletic entities, as their sequences were scattered over various parts of the trees. The two samples of were not recovered as sister taxa, for sample MZH_JL5 appeared as sister taxon of . The overall topologies of the parsimony and NJ trees were identical, except for the placement of . Parsimony placed the taxon as sister to a clade of five taxa (Figure 1), while NJ placed it as sister to (Figure 2). The external morphology of is rather different from that of , while some similarities can be found between and , , and (Figure 1). Only 17 of the 39 parsimony informative sites of RpS2 were variable among the 49 ingroup members. NJ only recovered few species as separate lineages due to the shallow divergences (Figure 3). The information content of this gene region is best interpreted as a character-based diagnostic table, as suggested by DeSalle et al. (2005). This gene region yielded species specific (diagnostic) haplotypes for 11 species out of 33 (Table 3).
Figure 3.

Neighbour-Joining tree using the Tamura-Nei model with gamma distributed rates for the RpS2 sequences.

Table 3.

Species haplotypes for 17 variable positions of RpS2 for Central Asian species (RpS2 data matrix positions no 14, 152, 170, 176, 189, 191, 194, 195, 218, 284, 287, 302, 341, 353, 356, 365, 380).

Haplotypepositions of RpS2
MZH_JL35_hyaleTCCCCGGGTCCATTTTC
MZH_JL44_hyaleTCCCCGGGTCCATTTTC
MZH_JL02_staudingeriTCCTCGAGTTCAAATCC
MZH_JL13_staudingeriTCCTCGAGTTCAAATCC
MZH_JL23_staudingeriTCCTCGAGTTCAAATCC
MZH_JL43_cocandica_cocandicaTCCCCGAGTTCAAATCC
MZH_JL41_marcopoloTACCCGAGTTCAAAACC
MZH_JL30_marcopoloTACCCGAGTTCAAAACC
MZH_JL07_ladaTCCCAAAAGTCGATTCC
MZH_JL27_ladaTCCCAAAAGTCGATTCC
MZH_JL25_thisoaTCCCAAAAGTCGATTCC
MZH_JL10_thisoaTCCCAAAAGTCGATTCC
MZH_JL17_thisoaTCCCAAAAGTCGATTCC
MZH_JL05_tycheTCCCAAAAGTCGATTCC
MZH_JL12_tycheTCCCAAAAGTCGTTTCC
MZH_JL39_heosTCCCAAAAGTCGATTCC
MZH_JL46_heosTCCCAAAAGTCGATTCC
MZH_JL53_ninaTCCCAAAAGTCGATTCC
MZH_JL58_ninaCCCCCGAAGTCGATTCC
MZH_JL11_sifanicaTCCCCGAGGTCGWTTCC
MZH_JL64_sifanicaTCTCCGAGGTCGATTCC
MZH_JL57_ladakensisTCCCCGAGGTCGATTCC
MZH_JL06_tibetanaTCCTCGAGGTTATTTCC
MZH_JL09_nebulosaTCCTCGAGGTTATTTCC
MZH_JL26_nebulosaTCCTCGAGGTTATTTCC
MZH_JL14_thrasibulusTCCTCGAGGTTATTTCC
MZH_JL01_eogeneTCCTCGAGGTTATTTCT
MZH_JL04_ladakensisTCTCCGAGGTTATTTCC
MZH_JL15_stoliczkanaTCTCCGAGGTTGTTTCT
MZH_JL19_cocandica_pljushtchiTCCTCGAGTTCATTTCC
MZH_JL34_cocandica_hinducuciaTCCTCGAGTTCATTTCC
MZH_JL03_romanoviTCCTCGAGTTCATTTCC
MZH_JL08_regiaTCCCCGAGTTCATTTCT
MZH_JL42_regiaTCCCCGAGTTCATTTCT
MZH_JL47_romanoviCCCTCGAGTTCATTTCC
MZH_JL51_alpherakiiTCCCCGAGTTCATTTCC
MZH_JL37_alpherakiiCACCCGAGTTCATTTCC
MZH_JL67_christophi_christophiTCCTCGAGTTCATTTCC
MZH_JL45_christophi_kaliTCCTCGAGTTCGTTTCC
MZH_JL40_eogeneTCCTCGAGGTTGTTTCT
MZH_JL24_nebulosaTCCTCGAGGTCGTTTCC
MZH_JL59_montiumCCCTCGAGGTTGTTTCC
MZH_JL61_adelaidaeTCCTCGAGGTCGTTTCC
MZH_JL60_fieldiiTCCTCGAGGTTATTTCC
MZH_JL50_fieldiiTCCTCGAGGTTATTTCT
MZH_JL33_marcopoloTCCCCGAGGTCATTACT
MZH_JL63_tibetanaTCCTCGAGGTTATWTCC
MZH_JL48_beryllaTCCCCGAGGTCGAATCC
MZH_JL55_beryllaTCCCCGAGGTCGAATCC
Strict consensus cladogram of COI sequences obtained in this study. Neighbour-Joining tree using the K2P-model for the COI sequences obtained in this study. Neighbour-Joining tree using the Tamura-Nei model with gamma distributed rates for the RpS2 sequences. Species haplotypes for 17 variable positions of RpS2 for Central Asian species (RpS2 data matrix positions no 14, 152, 170, 176, 189, 191, 194, 195, 218, 284, 287, 302, 341, 353, 356, 365, 380).

Analysis of the concatenated COI + RpS2 data

The parsimony analysis of COI + RpS2 yielded nine trees of length 560 steps (CI = 0.63, RI = 0.72), the strict consensus tree of which is shown in Figure 4. , and were not monophyletic and had the same position as in the COI cladogram (Figure 1).
Figure 4.

Strict consensus cladogram of the concatenated data set of COI + RpS2.

Strict consensus cladogram of the concatenated data set of COI + RpS2.

Analysis of all the COI sequences

The strict consensus cladogram for all the available COI data resolved the taxa in the same positions as in the tree of the new COI sequences only. For ten species of the present study sequences were also available from GB. Sequences of most species clustered together as monophyletic entities, except for , , and . For the GB sequence (GB accession no FJ663427) did not cluster together with our sequences. The GB barcodes of and were shared by these two taxa. Neither the Himalayan and south Tibetan adjacent mountain PageBreakPageBreak fauna (, , , , , ), nor the east Tibetan, Qinghai, Gansu and Sichuan species aggregates (, , , , , , ) were resolved as species clusters similar to the Tian Shan, Pamir and Hindukush species. Several COI haplotypes were noted for a few species, even among specimens obtained from the same locality (e.g. and ). Taxa not resolved as monophyletic clusters were the species and . All the included subspecies of (, and ) showed distinct COI sequences, with as most different.

Discussion

Barcoding

Lukhtanov et al. (2009) tested the utility of COI barcodes for Central Asian butterflies by sampling specimens from a considerable geographical range. They observed that this substantially increased intraspecific variation reducing the interspecific divergences (“barcoding gap”), but that this did not hamper species identification. The present study shows that most taxa form monophyletic entities that can be identified with COI data alone. The RpS2 gene region showed identical sequences in and (Table 3, Figure 3), differing by only three nucleotides from . Based on the molecular data the recognition of these subspecies is not or weakly supported. The fact that the three PageBreak samples were scattered over different parts of the COI tree might be the result of a laboratory contamination due to carry over between samples. The samples were collected on the same day and in the same place. is morphologically distinct from other species, excluding possible misidentification. The RpS2 data, however, could point to two morphologically cryptic species in sympatry (samples MZH_JL24 vs. MZH_JL9 and MZH_JL26), so that the different COI barcodes might represent numts, despite no apparent ‘signs’ (no indels). This discrepancy between morphology and DNA sequence data emphasises the necessity to use multiple samples to detect this sort of challenging issues. Even though and did not form monophyletic groups our results show that COI barcodes are useful for (1) identifying Palaearctic and Central Asian , (2) pointing to a possible cryptic species, and (3) highlighting the necessity to further investigate the question on the subspecific rank of . The utility of RpS2 as a species barcode for spp. is clearly more limited, since e.g. , , , of the subgenus and (subgenus ) have identical sequences (Table 3, Figure 3). Still, RpS2 yielded species specific (diagnostic) haplotypes for 11 species of the subgenus and for (subgenus s.str.).

Congruence with traditional classification: analysis of concatenated COI + RpS2

The strict consensus tree was more resolved than either of the trees resulting from separate analyses of the gene regions (Figure 4). Although the concatenated data did not resolve the phylogenetic relationships among all species, some observations can be made. The majority of the species confined to the adjacent Tian Shan, Pamir and Hindukush mountain ranges form a well supported clade. This includes , , , , , , and . Yet, , which also occurs in these mountain ranges (Peter I and Khozratishoh mountains), was not included in this clade. is morphologically most similar to , thus showing another case of disagreement between morphological and DNA sequence data. , too, lives in the aforementioned mountain ranges, but it has a wider distribution, stretching from Turkey to the Altai Mountains. A third taxon, , is considered closely related to (e.g. Verhulst 2000), a species occurring in southern Siberia and Mongolia. Thus, the origin of and may differ from that of the species confined to the Tian Shan, Pamir and Hindukush mountain range. One sample of (MZH_JL43) was placed within this “mountain” clade, while the other two samples appeared as sister taxa to the Himalayan species . As with , our DNA data disagree with the morphological characters, but it should be noted that this clade is not well supported. Conversely, two morphologically similar Himalayan species, viz. and , were assigned to different clades. In the COI + RpS2 tree they were placed in different, more encompassing species clusters (Figure 4), in the COI NJ tree they were joined with and (Figure 2), while the COI cladogram resolved these taxa together with , , and (Figure 1). The analyses did not support the monophyly of the subgenera and sensu Berger (1986). The species was not resolved as a separate monophyletic lineage, but was resolved into . This is congruent with the results of Pollock et al. (1998) and Brunton (1998). Only the the subgenus , here represented by , is supported as a distinct lineage, placed as sister to all other sp.

Barcodes of Palaearctic spp.

The parsimony (Figure 5) and NJ analyses (Figure 6) of the larger matrix of Palaearctic COI barcodes (total COI) recovered the same species clusters, but some of the species show different placements (e.g. , ). This is not surprising as all internal nodes are very shallow. The samples of and show very low sequence difference, morphologically these taxa are different, and they largely share the same distribution area. An example of species that share the same distribution and that exhibit clear morphological similarities, and which as such were resolved as sister species in both analyses, includes and . Identification of Palaearctic based on COI barcodes is in most cases possible, since shared haplotypes were recorded only for and .
Figure 5.

Strict consensus cladogram of COI sequences for Palaearctic taxa.

Figure 6.

Neighbour-Joining tree using the K2P-model of COI sequences for Palaearctc taxa.

Strict consensus cladogram of COI sequences for Palaearctic taxa. Neighbour-Joining tree using the K2P-model of COI sequences for Palaearctc taxa. Intraspecific variation is notable between some of the recognized subspecies, both among our own samples and those downloaded from GB. The intraspecific variation can partly be explained by morphologically clearly distinct subspecies, such as those of , or by specimens from widely different localities, such the different specimens of (sample FJ663418 from Russia, FJ663421 from Kazakhstan, HQ004297 from Romania and MZH_JL35 and MZH_JL44 from SW Transbaikalia). However, notable intraspecific variation also occurs within populations, such as with all samples originating from the same locality and date, but the limited sampling prevents conclusions on the reasons for this. It is apparent that the understanding of intraspecific variability of the COI barcode for is presently very limited. The combined COI data of our sequences and sequences downloaded from GB include species belonging to one additional subgenus, , represented by and . Only the subgenus , represented by , is well supported as distinct lineage. Yet, one specimen of (FJ663419) clustered together with () and (). The other subgenera were not resolved as clades according to present classification, in agreement with our results for the combined analysis. Our findings generally support COI as a species specific barcode for , but we also highlight the necessity of including multiple individuals of species in molecular barcoding studies. Problematic ‘cases’ of widely divergent barcodes or conflicting morphological and molecular ‘signals’ are found in most if not all barcoding studies, and this study makes no exception.
  13 in total

1.  Biological identifications through DNA barcodes.

Authors:  Paul D N Hebert; Alina Cywinska; Shelley L Ball; Jeremy R deWaard
Journal:  Proc Biol Sci       Date:  2003-02-07       Impact factor: 5.349

2.  An AFLP-based interspecific linkage map of sympatric, hybridizing Colias butterflies.

Authors:  Baiqing Wang; Adam H Porter
Journal:  Genetics       Date:  2004-09       Impact factor: 4.562

3.  Complete DNA barcode reference library for a country's butterfly fauna reveals high performance for temperate Europe.

Authors:  Vlad Dinca; Evgeny V Zakharov; Paul D N Hebert; Roger Vila
Journal:  Proc Biol Sci       Date:  2010-08-11       Impact factor: 5.349

Review 4.  The unholy trinity: taxonomy, species delimitation and DNA barcoding.

Authors:  Rob DeSalle; Mary G Egan; Mark Siddall
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2005-10-29       Impact factor: 6.237

5.  High mitochondrial diversity in geographically widespread butterflies of Madagascar: a test of the DNA barcoding approach.

Authors:  Marjorie C Linares; Iván D Soto-Calderón; David C Lees; Nicola M Anthony
Journal:  Mol Phylogenet Evol       Date:  2008-11-21       Impact factor: 4.286

6.  Limited performance of DNA barcoding in a diverse community of tropical butterflies.

Authors:  Marianne Elias; Ryan I Hill; Keith R Willmott; Kanchon K Dasmahapatra; Andrew V Z Brower; James Mallet; Chris D Jiggins
Journal:  Proc Biol Sci       Date:  2007-11-22       Impact factor: 5.349

7.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods.

Authors:  Koichiro Tamura; Daniel Peterson; Nicholas Peterson; Glen Stecher; Masatoshi Nei; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2011-05-04       Impact factor: 16.240

8.  DNA barcoding Central Asian butterflies: increasing geographical dimension does not significantly reduce the success of species identification.

Authors:  Vladimir A Lukhtanov; Andrei Sourakov; Evgeny V Zakharov; Paul D N Hebert
Journal:  Mol Ecol Resour       Date:  2009-02-25       Impact factor: 7.090

9.  DNA barcodes and cryptic species of skipper butterflies in the genus Perichares in Area de Conservacion Guanacaste, Costa Rica.

Authors:  John M Burns; Daniel H Janzen; Mehrdad Hajibabaei; Winnie Hallwachs; Paul D N Hebert
Journal:  Proc Natl Acad Sci U S A       Date:  2008-04-24       Impact factor: 11.205

10.  Genomic outposts serve the phylogenomic pioneers: designing novel nuclear markers for genomic DNA extractions of lepidoptera.

Authors:  Niklas Wahlberg; Christopher West Wheat
Journal:  Syst Biol       Date:  2008-04       Impact factor: 15.683

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.