Literature DB >> 28985301

The Trouble with MEAM2: Implications of Pseudogenes on Species Delimitation in the Globally Invasive Bemisia tabaci (Hemiptera: Aleyrodidae) Cryptic Species Complex.

Wee Tek Tay1, Samia Elfekih1, Leon N Court1, Karl H J Gordon1, Hélène Delatte2, Paul J De Barro3.   

Abstract

Molecular species identification using suboptimal PCR primers can over-estimate species diversity due to coamplification of nuclear mitochondrial (NUMT) DNA/pseudogenes. For the agriculturally important whitefly Bemisia tabaci cryptic pest species complex, species identification depends primarily on characterization of the mitochondrial DNA cytochrome oxidase I (mtDNA COI) gene. The lack of robust PCR primers for the mtDNA COI gene can undermine correct species identification which in turn compromises management strategies. This problem is identified in the B. tabaci Africa/Middle East/Asia Minor clade which comprises the globally invasive Mediterranean (MED) and Middle East Asia Minor I (MEAM1) species, Middle East Asia Minor 2 (MEAM2), and the Indian Ocean (IO) species. Initially identified from the Indian Ocean island of Réunion, MEAM2 has since been reported from Japan, Peru, Turkey and Iraq. We identified MEAM2 individuals from a Peruvian population via Sanger sequencing of the mtDNA COI gene. In attempting to characterize the MEAM2 mitogenome, we instead characterized mitogenomes of MEAM1. We also report on the mitogenomes of MED, AUS, and IO thereby increasing genomic resources for members of this complex. Gene synteny (i.e., same gene composition and orientation) was observed with published B. tabaci cryptic species mitogenomes. Pseudogene fragments matching MEAM2 partial mtDNA COI gene exhibited low frequency single nucleotide polymorphisms that matched low copy number DNA fragments (<3%) of MEAM1 genomes, whereas presence of internal stop codons, loss of expected stop codons and poor primer annealing sites, all suggested MEAM2 as a pseudogene artifact and so not a real species.
© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  NUMT; high throughput -sequencing; invasive pest; mitogenome; pseudogene

Mesh:

Substances:

Year:  2017        PMID: 28985301      PMCID: PMC5647793          DOI: 10.1093/gbe/evx173

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

The use of single-gene based DNA barcoding to resolve species boundaries for cryptic species presents a special challenge. The resolution of such morphologically identical species based on a single gene sequence alignment is only possible if that gene sequence is unambiguously correct and corresponds to what is expected in every case analyzed. For many such taxonomic exercises, mitochondrial genes and primarily the mitochondrial cytochrome oxidase I gene (mtDNA COI) barcode sequence have been selected (e.g., Alam etal. 2015; Leys etal. 2016). The same applies to other mitochondrial sequences such as the cytochrome oxidase II gene (mtDNA COII) (e.g., Sunnucks etal. 2000) and cytochrome b gene (cyt b) (e.g., Mundy etal. 2000), as well as nuclear DNA markers (i.e., nuclear 18S and 28S rRNA gene regions, e.g., Jorger and Schrodl 2013; microsatellite DNA markers, e.g., Cheng etal. 2013). One significant challenge faced in the delimitation of otherwise indistinguishable species using mtDNA COI data sets is the possible presence of nuclear mitochondrial DNA pseudogenes (NUMTs) (Bensasson etal. 2001). PCR products derived from NUMTs are often a result of poor PCR primer efficacies (Moulton etal. 2010; Lobo etal. 2013; Tay etal. 2017). When they are treated as authentic mtDNA genes, failure to identify them is likely to lead to inaccurate phylogenetic inferences due to differences in divergence times between NUMTs and genuine mtDNA genes (Sunnucks etal. 2000; Bensasson etal. 2001). Numerous methodologies are available to assist with the identification of NUMTs (reviewed in Bensasson etal. 2001) and include identifying non-functionality of the gene fragment through the presence of stop codons within protein coding gene regions. Despite this, NUMTs can be overlooked (e.g., Boykin etal. 2007; Karut etal. 2015) when stop codons are found outside the target gene region and this is a particular challenge when the target sequences are short. Arthropod pests of economic importance, such as those infesting stored grain products (Tay etal. 2016a), vectors of plant or animal pathogens, for example, cassava brown streak virus (Maruthi etal. 2005), tomato yellow leaf curl virus (Ghanim etal. 1998), blue tongue virus (Tabachnick 1996; De Liberato etal. 2005), and parasites, for example, varroa mites (Anderson and Trueman 2000), require accurate species identification in order to better understand population structure of these pests, the interpretation of disease outbreaks, detection of vector-host association (e.g., Tabachnick 1996), and incursion patterns (Tay etal. 2016b). Further, biosecurity preparedness (i.e., early detection) and incursion response strategies as part of national border protection, require unambiguous knowledge of species status (e.g., Armstrong and Ball 2005; Collins etal. 2012; Tay etal. 2016b). In the whitefly Bemisia tabaci pest species complex, numerous species belonging to at least 11 (De Barro etal. 2011) sister clades have been proposed based on the partial mtDNA COI gene, the 657 bp 3′ end of the gene (Boykin etal. 2013; Lee etal. 2013). Of these clades, the African/Middle East/Asia Minor clade is of special interest, as it contains the two most invasive members of the complex, Mediterranean (MED, the “true” B. tabaci [Tay etal. 2012]) and Middle East Asia Minor 1 (MEAM1). This clade also contains two other species, Middle East Asia Minor 2 (MEAM2) and Indian Ocean (IO). Whilst IO is not known to be invasive, MED and MEAM1 are globally wide-spread and vectors of highly damaging plant viruses (reviewed in De Barro etal. 2011), whereas MEAM2 has increasingly been detected across globally disparate locations such as Japan (Ueda etal. 2009; AB308110), Peru (this study), Iraq (KX679576; collected in 2015), Turkey (Karut etal. 2015, sequences KK103B, KK104A, KK104B), and Egypt (FJ939600, FJ939602), since its initial detection in the Indian Ocean island of Réunion (Delatte etal. 2005; AJ550177). The incursion pathways of MED and MEAM1 are linked to the worldwide global trade in ornamental plants (Cheek and Macdonald 1994; Dalton 2006), however factors underlying the spread of MEAM2 are less certain although detection frequencies have increased in recent times since its initial report (Delatte etal. 2005). Molecular characterizations using the whole mitogenome have been carried out for only one of the four invasive clade species—that of MED, although other Bemisia species from the complex and Bemisia “JpL” species have also been reported (Baumann 2004; Wang etal. 2013; Tay etal. 2016, 2017). In this study, we characterized the complete mitogenome of MEAM2 using high-throughput sequencing methods, and in the process ascertained the molecular genetic basis for the species delimitation of MEAM2. This effort also enabled the molecular characterization of two remaining “invasive clade” B. tabaci cryptic species (i.e., MEAM1, IO) draft mitogenomes, as well as the draft mitogenome of the Australia B. tabaci (previously biotype “AN”, De Barro etal. 2011) to be characterized via the high-throughput sequencing method. We assessed and discussed the impact of NUMT on phylogenetic inferences on the cryptic B. tabaci species complex.

Materials and Methods

Bemisia tabaci Samples, gDNA Extraction, PCR, and NGS

Five individuals of Bemisia whiteflies from a single Peruvian population, collected on August 14, 2000 from Cañete Valley (GenBank KY951453, KY951454, KX234912, KX234913, KX234914), four from Ouagadougou, Burkina Faso (KX234908, KX234909, KX234910, KX234911), two Australian Bemisia whiteflies (mtDNA COI matched (100%) MEAM1 (DQ174535; Hsieh etal. 2006) from Bundaberg, Australia), and five from Réunion (KX234868, KX234869, KX234870, KX234871, KX234872) were analyzed via standard PCR and Sanger sequencing procedures (e.g., see Dinsdale etal. 2010). Sanger sequencing was carried out at the John Curtin School of Medical Research Biological Resource Facility at the National University of Australia, Canberra. Sanger sequence trace files were assembled using Staden Pregap4 and Gap4 programs (Staden etal. 2000), and species status determined using BlastN searches against the publicly available B. tabaci mtDNA COI database  (last accessed September 6, 2017). All genomic DNA (gDNA) extractions were performed using the Qiagen DNeasy Blood and Tissue kit (Cat. # 69506), including the optional RNase A treatment (Qiagen, Cat. # 19101). Individually extracted and purified gDNA samples were eluted in 25 µl of Qiagen buffer EB (Cat. # 19086) and quantified using a Qubit 2.0 Fluorometer and the Qubit dsDNA High Sensitivity DNA Assay kit (ThermoFisher Scientific, Cat # Q32854). The gDNA from three of the five Peruvian whitefly specimens (KX234913, KX234914, KY951454) were each made into separate NGS gDNA libraries using the protocol of Tay etal. (2016) and sequenced using the Illumina MiSeq sequencer. To better understand the potential genomic origins of MEAM2 COI haplotypes and hence its species status, we further prepared separate Illumina MiSeq libraries of a single individual from each of the three species (i.e., MED, IO, MEAM1) known to be also present in Réunion Island (Delatte etal. 2005). These included one Réunion individual from an “IO” population, one Burkina Faso individual from a MED population, and one MEAM1 individual from an Australian population. The high throughput sequencing gDNA library preparation method followed the Illumina Nextera XT DNA library preparation guide (Part # 15031942 Rev. D, September 2014). Briefly, 1.5 ng samples of gDNA were tagmented (i.e., tagged and fragmented by the Nextera XT transposome), followed by limited PCR cycles (to add unique dual index barcodes for sample tracking and Illumina adapters for cluster formation). The amplified libraries were sized selected and purified using the Beckman Coulter AMPure XP system (Bead to DNA ratio of 0.7) and eluted in 28 µl of Qiagen buffer EB (Cat. # 19086). The purified libraries were then quantified by Qubit dsDNA High Sensitivity DNA Assay as above, their average fragment size estimated using the Agilent 2200 Tapestation and High Sensitivity D1000 screentape (Cat # 5067-5585) and then normalized to a final concentration of 4 nM. The Nextera XT gDNA libraries were pooled, diluted to a final concentration of 11 pM (with 5% spike-in of Illumina Phi X Control v3 library [Cat # FC-110-3001]) and sequenced on the Illumina MiSeq sequencer. The draft mitogenomes were individually assembled using the Asia I mitogenome (GenBank KJ778614) of Tay etal. (2016) as the reference genome within the genomic analysis software Geneious 8.1.9 (Biomatters Ltd., NZ). To confirm the circular nature of the mitogenomes we individually assembled the intergenic region between the NAD2 and COI genes, starting with either the NAD2 or the COI gene and allowing the assembly to bridge across to the adjacent gene.

Mitogenome Annotation and Identification of NUMTs

Assembled mitogenomes were annotated using MITOS (Bernt 2013) prior to manual readjustment within Geneious 8.1.9 to identify potential stop codons in all coding sequences (KY951447, KY951448, KY951449, KY951450, KY951451, KY951452). Assembled draft mitogenomes were reconfirmed for species identity by Blastn searches of the partial (657 bp) mtDNA COI gene region against the GenBank DNA database. To assess the impact of NUMTs in misidentification of MEAM1 as MEAM2, a Peruvian MEAM2 mtDNA COI sequence detected (KX234914), as well as published sequences (Delatte etal. 2005; Ueda etal. 2009; Karut etal. 2015); were used as template reference sequences and assessed for frequencies of SNPs detected at the respective genomic regions/nucleotide positions in the three species (i.e., MEAM1, MED, and IO) known to be present in countries that have also reported MEAM2 (e.g., Reunion, Turkey, Japan, Peru, and Iraq). We also visually identify MiSeq generated DNA fragments that uniquely matched SNP patterns of MEAM2 partial mtDNA COI regions to determine the effects on the amino acid translational processes.

Results and Discussion

Our results supported the notion that MEAM2 partial mtDNA COI sequences reported to-date are likely to be NUMTs. We also generated and characterized mitogenomes of four (MED, MEAM1, IO and AUS) B. tabaci cryptic species from single individuals, of which the complete mitogenomes of three species (MEAM1, IO and AUS) are here reported for the first time. Based on our initial Sanger sequencing, two individuals from the Australian collection were identified as MEAM1. However, the third individual analyzed via NGS from the same collection was identified as belonging to a different member of the complex, AUS (657 bp mtDNA COI partial gene matched 100% sequence identity to Bundaberg, Australia [GU086328]), indicating that the collection consisted of both MEAM1 and AUS. For the randomly selected Réunion individual as well as the Burkina Faso individual, we obtained the expected mitogenomes of IO (657 bp mtDNA COI partial gene matched 100% sequence identity of a Madagascan IO [AJ550171]) and MED (partial mtDNA COI gene (657 bp) shared 100% sequence identity to MED from Sudan [DQ133378]), respectively. From the three Peruvian individuals that were expected to be MEAM2 (i.e., KY951454; KX234913, and KX234914) on the basis of the Sanger sequence derived mtDNA COI partial gene, we instead obtained MEAM1 mitogenomes, as confirmed via partial mtDNA COI gene comparison with published sequences (KY951452 and KX234913 [nt782-1, 439] = 100% sequence identity to MEAM1 from Arizona, USA [HM070411]; and KX234914 [nt782-1, 439] = 99% sequence identity to MEAM1 from, e.g., Florida, USA [GU086340]). MEAM1 had previously been argued to represent a separate Bemisia species from B. tabaci based on behavioral, morphological, and genetic differences (e.g., Bellows etal. 1994; Perring etal. 1992, 1993) and was subsequently named B. argentifolii (Bellows etal. 1994). Thao etal. (2004) provided partial regions (i.e., Cyt b-COIII, 4, 796 bp; GenBank AY521257) of the B. argentifolii mitogenome, however the complete mitogenome of MEAM1/B. argentifolii had not been published. Pairwise sequence comparisons between AY521257 and our reported MEAM1 mitogenomes identified high levels of sequence similarity (99.82% identity) with the corresponding B. argentifolii mitogenome region, whereas similarity between MED, IO, and AUS mitogenome regions were much lower, at 92.52%, 91.51%, and 80.16% sequence identity, respectively (data not shown). Sequencing of these gDNA libraries generated between 2.15 and 28.96 million paired-end (PE) sequences (table 1), from which 10,738 to 131,328 PE sequences were assembled to generate complete mitogenomes in IO, MEAM1, MED, and AUS (table 1). We identified low copy genome fragments through the Illumina MiSeq sequencing platform in MEAM1 individuals that matched unique MEAM2 SNPs (fig. 1). Fragments of gDNA representing the MEAM2 partial mtDNA COI haplotypes also identified the presence of premature stop codons within these low copy number DNA fragments in regions of the mtDNA COI gene, as well as the loss of the expected stop codon at the C-terminal region of the mtDNA COI gene (fig. 1). Corresponding SNP frequencies across DNA fragments generated from high-throughput sequencing, and that potentially represented NUMTs within the 657 bp mtDNA COI partial gene region, were detected at very low frequencies (supplementary table 1, Supplementary Material online), again supporting the notion that NUMTs which had resulted in the misidentification of “MEAM2” sequences, were present as low copy DNA fragments. At the corresponding nucleotide positions between a randomly selected MEAM1 sequence from GenBank and compared against the MEAM1 DNA fragments generated from the high-throughput sequencing library, SNPs detected at nucleotide positions that corresponded to those in MEAM2 were generally observed at highest frequencies (supplementary tables 1, Supplementary Material online). For MEAM2 when compared with MED and IO, there were no particular SNP frequency patterns (supplementary tables 1, Supplementary Material online). Contrasting this, SNPs within suspected MEAM2 sequences (i.e., Japan AB308110, the Peruvian MEAM2 sequence (KX234913), four Turkish MEAM2 haplotypes (Karut etal. 2015) were consistently of the lower frequencies (supplementary tables 1, Supplementary Material online). Characterization of the MEAM1 mitogenomes therefore supported the hypothesis that the MEAM2 sequences were likely associated with low copy DNA fragments from the MEAM1 genome and were most likely either PCR artifacts such as DNA polymerase-introduced errors or nuclear mitochondrial DNA (e.g., NUMTs).
Table 1

Summary Statistics of MiSeq Sequence Data from Bemisia tabaci Cryptic Species of Indian Ocean (IO, KY951448), Mediterranean (MED, KY951447), Middle East-Asia Minor 1 (MEAM1, KY951449, KY951450, KY951452), and Australia (AUS, KY951451)

SpeciesTotal PE-seqMTG PE-seqAverage COI Coverage (±s.d.)Mitogenome LengthsbGenBank
MEAM16,514,26013,986166.9 ± 16.4 s.d.15,666KY951450
MEAM128,964,958131,328990.3 ± 163.4 s.d15,531KY951449
MEAM16,663,49012,17684.0 ± 21.0 s.d.15,526KY951452
MED2,157,71611,380126.7 ± 28.5 s.d.15,631KY951447
IO3,842,61610,738118.3 ± 22.1 s.d.15,626KY951448
AUS4,981,18215,438173.9 ± 25.3 s.d.15,686KY951451
MEDN/AN/AN/A15,632JQ906700
ASIA IN/AN/AN/A15,210KJ778614
ASIA II-7N/AN/AN/A15,515KX714967

The overall published draft mtDNA genomes of B. tabaci cryptic species ranged between 15,210 in B. tabaci Asia I (KJ778614) to 15,686 in B. tabaci AUS (KY951451).

Mitogenome lengths from this study are putative due to the difficulty of assembling complete mitogenomes based on short read DNA sequences as obtained from the Illumina MiSeq sequencing method. N/A (not applicable)—these are either from published data or not available. Average COI coverage information included average sequence reads across the whole mtDNA COI gene and standard deviation (s.d.), as calculated using Geneious version 8.1.9.

. 1.

—Examples of sequence alignments between Bemisia tabaci “Peru” MEAM1 mtDNA COI haplotype gene region, published “MEAM2” mtDNA COI gene region, and NGS candidate NUMT sequences identified from KX234913, KY951449, and the KY951452 individuals. (A) C-terminal region of a Peruvian MEAM1 B. tabaci (KY951450) mtDNA COI gene region showing putative stop codon (black shaded “*” symbol), as well as the B. tabaci “MEAM2” haplotype from Japan (AJ550177), and examples NGS candidate NUMT sequences from the Peruvian B. tabaci MEAM1 (KX234914) with matching SNPs (indicated by red boxes) that matched the Japan MEAM2 haplotype (AJ550177). Deletion of a “T” base (indicated by red triangle) resulted in a frameshift mutation and the loss of the putative mtDNA COI gene stop codon. (B) Internal stop codons (at positions 904S906) detected in candidate NUMT sequences (KY951452_NUMT-01, KY951452_NUMT-02) from the Peruvian MEAM1 individual (KY951452) MiSeq generated DNA fragments when compared with the Peruvian B. tabaci MEAM1 (KX234914) mtDNA COI gene. Stop codons detected in NUMT sequences were the result of a single nucleotide base change at position 906 from a “T” to an “A.” Candidate NUMT sequences were also compared with the Peruvian MEAM2 haplotype (KY951454) obtained via PCR and Sanger sequencing of the same MEAM1 individual (KY951452). Nucleotide positions based on the mtDNA COI gene are provided. Amino acid translation based on the invertebrate mitochondrial genetic codes (Translational Table_5). Significant changes between amino acids are highlighted.

Summary Statistics of MiSeq Sequence Data from Bemisia tabaci Cryptic Species of Indian Ocean (IO, KY951448), Mediterranean (MED, KY951447), Middle East-Asia Minor 1 (MEAM1, KY951449, KY951450, KY951452), and Australia (AUS, KY951451) The overall published draft mtDNA genomes of B. tabaci cryptic species ranged between 15,210 in B. tabaci Asia I (KJ778614) to 15,686 in B. tabaci AUS (KY951451). Mitogenome lengths from this study are putative due to the difficulty of assembling complete mitogenomes based on short read DNA sequences as obtained from the Illumina MiSeq sequencing method. N/A (not applicable)—these are either from published data or not available. Average COI coverage information included average sequence reads across the whole mtDNA COI gene and standard deviation (s.d.), as calculated using Geneious version 8.1.9. —Examples of sequence alignments between Bemisia tabaci “Peru” MEAM1 mtDNA COI haplotype gene region, published “MEAM2” mtDNA COI gene region, and NGS candidate NUMT sequences identified from KX234913, KY951449, and the KY951452 individuals. (A) C-terminal region of a Peruvian MEAM1 B. tabaci (KY951450) mtDNA COI gene region showing putative stop codon (black shaded “*” symbol), as well as the B. tabaci “MEAM2” haplotype from Japan (AJ550177), and examples NGS candidate NUMT sequences from the Peruvian B. tabaci MEAM1 (KX234914) with matching SNPs (indicated by red boxes) that matched the Japan MEAM2 haplotype (AJ550177). Deletion of a “T” base (indicated by red triangle) resulted in a frameshift mutation and the loss of the putative mtDNA COI gene stop codon. (B) Internal stop codons (at positions 904S906) detected in candidate NUMT sequences (KY951452_NUMT-01, KY951452_NUMT-02) from the Peruvian MEAM1 individual (KY951452) MiSeq generated DNA fragments when compared with the Peruvian B. tabaci MEAM1 (KX234914) mtDNA COI gene. Stop codons detected in NUMT sequences were the result of a single nucleotide base change at position 906 from a “T” to an “A.” Candidate NUMT sequences were also compared with the Peruvian MEAM2 haplotype (KY951454) obtained via PCR and Sanger sequencing of the same MEAM1 individual (KY951452). Nucleotide positions based on the mtDNA COI gene are provided. Amino acid translation based on the invertebrate mitochondrial genetic codes (Translational Table_5). Significant changes between amino acids are highlighted. A further piece of supporting evidence that MEAM2 belonged to NUMT was from the recently assembled MEAM1 draft genome (Chen etal. 2016), in which an unknown protein coding gene predicted to be cytochrome c oxidase subunit 1-like mRNA (XM_019045089.1) was identified; it shared 99% sequence homologies with the Peruvian KX234914 MEAM2 partial COI gene. Within this COI-like-mRNA sequence four internal stop codons were identified and subsequently corrected (i.e., modifications involving substitutions of four bases at four genomic stop codons were introduced to the sequence of the model RefSeq protein relative to its source genomic sequence so as to represent the inferred coding sequences [GenBank Locus XM_019045089, 1,632 bp mRNA linear INV November 9, 2016; accessed January 5, 2017]). NUMTs are widespread in all eukaryotic organisms, can both be difficult to detect and introduce bias in the estimation of species diversity and DNA barcoding analyses (reviewed in Hazkani-Covo etal. 2010). Our analysis therefore supported the presence of only three species (i.e., MED, MEAM1, and IO) within the current invasive B. tabaci clade (Asia/Middle East/Asia Minor), and indicated that MEAM2 was a NUMT artifact. With increasing molecular characterization of global B. tabaci cryptic species complex, new species may be identified which could alter the current B. tabaci cryptic species phylogeny and also ultimately the number of species within the invasive B. tabaci clade. Our efforts to understand species composition and to ascertain the spread of invasive B. tabaci based on limited individuals have initially identified MEAM1 in the Australian samples from Bundaberg, Queensland. When additional individuals were sampled in high-throughput sequencing we instead obtained the native AUS species. From the Peruvian individuals initially identified as MEAM2 based on partial mtDNA COI gene using suboptimal primers, high-throughput sequencing have also resulted in MEAM1 mitogenomes being assembled instead. This exercise highlighted the importance of analyzing an adequate number of individuals from a collection and the impact suboptimal PCR primers can have on estimating species composition. These included misidentification of species composition complexity at the population level, and minimizing valuable resources being misdirected to monitor for incursion of nonexistent species, both of which can have profound impacts in terms of border biosecurity responses (e.g., either missing or misidentifying species of biosecurity concern). Several published studies (e.g., Delatte etal. 2005; Ueda etal. 2009; Karut etal. 2015) have used various non Bemisia “universal” PCR primers such as C1-J-2195/L2-N-3014; C1-J-2195/R-BQ-2819; C1-J-2195/tRNA-1576 (Simon etal. 1994; Frohlich etal. 1999; Tsagkarakou etal. 2007; Chu etal. 2011) and we suspect, factors such as reduced annealing site specificity (supplementary table 7, Supplementary Material online ) are contributing to the coamplification of NUMTs. Previous studies reporting the detection of MEAM2, using the C1-J-2195 forward non Bemisia “universal” primer was a common factor. This primer, originally named “COI-RLR”, was developed by Roehrdanz (1993) from the Apis mellifera COI gene (Crozier and Crozier 1993), and was shown to amplify some Lepidoptera, Coleoptera, Diptera, and Hymenoptera, but with unknown efficacies for Hemiptera (Simon etal. 1994) to which Bemisia belongs. Various Bemisia species’ complete mitogenomes are now available (MEAM1, IO, AUS (this study), MED (Wang etal. 2013), Asia I (Tay etal. 2016), AsiaII_7 (originally identified as B. emiliae, but synomised with B. tabaci in 1957, Tay etal. 2017). Direct comparison of primer-binding site efficacies between the C1-J-2195 24-mer oligonucleotide and the intended COI gene target site in these species identified poor primer efficacies that ranged between 33.3% and 45.8% for MEAM1, MED, IO, AUS, Asia I, and AsiaII_7 (supplementary table 7, Supplementary Material online). B. tabaci cryptic species mtDNA COI sequences as generated using the C1-J-2195 primer should therefore be treated with extra caution. The sequencing of full mitogenomes in B. tabaci whiteflies can be achieved from single adults or nymphs (Tay etal. 2016, 2017; this study) and will significantly contribute to development of B. tabaci species-specific primers, although standardization of PCR-primers would be of benefit to the B. tabaci research community (Elfekih etal. 2017). We have shown the consequence of pseudogenes on species delimitation within the B. tabaci cryptic complex, through direct and active searching of genomic fragments obtained from high-throughput sequencing against suspected NUMTs of the “MEAM2” haplotypes. Studies investigating the species status within the B. tabaci complex have, to-date, relied largely on the C1-J-2195 primer and have generated a large volume of haplotype data across the breadth of the B. tabaci complex. These haplotypes, currently >5,100 sequences (GenBank accessed March 17, 2017), will likely contain other unidentified pseudogenes. Future studies focusing on the phylogenetic relationships within the complex will need to be mindful of NUMTs and will require careful treatment of data so as to avoid over-interpretation of B. tabaci phylogeny and species status.

Supplementary Material

Supplementary data are available at Genome Biology and Evolution online. Click here for additional data file.
  38 in total

1.  The Staden package, 1998.

Authors:  R Staden; K F Beal; J K Bonfield
Journal:  Methods Mol Biol       Date:  2000

2.  Mitochondrial pseudogenes: evolution's misplaced witnesses.

Authors:  D Bensasson; D -X. Zhang; D L. Hartl; G M. Hewitt
Journal:  Trends Ecol Evol       Date:  2001-06-01       Impact factor: 17.712

3.  Sequence analysis of DNA fragments from the genome of the primary endosymbiont of the whitefly Bemisia tabaci.

Authors:  Linda Baumann; MyLo Ly Thao; C Joel Funk; Bryce W Falk; James C K Ng; Paul Baumann
Journal:  Curr Microbiol       Date:  2004-01       Impact factor: 2.188

Review 4.  DNA barcodes for biosecurity: invasive species identification.

Authors:  K F Armstrong; S L Ball
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2005-10-29       Impact factor: 6.237

5.  Investigation of the genetic diversity of an invasive whitefly (Bemisia tabaci) in China using both mitochondrial and nuclear DNA markers.

Authors:  D Chu; C S Gao; P De Barro; F H Wan; Y J Zhang
Journal:  Bull Entomol Res       Date:  2011-02-15       Impact factor: 1.750

Review 6.  Bemisia tabaci: a statement of species status.

Authors:  Paul J De Barro; Shu-Sheng Liu; Laura M Boykin; Adam B Dinsdale
Journal:  Annu Rev Entomol       Date:  2011       Impact factor: 19.686

7.  Enhanced primers for amplification of DNA barcodes from a broad range of marine metazoans.

Authors:  Jorge Lobo; Pedro M Costa; Marcos A L Teixeira; Maria S G Ferreira; Maria H Costa; Filipe O Costa
Journal:  BMC Ecol       Date:  2013-09-10       Impact factor: 2.964

8.  How to describe a cryptic species? Practical challenges of molecular taxonomy.

Authors:  Katharina M Jörger; Michael Schrödl
Journal:  Front Zool       Date:  2013-09-27       Impact factor: 3.172

9.  Is agriculture driving the diversification of the Bemisia tabaci species complex (Hemiptera: Sternorrhyncha: Aleyrodidae)?: Dating, diversification and biogeographic evidence revealed.

Authors:  Laura M Boykin; Charles D Bell; Gregory Evans; Ian Small; Paul J De Barro
Journal:  BMC Evol Biol       Date:  2013-10-18       Impact factor: 3.260

10.  Distribution and population genetic variation of cryptic species of the Alpine mayfly Baetis alpinus (Ephemeroptera: Baetidae) in the Central Alps.

Authors:  Marie Leys; Irene Keller; Katja Räsänen; Jean-Luc Gattolliat; Christopher T Robinson
Journal:  BMC Evol Biol       Date:  2016-04-12       Impact factor: 3.260

View more
  12 in total

1.  On species delimitation, hybridization and population structure of cassava whitefly in Africa.

Authors:  S Elfekih; W T Tay; A Polaszek; K H J Gordon; D Kunz; S Macfadyen; T K Walsh; S Vyskočilová; J Colvin; P J De Barro
Journal:  Sci Rep       Date:  2021-04-12       Impact factor: 4.379

2.  Morphology-Based Identification of Bemisia tabaci Cryptic Species Puparia via Embedded Group-Contrast Convolution Neural Network Analysis.

Authors:  Norman MacLeod; Roy J Canty; Andrew Polaszek
Journal:  Syst Biol       Date:  2022-08-10       Impact factor: 9.160

3.  Genome-wide analyses of the Bemisia tabaci species complex reveal contrasting patterns of admixture and complex demographic histories.

Authors:  S Elfekih; P Etter; W T Tay; M Fumagalli; K Gordon; E Johnson; P De Barro
Journal:  PLoS One       Date:  2018-01-24       Impact factor: 3.240

4.  African ancestry of New World, Bemisia tabaci-whitefly species.

Authors:  Habibu Mugerwa; Susan Seal; Hua-Ling Wang; Mitulkumar V Patel; Richard Kabaalu; Christopher A Omongo; Titus Alicai; Fred Tairo; Joseph Ndunguru; Peter Sseruwagi; John Colvin
Journal:  Sci Rep       Date:  2018-02-09       Impact factor: 4.379

5.  Updated mtCOI reference dataset for the Bemisia tabaci species complex.

Authors:  Laura M Boykin; Anders Savill; Paul De Barro
Journal:  F1000Res       Date:  2017-10-13

6.  An integrative approach to discovering cryptic species within the Bemisia tabaci whitefly species complex.

Authors:  Soňa Vyskočilová; Wee Tek Tay; Sharon van Brunschot; Susan Seal; John Colvin
Journal:  Sci Rep       Date:  2018-07-18       Impact factor: 4.379

7.  Comparative transcriptome analysis reveals genetic diversity in the endosymbiont Hamiltonella between native and exotic populations of Bemisia tabaci from Brazil.

Authors:  Bruno Rossitto De Marchi; Tonny Kinene; James Mbora Wainaina; Renate Krause-Sakate; Laura Boykin
Journal:  PLoS One       Date:  2018-07-27       Impact factor: 3.240

8.  KASP Genotyping as a Molecular Tool for Diagnosis of Cassava-Colonizing Bemisia tabaci.

Authors:  Everlyne N Wosula; Wenbo Chen; Massoud Amour; Zhangjun Fei; James P Legg
Journal:  Insects       Date:  2020-05-14       Impact factor: 2.769

9.  Distribution and phylogenetics of whiteflies and their endosymbiont relationships after the Mediterranean species invasion in Brazil.

Authors:  Letícia Aparecida de Moraes; Cristiane Muller; Regiane Cristina Oliveira de Freitas Bueno; Antônio Santos; Vinicius Henrique Bello; Bruno Rossitto De Marchi; Luís Fernando Maranho Watanabe; Julio Massaharu Marubayashi; Beatriz Rosa Santos; Valdir Atsushi Yuki; Hélio Minoru Takada; Danielle Ribeiro de Barros; Carolina Garcia Neves; Fábio Nascimento da Silva; Mayra Juline Gonçalves; Murad Ghanim; Laura Boykin; Marcelo Agenor Pavan; Renate Krause-Sakate
Journal:  Sci Rep       Date:  2018-10-01       Impact factor: 4.379

10.  Genetic Diversity of Bemisia tabaci (Gennadius) (Hemiptera: Aleyrodidae) Colonizing Sweet Potato and Cassava in South Sudan.

Authors:  Beatrice C Misaka; Everlyne N Wosula; Philip W Marchelo-d'Ragga; Trine Hvoslef-Eide; James P Legg
Journal:  Insects       Date:  2020-01-17       Impact factor: 2.769

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.