Literature DB >> 33365629

Mitochondrial genomes of the bird genus Piranga: rates of sequence evolution, and discordance between mitochondrial and nuclear markers.

Luke C Campillo1,2, Kevin J Burns3, Robert G Moyle1, Joseph D Manthey1,4.   

Abstract

We report the characteristics of the mitochondrial genomes of 22 individuals in the bird genus Piranga, including all currently recognized species in the genus (n = 11). Elements follow the standard avian mitogenome series, including two ribosomal RNA (rRNA) genes, 22 transfer RNA (tRNA) genes, 13 protein coding genes, and the mitochondrial control region. Excluding tRNA sequences, sequence divergence rate was lowest in rRNA genes and highest in genes encoding NADH (specifically ND1, ND2, ND3) and the control region. Gene trees assembled from 16 elements (non-tRNAs) varied greatly in topological concordance compared to the recognized species tree (based on thousands of nuclear loci), with no one gene tree precisely recovering all relationships in the genus. We also investigated patterns of concordance between the mitogenome tree and the nuclear species tree and found some discrepancies. Across non-tRNA gene trees (n = 16), the species tree topology was recovered by as few as three elements at a particular node and complete concordance (i.e. 16/16 gene trees matched the species tree topology) was recovered at only one node. We found mitochondrial gene regions that are often used in vertebrate systematics (e.g. CytB, ND2) recovered nearly the exact same topology as the nuclear species tree topology.
© 2019 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.

Entities:  

Keywords:  Mito-nuclear discordance; mitogenome; tanager

Year:  2019        PMID: 33365629      PMCID: PMC7687373          DOI: 10.1080/23802359.2019.1637286

Source DB:  PubMed          Journal:  Mitochondrial DNA B Resour        ISSN: 2380-2359            Impact factor:   0.658


Introduction

Mitochondrial DNA (mtDNA) has long been used to elucidate phylogenetic relationships across the tree of life. The ubiquity of mitochondrial sequence data in contemporary systematic research, ranging from single markers (Burbrink et al. 2016) to complete mitogenomes (Nguyen and Ho 2016), indicates the continued and pervasive utility of these markers. However, the distinctive transmission of mitochondrial genes (i.e. maternal inheritance and faster rates of fixation), the unique evolutionary history of nuclear genes and gene tree discordance (i.e. gene tree topologies different from one another or gene trees incongruous with the species phylogeny), due to incomplete lineage sorting (ILS) or deep coalescence (Maddison 1997), have proven to be particularly problematic for phylogenetic reconstruction (McGuire et al. 2007). Despite the recognized inherent problems that accompany phylogenetic reconstruction using mitochondrial and/or nuclear genes (reviewed by Toews and Brelsford 2012), mtDNA remains a valuable tool in any systematic toolkit. In the bird genus Piranga, several studies have used mtDNA to investigate phylogenetic relationships (Burns 1998; Barker et al. 2015), but reconstructions varied in topology and nodal support depending on the mtDNA genes used (CytB or CytB + ND2, respectively). Recently, we used restriction-site associated DNA sequencing (RAD-seq; Miller et al. 2007) and target capture of ultraconserved elements (UCEs; Faircloth et al. 2012) to resolve the phylogeny of the genus, producing topologically identical species tree reconstructions with high support across nodes for both sequencing methods (Manthey et al. 2016), but different from previous, mtDNA-based phylogenies (i.e. Burns 1998; Barker et al. 2015). Due to inherent features of target-capture sequencing methods, a by-product of sequencing UCEs is the potential to recover the full mitochondrial genome (mitogenome). Here, we utilize published target-capture sequence data to recover a set of complete mitogenomes for the genus Piranga to investigate patterns of concordance among individual mitochondrial gene trees, the mitogenome tree, and the nuclear species tree.

Methods

We used published Illumina sequence data (Manthey et al. 2016; NCBI SRA BioProject PRJNA296706) to assemble mitochondrial genomes from 22 individuals in the genus Piranga and a single outgroup individual. Full details on the specimens used in this study, including collection locality and tissue accession numbers can be found in Manthey et al. (2016). Here, we list species names and their associated voucher specimen numbers: P. bidentata (MZFC 17737, MZFC 19257), P. erythrocephala (FMNH 343370, FMNH 343371), P. flava (KU 90809, LSU B15408), P. hepatica (KU 4970, KU 9084), P. leucoptera (FMNH 481795, LSU B7783), P. ludoviciana (SDSU 2385, SDSU 2650), P. lutea (KU 89864, LSU B5400), P. olivacea (KU 2699, KU 4672), P. roseogularis (KU 2049, KU 2141), P. rubra (KU 26572, KU 7046), P. rubriceps (LSU B265, LSU B34818), and Cardinalis cardinalis (KU 21828). All mitogenomes are accessioned in NCBI’s GenBank: MH700631-MH700653. Bioinformatics were initially performed for the UCE project (Manthey et al. 2016), utilizing the PHYLUCE software package (Faircloth 2015) and Trinity (version: rnaseq_r2013_08_14; (Grabherr et al. 2011) to assemble contigs for each individual. Mitochondrial genomes, usually the longest one or two contigs of a UCE assembly, were extracted, aligned, and annotated with the guidance of the mitochondrial genome of one Cardinalis cardinalis individual (GenBank #NC025618) using Geneious v9.1.4 (https://www.geneious.com). For each of the elements (non-tRNA mitochondrial elements in Table 1), we estimated gene trees in RAxML with the GTR + Γ substitution model (Silvestro and Michalak 2011; Stamatakis 2014). Gene tree support was assessed with 200 rapid bootstrap replicates. All gene trees and phylip files have been uploaded to FigShare (10.6084/m9.figshare.6807386; 10.6084/m9.figshare.8141819, respectively). We scored for topological concordance between individual gene trees and the concatenated mitogenome tree (also produced in RAxML, with 500 rapid bootstrap replicates) using the program PhyParts (Smith et al. 2015). We did this analysis twice, first with no limit on bipartition support (i.e. bootstrap value could be 0 on any given node in any individual gene tree) and then requiring at least 50% bootstrap support at a given node in a gene tree. Lastly, we calculated difference in likelihood scores (ΔML) between gene trees inferred with topology as a free parameter (i.e. standard ML gene tree inference) and those where topology was constrained to match the mitogenome tree. This allowed a formal assessment of changes in likelihood scores across the different partitions (i.e. genes; Walker et al. 2018).

Results

The order of elements along each newly assembled mitochondrial genome was identical to the published C. cardinalis individual used as the reference genome. Mitochondrial genomes in Piranga ranged from 16,782 to 16,814 bp (mean = 16,804), with variation due to indels found in spacers or the mitochondrial control region. In protein-coding and rRNA genes, as well as the control region, nucleotide identity was conserved between 63 and 90% within Piranga (Table 1), and amino acid conservation in protein coding genes ranged between 79 and 98%. Nucleotide sequence and amino acid residue conservation were positively related (adjusted R2 = 0.688, p < .001). For protein-coding genes, observed rates of substitutions/site (Table 1) were not strongly correlated with the rates calculated from other songbird (Aves: Passeriformes) mitochondrial genomes (R2=0.10, p=.34; Nguyen and Ho 2016). For rRNAs (12S and 16S), we recovered greater than 90% identical nucleotides for each respective gene region. Among commonly used genes (CytB, COI, ND2), we recover much lower proportions of identical nucleotides across taxa (75.3, 77.1, and 63.6%, respectively), suggesting high mutation rates.
Table 1.

Genetic elements used for gene tree analyses, and associated statistics.

Gene regionAlign. length% Identical nucleotide% Pairwise identityΔML scoreNucl. Subs./SiteaVariable residuesbRelative ratebAdjusted R2
12s97890.497.61.6730.1090.3230.614
16s160490.096.91.0140.1160.3980.699
ATP668473.592.117.1470.30716/2271.0140.780
ATP816873.291.99.9050.2987/530.9140.609
CR124365.588.81.8080.4221.3440.698
CO1155177.192.82.5760.2629/5160.8880.843
CO268473.791.40.7320.30314/2271.1110.833
CO378475.692.33.2410.28111/2610.8690.729
CytB114375.392.64.9850.28733/380
ND197868.889.49.0050.35526/3251.3890.874
ND2103863.688.34.2160.39870/3461.3760.833
ND335168.190.03.2610.36816/1161.4950.828
ND4137870.491.211.0350.33145/4591.1240.908
ND4L29773.192.63.5250.2968/990.8830.622
ND5181870.691.32.2290.33284/6050.9740.885
ND651968.090.89.0450.37233/1720.9880.764

Number of variable and total amino acid residues in the protein open reading frame.

Relative rates of sequence evolution for each gene relative to CytB, based on regression analyses of pairwise divergence between all samples and associated R2 value.

Genetic elements used for gene tree analyses, and associated statistics. Number of variable and total amino acid residues in the protein open reading frame. Relative rates of sequence evolution for each gene relative to CytB, based on regression analyses of pairwise divergence between all samples and associated R2 value. When comparing the mitogenome tree to the nuclear species tree, we found the RAxML mitogenome tree (Figure 1) was topologically identical to the nuclear species tree, with all but one node having high (>96%) bootstrap support (P. rubra sister to P. lutea + P. hepatica + P. flava clade had 57% BS support). When comparing each estimated gene tree from the 16 non-tRNA elements to the concatenated mitogenome tree, we found that no single gene was topologically identical to the mitogenome tree. The gene tree most concordant with the nuclear species tree and mitogenome tree topology was the Control Region (concordant at 9 of 10 nodes; Figure 1(A)), with a handful of other genes concordant at eight nodes (CO1, CO2, 16S, ND5, and CytB). We found the difference in likelihood scores between the constrained and unconstrained gene trees to be most similar for CO2 (ΔML = 0.732; Table 1), and greatest for ATP6 (ΔML = 17.147).
Figure 1.

Maximum likelihood mitogenome tree, which is topologically identical to the nuclear DNA species tree from Manthey et al. (2016), showing concordance of individual gene trees at each node. Concordance between the mitogenome tree and each gene tree is shown (A) without any limit on bipartition support (BS ≥ 0%) and B) with a lower limit on support necessary to be included in the comparison (BS ≥ 50%). The black arrow shows the node that had low bootstrap support in the concatenated mitogenome tree (BS = 57%).

Maximum likelihood mitogenome tree, which is topologically identical to the nuclear DNA species tree from Manthey et al. (2016), showing concordance of individual gene trees at each node. Concordance between the mitogenome tree and each gene tree is shown (A) without any limit on bipartition support (BS ≥ 0%) and B) with a lower limit on support necessary to be included in the comparison (BS ≥ 50%). The black arrow shows the node that had low bootstrap support in the concatenated mitogenome tree (BS = 57%).

Discussion

We investigated patterns of topological concordance with the nuclear species tree identified using thousands of loci (Manthey et al. 2016) but failed to recover a single mitochondrial element that was completely concordant with the species tree (which was topologically identical to the concatenated mitogenome tree). Support varied across the dataset, with as few as three gene trees showing the same branching pattern at a given node, while all genes showed the same bipartition at a single node (Figure 1(A)). The number of parsimony informative sites (i.e. phylogenetic informativeness) may cause unresolved nodes in mtDNA gene trees. However, low bootstrap support at particular nodes on individual gene trees may give a false impression of topological concordance when in fact the relationship is relatively unsupported. In fact, many of the individual gene trees had low bootstrap support at internal nodes, and therefore, it is likely they did not greatly contribute to the overall mitogenome tree (Figure 1(B)). Our estimate of the relative intergenic divergence rate of ND2 to CytB (1.37; Table 1) for Piranga is qualitatively similar to the relative intrageneric rate of divergence for a subset of New World birds (1.27) by Smith and Klicka (2010) for the same genetic markers. Nucleotide conservation across the mitogenome varied by gene. For rRNAs, we recovered greater than 90% identical nucleotides, which was not surprising, given the necessary role rRNA plays in protein synthesis. We also found that differences in the likelihood scores between constrained and unconstrained gene trees for the two rRNAs were the highest observed. Among genes commonly used in avian systematics (CytB, COI, ND2), we recover lower proportions of identical nucleotides across taxa (75.3, 77.1, and 63.6%, respectively). Sequence variation is necessary for a gene region to be phylogenetically informative, without variability reconstruction of evolutionary history would fail. Since CytB, COI, and ND2 all have increased rates of substitution, they should have improved utility for phylogenetic reconstruction. Within the mitogenome, we find evidence for variable relationships depending on which region of the mitogenome is considered. However, regardless of the underlying cause of gene tree discordance within the mitogenome, when considering the entire mitogenome, as one linkage group, we recovered an identical topology to the nuclear species tree topology. Altogether, these findings point to a broader issue in phylogenetics, in which researchers erroneously assume different regions of the mitogenome represent separate, independent ‘genes.’ This study, along with published examples of mito-nuclear discordance and discordance between individual mtDNA regions, reiterates the fact that not recognizing the mitogenome as a single linkage group could lead to a multitude of phylogenetic and systematic errors.
  11 in total

Review 1.  The biogeography of mitochondrial and nuclear discordance in animals.

Authors:  David P L Toews; Alan Brelsford
Journal:  Mol Ecol       Date:  2012-06-27       Impact factor: 6.185

2.  PHYLUCE is a software package for the analysis of conserved genomic loci.

Authors:  Brant C Faircloth
Journal:  Bioinformatics       Date:  2015-11-02       Impact factor: 6.937

3.  Rapid and cost-effective polymorphism identification and genotyping using restriction site associated DNA (RAD) markers.

Authors:  Michael R Miller; Joseph P Dunham; Angel Amores; William A Cresko; Eric A Johnson
Journal:  Genome Res       Date:  2006-12-22       Impact factor: 9.043

4.  Asynchronous demographic responses to Pleistocene climate change in Eastern Nearctic vertebrates.

Authors:  Frank T Burbrink; Yvonne L Chan; Edward A Myers; Sara Ruane; Brian Tilston Smith; Michael J Hickerson
Journal:  Ecol Lett       Date:  2016-10-25       Impact factor: 9.492

5.  Analyzing Contentious Relationships and Outlier Genes in Phylogenomics.

Authors:  Joseph F Walker; Joseph W Brown; Stephen A Smith
Journal:  Syst Biol       Date:  2018-09-01       Impact factor: 15.683

6.  Ultraconserved elements anchor thousands of genetic markers spanning multiple evolutionary timescales.

Authors:  Brant C Faircloth; John E McCormack; Nicholas G Crawford; Michael G Harvey; Robb T Brumfield; Travis C Glenn
Journal:  Syst Biol       Date:  2012-01-09       Impact factor: 15.683

7.  Mitochondrial introgression and incomplete lineage sorting through space and time: phylogenetics of crotaphytid lizards.

Authors:  Jimmy A McGuire; Charles W Linkem; Michelle S Koo; Delbert W Hutchison; A Kristopher Lappin; David I Orange; Julio Lemos-Espinal; Brett R Riddle; Jef R Jaeger
Journal:  Evolution       Date:  2007-10-15       Impact factor: 3.694

8.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

9.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

10.  Analysis of phylogenomic datasets reveals conflict, concordance, and gene duplications with examples from animals and plants.

Authors:  Stephen A Smith; Michael J Moore; Joseph W Brown; Ya Yang
Journal:  BMC Evol Biol       Date:  2015-08-05       Impact factor: 3.260

View more
  1 in total

1.  Positive selection over the mitochondrial genome and its role in the diversification of gentoo penguins in response to adaptation in isolation.

Authors:  D Noll; F Leon; D Brandt; P Pistorius; C Le Bohec; F Bonadonna; P N Trathan; A Barbosa; A Raya Rey; G P M Dantas; R C K Bowie; E Poulin; J A Vianna
Journal:  Sci Rep       Date:  2022-03-08       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.