Literature DB >> 33281468

Sequence and organisation of the mitochondrial genome of Japanese Grosbeak (Eophona personata), and the phylogenetic relationships of Fringillidae.

Guolei Sun1, Chao Zhao1, Tian Xia1, Qinguo Wei1, Xiufeng Yang1, Shi Feng1, Weilai Sha1, Honghai Zhang1.   

Abstract

Mitochondrial DNA is a useful molecular marker for phylogenetic and evolutionary analysis. In the current study, we determined the complete mitochondrial genome of Eophona personata, the Japanese Grosbeak, and the phylogenetic relationships of E. personata and 16 other species of the family Fringillidae based on the sequences of 12 mitochondrial protein-coding genes. The mitochondrial genome of E. personata consists of 16,771 base pairs, and contains 13 protein-coding genes, 22 transfer RNA (tRNA) genes, 2 ribosomal RNA (rRNA) genes, and one control region. Analysis of the base composition revealed an A+T bias, a positive AT skew and a negative GC skew. The mitochondrial gene order and arrangement in E. personata was similar to the typical avian mitochondrial gene arrangement. Phylogenetic analysis of 17 species of Fringillidae, based on Bayesian inference and Maximum Likelihood (ML) estimation, showed that the genera Coccothraustes and Hesperiphona are closely related to the genus Eophona, and further showed a sister-group relationship of E. personata and E. migratoria. Guolei Sun, Chao Zhao, Tian Xia, Qinguo Wei, Xiufeng Yang, Shi Feng, Weilai Sha, Honghai Zhang.

Entities:  

Keywords:  Eophona personata; gene order; mitochondrial genome; phylogenetic analysis

Year:  2020        PMID: 33281468      PMCID: PMC7688617          DOI: 10.3897/zookeys.995.34432

Source DB:  PubMed          Journal:  Zookeys        ISSN: 1313-2970            Impact factor:   1.546


Introduction

(: ), commonly known as the Japanese Grosbeak, is a granivorous passerine with the adults reaching a size of ca. 23 cm. The species is mainly distributed in Far Eastern Asia including Eastern Siberia, Northeast China, North Korea, and Japan. Grosbeaks are migratory birds and move to South China during winter (Clement et al. 1993). Grosbeaks primarily feed on seeds, with preference for certain plants, and insects (Kominami 1987); for example, the seeds of and are favored during autumn and winter (Nimura 1993; Yoshikawa and Kikuzawa 2009). The mitochondrial genome (hereafter mitogenome) is a useful molecular marker for phylogenetic analysis, and is widely used in the evolutionary analysis of a variety of species (Anderson et al. 1981; Boore 1999; Sun et al. 2016). The animal mitogenome is usually a short, closed, circular, double stranded molecule, and comprises of 37 genes: 13 protein-coding genes (PCGs; , , , , , , , COI, COII, COIII, ATP6, ATP8, ), two rRNA genes (12S rRNA and 16S rRNA), and 22 tRNA genes. Compared to nuclear DNA, the mitogenome is a more conserved molecule, and is maternally transmitted (Wolstenholme 1992; Boore 1999; da Fonseca et al. 2008). With ca. 6000 species, passerines account for more than half of the total number of extant birds (Gill et al. 2020). Based on the analysis of hundreds of bird mitochondrial data, at least seven gene arrangements have been found (Zhou et al. 2014). Gene rearrangements in passerines are very common, and most reported passerines have the standard gene order consistent with chickens (Desjardins and Morais 1990). In addition, have at least four gene arrangement (Harrison et al. 2004). The different rearrangements involve the initial tandem duplication, and the partial loss of the segment containing the control region (CR) (Caparroz et al. 2018). The passerine family comprises ca. 50 genera and 230 species (Gill et al. 2020). The phylogenetic relationships between the species in the family and the superfamily are incompletely resolved (Van der Meij et al. 2005). Previous phylogenetic studies of have clarified several aspects of their relationships (Arnaiz-Villena et al. 2001; Van der Meij et al. 2005; Lerner et al. 2011; Zuccon et al. 2012; Tietze et al. 2013; Sangster et al. 2015; Zhao et al. 2016). The aim of this study is to determine the complete mitogenome of and provide a mitogenomic perspective on the phylogenetic relationships of and 16 other species of the family .

Materials and methods

Sample collection and DNA extraction

The specimen of , which had died due to poaching activities, was collected from Shenyang City, Liaoning Province, China, and was stored in the laboratory and then frozen to -80 °C before further processing and analysis. All experiments involving animals were approved by the Qufu Normal University Institutional Animal Care and Use Committee (Permit number: QFNU2018-010) and executed in accordance with the Guide to Animal Experiments of the Ministry of Science and Technology (Beijing, China). DNA was extracted from muscle tissue using the DNeasy Blood and Tissue Kit (Qiagen, Hilden, Germany) following the manufacturer’s protocol.

Cloning and sequencing of mitochondrial genome

Primers were designed based on mitogenome sequences of a sparrowhawk (, GenBank accession number KM360148) and Chinese Grosbeak (KX423959). MEGA5.1, Primer Premier 6.0, and the NCBI-Primer blast database (http://www.ncbi.nlm.nih.gov/BLAST) were used to determine the primers. All the amplified fragments were separated by gel electrophoresis and purified by Agarose Gel Extraction Kit (Qiagen). Purified fragments were cloned into PMD18-T vectors, and transformed into competent cells. Positive clones were identified by blue-white screening and sequenced. Bidirectional sequencing was conducted at Sangon Biotech (Shanghai) using the ABI 3730xl DNA Analyzer (PE Applied Biosystems, San Francisco, CA, USA).

Bioinformatics analysis and statistical procedures

The raw sequences were assembled using the software programs BioEdit (version 7.2. 5) and Chromas Pro 1.7.7 (Goodstadt and Ponting 2001). Genes were identified by aligning the identified sequences with the known sequence of the mitogenome of . The tRNA gene structure was predicted using Scan-SE 1.21 (http://lowelab.ucsc.edu/tRNAscan-SE) and ARWEN (http://130.235.46.10/ARWEN/) (Lowe and Eddy 1997). Phylogenetic analysis of 17 species (accession numbers are provided in Table 2) and (NC_015526) was conducted using Bayesian inference and Maximum Likelihood (ML) estimation based on the sequences of 12 mitochondrial PCGs; gene, which is encoded on the L-strand, was excluded from analysis. The sequences were aligned with ClustalX 1.81 (Thompson et al. 1997; Jeanmougin et al. 1998), and phylogenetic analysis was performed using MrBayes 3.1.2 and PAUP* 4.0 (Swofford 2001; Ronquist and Huelsenbeck 2003).
Table 2.

Base composition (in percentages) of the mitochondrial genomes of 17 species of .

SpeciesTotal length (bp)T (%)C (%)A (%)G (%)A + T content (%)AT-skewGC-skewAccession number
Eophona personata 1677123.032.130.714.253.70.142-0.386 KX812499
Eophona migratoria 1679822.932.330.714.053.70.145-0.397 KX423959
Oreomystis bairdi 1683323.731.630.314.453.90.123-0.373 KM078807
Paroreomyza montana 1683223.531.530.814.254.40.134-0.379 KM078771
Melamprosops phaeosoma 1684024.231.030.514.354.70.114-0.370 NC_025617
Acanthis flammea 1682024.031.430.514.254.50.120-0.378 NC_027285
Loxops coccineus 1558924.231.930.113.854.30.108-0.395 KM078785
Loxia curvirostra 1680523.831.430.614.354.30.125-0.375 KM078800
Carduelis spinus 1682824.031.330.913.854.90.127-0.388 HQ915866
Chloris sinica 1681324.730.530.714.155.40.108-0.369 HQ915865
Serinus canaria 1680523.931.131.113.855.00.130-0.384 KM078794
Haemorhous cassinii 1681224.330.530.914.255.20.120-0.364 KM078786
Coccothraustes coccothraustes 1682323.931.030.914.354.80.127-0.369 KM078789
Hemignathus parvus1683323.831.530.314.454.10.120-0.371 KM078799
Fringilla montifringilla 1680723.332.130.314.353.60.130-0.382 JQ922259
Hesperiphona vespertina 1681023.631.630.814.054.40.132-0.387 KM078770
Crithagra dorsostriata 1680424.230.831.113.855.40.125-0.382 KM078798
Based on the Akaike Information criterion (AIC), GTR+I+G was estimated as the best-fit substitution model using Modeltest 3.7 (Posada and Crandall 1998). Bootstrap tests were based on 1000 replicates (Felsenstein 1981; Wilgenbusch and Swofford 2003). For Bayesian analysis, Metropolis-coupled Markov chain Monte Carlo analysis was performed, with 4 chains run in parallel, for 2,000,000 generations, and the first 25% of each of the sampled 1000 generations were excluded as burn-in. Following Sangster and Luksenburg (2020), we verified the identity and integrity of our mitogenome sequence of with reference sequences of multiple protein-coding genes: NADH dehydrogenase subunit 2 (ND2, 1041 bp; n = 1), part of cytochrome oxidase subunit I (COX1, 698 bp; n = 6), and cytochrome b (CYTB, 1143 bp; n = 2). These markers are commonly used in avian systematics, and reference sequences were available for each marker.

Results

Comparison of mitogenomes of Japanese grosbeak and those of other species of

The complete mitochondrial genome of (KX812499, GenBank) was sequenced and a genome map was constructed (Fig. 1). The genome contains 13 PCGs, 22 transfer RNA (tRNA) genes, two ribosomal RNA (rRNA) genes, and one control region (Table 1).
Figure 1.

Circular map of the mitochondrial genome of . tRNAs are denoted as one-letter symbols according to IUPAC-IUB single-letter amino acid codes; L1 = UUR, L2 = CUN, S1 = UCN, S2 = AGY.

Table 1.

Organization of the mitochondrial genome of .

GeneLocation (bp)Size (bp)Spacer (+) or overlap (–)StrandStart codonStop codonAnticodon
D-loop 1–11871187H
tRNAPhe 1188–1255680H GAA
12S rRNA 1256–22309750H
TRNAVal 2231–2300700HTAC
16S rRNA 2301–390416040H
tRNALeu(UUR) 3905–39797516HTAA
ND1 3996–49739786HATGAGG
tRNAIle 4980–5053743HGAT
tRNAGln 5057–512771-1LTTG
tRNAMet 5127–5195690HCAT
ND2 5196–62361041-1HATGTA-
tRNATrp 6236–6305701HTCA
tRNAAla 6307–6375691LTGC
tRNAAsn 6384–6456730LGTT
tRNACys 6457–652367-1LGCA
tRNATyr 6523–6593711LTGA
COX1 6595–81451551-9HGTGAGG
tRNASer(UCN) 8137–8209733LGTC
tRNAAsp 8213–8281698HGTC
COX2 8290–89736841HATGTAA
tRNALys 8975–9043691HTTT
ATPase8 9045–9212168-10HATGTAA
ATPase6 9203–98866847HATGTAA
COX3 9894–106777840HATGT--
tRNAGly 10678–10746690HTCC
ND3 10747–110973511HATGTAA
tRNAArg 11099–11168701HTCG
ND4L 11170–11466297-7HATGTAA
ND4 11460–1283713780HATGTAT
tRNAHis 12838–12907700HGTG
tRNASer(AGY) 12908–1297366-1HGCT
tRNALeu(CUN) 12973–13043710HTAG
ND5 13044–1486118188HATGAGA
CYTB 14870–1601211435HATGTAA
tRNAThr 16018–160866918HTGT
tRNAPro 16105–16174706LTGG
ND6 16181–16699519-71LATGTAG
tRNAGlu 16701–16771711LTTC
Circular map of the mitochondrial genome of . tRNAs are denoted as one-letter symbols according to IUPAC-IUB single-letter amino acid codes; L1 = UUR, L2 = CUN, S1 = UCN, S2 = AGY. Organization of the mitochondrial genome of . The length of the complete mitogenome of is 16,771 bp, and is similar to that of other species (Table 2). The base composition of the genome is C (32.1%), A (30.7%), T (23.0%) and G (14.2%); the proportion of A+T (53.7%) is higher than G+C (46.3%), suggesting a strong A+T bias. The mitogenomes of 17 species showed a positive AT-skew and a negative GC-skew. Base composition (in percentages) of the mitochondrial genomes of 17 species of . Sequence analysis of the 13 PCGs in the mitogenome of revealed that the base composition of the gene was not consistent with the other genes, and the percentage of T and G is much higher than in the other genes, with a positive GC skew (Table 3), whereas the base composition and skewness are highly similar for the other genes. The encoded genes share the common start codon ATN, with ATG most commonly observed. However, the start codon for COI is GTG. The non-coding regions include a control region (D-loop) and a few intergenic spacers. The control region is 1187 bp and is located between the tRNAGlu and tRNAPhe.
Table 3.

Base composition (in percentages) of the genes of .

Gene Proportion of nucleotides %A+T AT skew GC skew %A+C%G+T
T C A G
ND1 25.333.327.314.152.60.039-0.40560.639.4
ND2 23.535.730.810.154.20.135-0.55966.433.6
COX1 23.332.727.516.550.80.081-0.32860.139.9
COX2 20.933.630.315.251.20.183-0.37763.936.1
ATP8 22.039.932.16.054.20.187-0.74072.028.0
ATP6 23.136.830.19.953.20.132-0.57567.033.0
COX3 24.033.327.615.251.50.069-0.37460.839.2
ND3 26.834.227.411.754.10.011-0.49161.538.5
ND4L 24.636.727.311.451.90.052-0.52464.036.0
ND4 23.135.630.510.953.60.138-0.53166.034.0
ND5 23.133.531.611.854.70.156-0.48065.134.9
CYTB 23.835.127.613.651.40.073-0.44262.637.4
ND6 38.99.610.441.049.3-0.5780.62020.080.0
12S rRNA 20.027.731.420.951.40.222-0.13959.140.9
16S rRNA 21.424.234.719.756.10.236-0.10258.941.1
Total23.631.929.315.252.90.108-0.35361.238.8
Base composition (in percentages) of the genes of . Based on the tRNA gene sequences identified, secondary structures of the tRNAs were determined (Fig. 2).
Figure 2.

Predicted secondary structures for the 22 tRNAs in .

Predicted secondary structures for the 22 tRNAs in .

Protein-coding genes and gene order

The 13 PCGs in the mitogenome of spans a length of 11,399 bp, and encode for six NADH dehydrogenase subunits, three cytochrome c oxidase subunits, two ATPases and cytochrome b. The light-strand has nine genes, which includes eight tRNA genes and , and the heavy-strand has 28 genes, which includes 14 tRNA genes, two rRNA genes and 12 protein-coding genes. Relative synonymous codon usage (RSCU) values for the 13 PCGs are shown in Table 4. There are 3798 codons in the 13 PCGs, and codons of leucine, proline, isoleucine, and threonine take a higher proportion (Fig. 3). The codons AAA-lysine, GAA-glutamic acid, AAC-asparagine, UUC-phenylalanine, and AAU-asparagine are AT-rich, and the codons CUC-leucine, CCU-proline, GCC-alanine, and AGC-serine are GC-rich.
Table 4.

Relative synonymous codon usage (RSCU) values for the 13 protein-coding genes in .

CodonCount RSCU CodonCount RSCU
UUU(F)410.56UCU(S)400.82
UUC(F)1051.44UCC(S)551.13
UUA(L)320.37UCA(S)521.07
UUG(L)80.09UCG(S)80.16
CUU(L)830.96CCU(P)1741.69
CUC(L)1621.88CCC(P)1141.11
CUA(L)1922.23CCA(P)1051.02
CUG(L)400.46CCG(P)190.18
AUU(I)800.74ACU(T)991.34
AUC(I)1541.43ACC(T)921.25
AUA(I)890.83ACA(T)941.27
AUG(M)421.00ACG(T)100.14
GUU(V)361.01GCU(A)390.87
GUC(V)421.17GCC(A)881.96
GUA(V)501.40GCA(A)461.02
GUG(V)150.41GCG(A)70.16
UAU(Y)410.70UGU(C)170.76
UAC(Y)761.30UGC(C)281.24
UAA(*)170.59UGA(*)672.31
UAG(*)30.10UGG(W)231.00
CAU(H)970.93CGU(R)270.68
CAC(H)1111.07CGC(R)541.37
CAA(Q)1021.73CGA(R)401.01
CAG(Q)160.27CGG(R)280.71
AAU(N)1090.98AGU(S)390.80
AAC(N)1131.02AGC(S)972.00
AAA(K)891.73AGA(R)350.89
AAG(K)140.27AGG(R)531.34
GAU(D)190.60GGU(G)340.80
GAC(D)441.40GGC(G)531.24
GAA(E)491.78GGA(G)611.43
GAG(E)60.22GGG(G)230.54
Figure 3.

Codon distribution in the mitochondrial genome of .

Relative synonymous codon usage (RSCU) values for the 13 protein-coding genes in . Codon distribution in the mitochondrial genome of . The gene order and arrangement of the region located between and tRNAPhe, is tRNAThr, tRNAPro, ND6, tRNAGlu, control region, and tRNAPhe, as shown in Fig. 4.
Figure 4.

Mitochondrial gene order and arrangement in .

Mitochondrial gene order and arrangement in .

Phylogenetic analysis

Phylogenetic relationships of the 17 species, inferred from Bayesian and Maximum Likelihood analyses, recovered almost identical well-resolved topologies (Fig. 5). Four major clades can be distinguished: clade 1 consisted of the genus ; clade 2 comprised three genera of grosbeaks (, , ); clade 3 comprised of five genera (, , , , and ); and clade 4 consisted of three genera of honeycreepers (, , and ). Clades 3 and 4 were inferred as sister-groups, which together were sister to clade 2. Clade 1 was sister to all other cardeline finches.
Figure 5.

The phylogenetic tree generated for 17 species of . The values indicated at the nodes are Bayesian posterior probabilities (left) and ML bootstrap proportions (right).

The analysis recovered and as sister groups to , with strong support in both Bayesian and ML analysis (Fig. 5). The phylogenetic tree generated for 17 species of . The values indicated at the nodes are Bayesian posterior probabilities (left) and ML bootstrap proportions (right). No evidence for chimerism was found in comparisons of the ND2, COX1, and CYTB fragments with reference sequences on GenBank. Thus, in all cases the mitogenome of clustered with, and was very similar to, reference sequences of this species (data not shown).

Discussion

In this study, we obtained the complete mitogenome sequence of , and performed molecular phylogenetic analysis of 17 species based on the sequences of 12 mitochondrial PCGs. Our results revealed that the complete mitogenome of is 16,771 bp, and contains 13 protein-coding genes, 22 transfer RNA (tRNA) genes, two ribosomal RNA (rRNA) genes, and one control region. Analysis of the base composition revealed an A+T bias, a positive AT skew and a negative GC skew. Phylogenetic analysis demonstrated that is the closest genus to and that is the sister taxon of . Our analysis showed that the mitogenome of is similar to that of other species. The genome structure was that of a typical vertebrate mitochondrial genome (Boore 1999). The A+T bias observed in the mitochondrial genome of was similar to that of other vertebrates (Yang et al. 2016), and the positive AT-skew value and a negative GC-skew value observed, is also consistent with the vertebrate mitogenome (Quinn and Wilson 1993). The gene order and arrangement in was similar to the typical arrangement seen in birds (Mindell et al. 1999; Haddrath and Baker 2001; Liu et al. 2013), including (Caparroz et al. 2018). It is generally believed that the rearrangement of the mitochondrial genome represents a rare evolutionary event that can be used to construct the phylogenetic relationships of distantly related groups (Bensch 2000). In birds, two major types of gene order are found which differ by the number of control region copies (Singh et al. 2008). One of these is believed to be the ancestral gene order and the other is the remnant control region 2 gene order (Singh et al. 2008). The phylogenetic relationships observed in this study are in accordance with previous research (Arnaiz-Villena et al. 2001; Zuccon et al. 2012). In the current study, the three grosbeak genera (, , ; clade 2) clustered together and formed a well-defined clade. The grosbeak consists of a group of fairly large and stocky finches, feeding primarily on hard seeds. Our study corroborates a previous study based on the nuclear and mitochondrial sequences (Zuccon et al. 2012). A close phylogenetic relationship between the four genera (, , , ) has been recognised previously (Vaurie 1959; Clement et al. 1993; Dickinson and Christidis 2015), and their close evolutionary relationships are widely accepted. Our analysis showed that the genus diverged very early within the family and was followed a deep divergence between the grosbeak clade and a clade formed by the other members . Our study as well as previous studies suggests that the analysis of phylogenetic relationships of passerines are more accurately resolved and better supported with complete mitogenomes than with short sequences of single genes (Van der Meij et al. 2005; Nguembock et al. 2009).

Conclusions

In this study, the complete mitogenome of was sequenced and analysed for the first time, and the phylogenetic analysis confirmed the taxonomic classification of . The results showed that the genera and have a close relationship with the genus , and this is consistent with the morphologicl similarity observed between them. Our analysis shows the phylogenetic relationship of as a sister group to , and the mitogenome was observed to be very similar between them.
  30 in total

1.  Mitochondrial genomic rearrangements in songbirds.

Authors:  S Bensch; A Härlid
Journal:  Mol Biol Evol       Date:  2000-01       Impact factor: 16.240

2.  Interordinal relationships of birds and other reptiles based on whole mitochondrial genomes.

Authors:  D P Mindell; M D Sorenson; D E Dimcheff; M Hasegawa; J C Ast; T Yuri
Journal:  Syst Biol       Date:  1999-03       Impact factor: 15.683

3.  Multilocus resolution of phylogeny and timescale in the extant adaptive radiation of Hawaiian honeycreepers.

Authors:  Heather R L Lerner; Matthias Meyer; Helen F James; Michael Hofreiter; Robert C Fleischer
Journal:  Curr Biol       Date:  2011-10-20       Impact factor: 10.834

4.  Inferring evolutionary trees with PAUP*.

Authors:  James C Wilgenbusch; David Swofford
Journal:  Curr Protoc Bioinformatics       Date:  2003-02

5.  Sequence and gene organization of the chicken mitochondrial genome. A novel gene order in higher vertebrates.

Authors:  P Desjardins; R Morais
Journal:  J Mol Biol       Date:  1990-04-20       Impact factor: 5.469

6.  Sequence evolution in and around the mitochondrial control region in birds.

Authors:  T W Quinn; A C Wilson
Journal:  J Mol Evol       Date:  1993-10       Impact factor: 2.395

7.  Complete mitochondrial DNA genome sequences of extinct birds: ratite phylogenetics and the vicariance biogeography hypothesis.

Authors:  O Haddrath; A J Baker
Journal:  Proc Biol Sci       Date:  2001-05-07       Impact factor: 5.349

8.  The phylogenetic relationships and generic limits of finches (Fringillidae).

Authors:  Dario Zuccon; Robert Prŷs-Jones; Pamela C Rasmussen; Per G P Ericson
Journal:  Mol Phylogenet Evol       Date:  2011-10-17       Impact factor: 4.286

9.  Phylogeny and rapid northern and southern hemisphere speciation of goldfinches during the Miocene and Pliocene epochs.

Authors:  A Arnaiz-Villena; M Alvarez-Tejado; V Ruíz-del-Valle; C García-de-la-Torre; P Varela; M J Recio; S Ferre; J Martínez-Laso
Journal:  Cell Mol Life Sci       Date:  1998-09       Impact factor: 9.261

10.  The adaptive evolution of the mammalian mitochondrial genome.

Authors:  Rute R da Fonseca; Warren E Johnson; Stephen J O'Brien; Maria João Ramos; Agostinho Antunes
Journal:  BMC Genomics       Date:  2008-03-04       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.