Literature DB >> 26273219

The complete mitochondrial genome sequence of the little egret (Egretta garzetta).

Yi Zou1, Mei-Dong Jing1, Xiao-Xin Bi1, Ting Zhang1, Ling Huang1.   

Abstract

Many phylogenetic questions in the Ciconiiformes remain unresolved and complete mitogenome data are urgently needed for further molecular investigation. In this work, we determined the complete mitogenome sequence of the little egret (Egretta garzetta). The genome was 17,361 bp in length and the gene organization was typical of other avian mtDNA. In protein-coding genes (PCGs), a C insertion was found in ND3, and COIII and ND4 terminated with incomplete stop codons (T). tRNA-Val and tRNA-Ser (AGY) were unable to fold into canonical cloverleaf secondary structures because they had lost the DHU arms. Long repetitive sequences consisting of five types of tandem repeats were found at the 3' end of Domain III in the control region. A phylogenetic analysis of 11 species of Ciconiiformes was done using complete mitogenome data and 12 PCGs. The tree topologies obtained with these two strategies were identical, which strongly confirmed the monophyly of Ardeidae, Threskiorothidae and Ciconiidae. The phylogenetic analysis also revealed that Egretta was more closely related to Ardea than to Nycticorax in the Ardeidae, and Platalea was more closely related to Threskiornis than to Nipponia in the Threskiornithidae. These findings contribute to our understanding of the phylogenetic relationships of Ciconiiformes based on complete mitogenome data.

Entities:  

Keywords:  Egretta garzetta; mitochondrial genome; phylogenomics

Year:  2015        PMID: 26273219      PMCID: PMC4530654          DOI: 10.1590/S1415-4757382220140203

Source DB:  PubMed          Journal:  Genet Mol Biol        ISSN: 1415-4757            Impact factor:   1.771


Introduction

With more than 9,000 living species, Aves is the most diverse class of vertebrates. The huge number of species, complex morphological characters and wide range of ecological behaviors make it difficult to solve the phylogenetic relationship of birds in traditional taxonomy (Bock, 1956; Howard and Moore, 1980; Monroe and Sibley, 1993). The order Ciconiiformes, consisting of more than 110 species of large or medium size waders, has traditionally be classified into five families (Ciconiidae, Threskiornithidae, Ardeidae, Balaenicipitidae and Scopidae) (Howard and Moore, 1980; Austin, 1985; Gill, 1990; Clements, 2000; Zheng, 2002). However, there have been various uncertainties regarding the evolutionary relationships of different taxa in this order: (1) The phylogenetic relationships among the five families have been questioned in morphological studies (Kahl, 1972; Cracraft, 1981), (2) the Family Ardeidae was divided into two subfamilies (Ardeinae and Botaurinae) by Bock (1956) and Zheng (1997), but into four subfamilies (Ardeinae, Nycticoracinae, Botaurinae and Tigrisomatinae) by Payne and Risley (1976), and (3) the phylogenetic status of several species in the traditional classification of the subfamily Ardeinae has been questioned. For example, the great egret was initially placed in an independent genus Casmerodius (Peter, 1931), but was put in Egretta by Bock (1956) and Ardea by Payne and Risley (1976). Similarly, the intermediate egret was initially included in Egretta, but then placed in Mesophoyx by Sibley and Monroe (1990). The taxonomic position of the cattle egret had also changed many times; in early taxonomic literature this species belonged to Bubulcus (Peter, 1931), but was subsequently placed in Ardeola by Bock (1956) and in Egretta by Payne and Risley (1976). Genome sequences, which provide direct information on evolutionary history, are perfect markers for phylogenetic studies since the resulting analyses can be used to assess and revise the conclusions of traditional taxonomy. In the last 30 years, molecular investigations have shed new light on the evolutionary history of the Ciconiiformes. Based on DNA hybridization results, Sibley merged Ciconiiformes and four other orders (Gaviiformes, Podicipediformes, Lariformes and Charadriiformes) into a huge new order. However, recent molecular studies have proposed the paraphyly of Ciconiiformes because the herons and ibises in this group showed a close relationship with Pelecaniformes, whereas the storks were closely related to Sphenisciformes (Hedges and Sibley, 1994; Cracraft ; Hackett ; Pacheco ). The North American Classification Committee (NACC) has recommended that the families Ardeidae, Threskiornithidae, Balaenicipitidae and Scopidae be merged into Pelecaniformes, and Ciconiiformes was restricted to include only the Ciconiidae. Molecular studies of the Ardeidae have indicated that day herons and night herons are closely related, and that Nycticoracinae should be merged into Ardeinae, while the tiger herons and boat-billed heron were basal lineages and should be placed in the Tigrisomatinae and Cochleariinae, respectively (Sheldon, 1987; Sheldon and Kinnarney, 1993; Sheldon , 2000). This four-subfamily classification (Ardeinae, Botaurinae, Tigrisomatinae and Cochleariinae) has been generally accepted. Molecular investigations of the subfamily Ardeinae have shown that the great egret and intermediate egret form a monophyletic lineage that is more closely related to Ardea than to Egretta, indicating that they should not be placed in Egretta (Sheldon, 1987; Sibley and Monroe, 1990; Sheldon and Kinnarney, 1993; Sheldon , 2000; Chang ). In molecular systematics, the topologies of phylogenetic trees vary with the molecular markers used and the number of taxa involved (Zwickl and Hillis, 2002). Consequently, some phylogenetic uncertainties in the Ardeinae (such as the evolutionary status of the cattle egrets Ardeola and Butorides) have not been resolved (Chang ; Zhou XP, 2008, PhD thesis, Xiamen University, China). Mitochondrial DNA (mtDNA), with its intrinsic characteristics (small genome size, simple genome structure, exclusively maternal inheritance, lack of extensive recombination and rapid rate of evolution), has been extensively used in taxonomic and phylogenetic studies of vertebrates (Ingman ; Sheldon ; Gentile ; Zhang and Wake, 2009; Pacheco ; Suzuki ). Compared to individual genes, complete mitogenomes contain more information on an organisms or taxon’s evolutionary history, reduce stochastic errors and minimize the effect of homoplasy in phylogenetic studies (Campbell and Lapointe, 2011). Phylogenies based on complete mitogenomes are generally consistent with those derived from nuclear genes if appropriate sampling of taxa and analysis are applied (Arnason ; Reyes ; Kjer and Honeycut, 2007). Complete mitogenomes have increasingly been used to address the evolution and radiation of birds (Moum ; Sato ; Pacheco ). To date, more than 260 avian mitogenomes have been deposited in GenBank, only four of which involve species belonging to the Ardeidae (Egretta eulophotes, Ardea novaehollandiae, Ixobrychus cinnamomeus and Nycticorax nycticora). The lack of complete mitogenome data is an important limitation in solving the evolutionary puzzles of the Ardeidae and Ciconiiformes. In this report, we describe the complete mitogenome sequence of the little egret (Egretta garzetta) and provide a comprehensive analysis of its genome characters. Although the phylogenetic status of this species has been well-defined by morphological and molecular studies (Bock, 1956; Payne and Risley, 1976; McCracken and Sheldon, 1997; Rabosky and Matute, 2013), the availability of its complete mitogenome data will provide useful information for molecular phylogenetic studies and conservation biology of the Ardeidae.

Material and Methods

Sample collection and extraction of genomic DNA

One specimen of E. garzetta was collected from Wuyi Mountain, Fujian Province, China. The specimen was identified based on external characteristics, using the system of Sibley and Monroe (1990). Total genomic DNA was extracted from muscle tissue with a Wizard Genomic DNA purification kit (Promega, Madison, WI, USA) according to the manufacturer’s instructions. The concentration of extracted DNA was determined using a spectrophotometer and adjusted to 50 ng/μL.

PCR amplification and sequencing

The E. garzetta mtDNA was obtained by polymerase chain reactions (PCR) using 28 primer sets reported by Sorenson . The PCR products for each set of primers were < 1,500 bp in size and all fragment sequences overlapped each other by at least 200 bp. PCR amplifications were done with a Mycycler Gradient thermocycler (Bio-Rad) in a final volume of 50 μL, including 5 μL of 10x EXTaq buffer (Mg2+-free; Takara Biotech, Dalian, China), 2.5 mM of each dNTP, 75 mM MgCl2, 10 μM of each primer, 1.5 U of EX Taq polymerase (Takara of Biotech, Dalian, China) and approximately 20–50 ng of total genomic DNA. The reaction included an initial denaturation at 94 °C for 3 min, followed by 35 cycles consisting of denaturation at 94 °C for 10 s, annealing at 50–56 °C for 30 s and extension at 72 °C for 2 min, with a final extension at 72 °C for 10 min. There was a negative control in each round of PCR to check for contamination. The products were electrophoresed on 1.5% agarose gels staining with ethidium bromide and visualized by ultraviolet transillumination. The PCR products were purified with a gel extraction kit (Sangon BioMedical, Shanghai, China) and directly sequenced (both directions) with an ABI 3730XL automatic sequencer (Perkin-Elmer) using an ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction kit (with AmpliTaq DNA polymerase FS, Applied Biosystems).

Sequence assembly, annotation and analysis

Sequence assembly and annotation were done using the DNASTAR software package (Lasergene version 5.0; Madison, WI, USA). The boundaries of protein-coding genes and rRNA genes were determined by aligning our sequences with the complete mtDNA sequences of A. novaehollandiae (NC_008551) and Gallus gallus (NC_001323; Galliformes: Phasianidae) in GenBank. The boundaries and the cloverleaf secondary structures of tRNAs were identified by tRNAscan-SE v 1.12 with the default settings. The complete nucleotide sequence was submitted to GenBank under accession no. NC_023981 and the blast sequences are submitted to DRYAD (doi:10.5061/dryad.3g604). The base composition for protein-coding genes (PCGs), the codon usage of 13 PCGs and the pairwise distances among mitogenomes of the species studied were calculated with MEGA version 5 (Tamura ).

Phylogenetic inference using mitogenomes

The phylogenetic relationships among E. garzetta and four other species in the Ardeidae (A. novaehollandiae, E. eulophotes, I. cinnamomeus and N. nycticorax), four species in the Threskiornithidae (Platalea leucorodia, Platalea minor, Threskiornis aethiopicus and Nipponia nippon) and two species in the Ciconiidae (Ciconia boyciana, Ciconia ciconia) were constructed with complete mtDNA sequences and 12 PCGs (excluding ND6). Two species in the family Anatidae, order Anseriformes (Branta canadensis, NC_007011; Anas platyrhynchos, EU009397) were designated as outgroups. The relevant information for each genome is presented in Table S1. The program Modeltest version 3.7 (Posada and Crandall, 1998) was used to choose an appropriate substitution model of sequence evolution. The GTR+I+G model was selected as the best fitting model. For the Bayesian procedure, four independent Markov chains were run for 10,000,000 generations by sampling one tree per 1,000 generations and allowing adequate time for convergence. After discarding the first 2,500 trees (25%) as part of a burn-in procedure that was determined by checking for the likelihood of being stationary, we used the remaining 7,500 sampling trees to construct a 50% majority rule consensus tree. Two independent runs were used to provide additional confirmation of the convergence of the Bayesian posterior probabilities (BPP) distribution.

Results and Discussion

Genome organization and base composition

The complete mitogenome of E. garzetta is a circular molecule 17,361 bp in length (Figure 1). This size is intermediate to all available ardeid mitogenomes, which range from 17,180 bp (I. cinnamomeus; Zhang ) to 17,829 bp (N. nycticorax, NC_015807). The gene organization is identical to that of typical avian mtDNA (Wolstenholme, 1992; Boore, 1999; Roques ; Gibb ; Kan ; Zhang, ; Figure 1). Table 1 shows the various features of this genome. There are six regions in which genes overlapped by 29 bp and 18 intergenic spacer regions comprising a total of 97 bp.
Figure 1

Gene organization of the E. garzetta mitogenome. ND1–6 refers to NADH dehydrogenase subunits 1–6, COI–III refer to cytochrome c oxidase subunits 1–3, ATP6 and ATP8 refer to ATPase subunits 6 and 8, and Cyt b refers to cytochrome b. Twenty-two tRNA genes are designated by single-letter amino acid codes.

Table 1

Organization of the E. garzetta mitochondrial genome.

GenePosition a Size (bp)Spacer (+)/Overlap (−)Strand b Codon


FromToStart c Stop c
tRNA-Phe169690H
12s-rRNA7010409710H
tRNA-Val10411111710H
16s-rRNA1112271816070H
tRNA-Leu (UUR)27192792748H
ND1 280137789787HATGAGA
tRNA-Ile378638567111H
tRNA-Gln38683937700L
tRNA-Met39384005680H
ND2 4006504410390HATGTAG
tRNA-Trp50455116722H
tRNA-Ala511951866810L
tRNA-Asn51975270743L
tRNA-Cys5274534067−1L
tRNA-Tyr534054117213L
CO I 542569751551−9HGTGAGG
tRNA-Ser (UCN)69677040742L
tRNA-Asp70437111691H
CO II 711377966841HATGTAA
tRNA-Lys77987867701H
ATP8 78698036168−10HATGTAA
ATP6 80278710684−1HATGTAA
CO III 871094937840HATGT d
tRNA-Gly94949562690H
ND3 956399143522HATTTAA
tRNA-Arg99179985691H
ND4L 998710283297−7HATGTAA
ND4 102771165413780HATGT d
tRNA-His1165511724700H
tRNA-Ser (AGY)117251179268−1H
tRNA-Leu (CUN)1179211863720H
ND5 1186413678181510HATGAGA
Cyt b 136891483111433HATGTAA
tRNA-Thr14835149047011H
tRNA-Pro1491614987728L
ND6 14996154724773LATGAGA
tRNA-Glu1547615549740L
Control region155501736118120H

Position numbering starts with the 5′ position of the Control region;

Genes transcribed from the L or H strand;

Start and stop codons of protein-coding genes;

Protein-coding genes overlapping with tRNA genes end with an incomplete stop codon.

Position numbering starts with the 5′ position of the Control region; Genes transcribed from the L or H strand; Start and stop codons of protein-coding genes; Protein-coding genes overlapping with tRNA genes end with an incomplete stop codon. The base composition of the E. garzetta mitogenome revealed a slight bias towards A+T (31.5% A, 23.2% T, 31.8% C and 13.5% G). The A+T content for the whole H-strand, different genes and control regions was estimated for 11 mitogenomes in Ciconiiformes (Table 2). This analysis showed that, except for the first codon of PCGs, other portions of these mitogenomes showed varying degrees of preference for A/T. The equations AT-SKEW= (A−T)/(A+T) and GC-SKEW= (G−C)/(G+C) can be used to calculate the skew for a given strand to investigate nucleotide bias (Perna and Kocher, 1995). The positive AT-skew (0.138) and negative GC-skew (−0.399) for the E. garzetta mitogenome suggested the occurrence of more A and C than T and G, which is consistent with other avian mitogenomes (Haring ; Kan ; Yang ; Zhang ).
Table 2

Genomic characteristics of 11 avian mtDNAs.

SpeciesHeavy-strand12 Protein-coding genesLrRNA geneSrRNA genetRNA geneControl region






Length (bp)AT%Length (bp)AT% (all)AT% (1st)AT% (2nd)AT% (3rd)Length (bp)AT%Length (bp)AT%Length (bp)AT%Length (bp)AT%
P. leucorodia 15,58555.310,87454.949.458.756.81,59956.497453.21,56758.71,14056.1
P. minor 15,78455.410,87555.049.458.756.61,59956.297553.21,55258.91,35256.6
T. aethiopicus 15,82655.210,87454.650.257.755.41,59855.197352.21,56759.11,38257.8
N. nippon 15,56754.010,87453.249.358.751.71,60355.597752.21,55257.51,16059.1
A. novaehollandiae 16,35455.410,87553.449.857.752.01,60655.497051.41,55557.21,92267.1
E. eulophotes 16,42155.010,87053.348.957.753.01,60554.897151.61,55056.31,99764.7
E. garzetta 16,24555.010,87353.449.857.853.01,60754.797150.81,55356.21,81265.3
I. cinnamomeus 16,02757.110,87356.151.058.858.51,59155.997153.91,55558.91,60965.3
N. nycticorax 16,66556.110,87354.850.158.955.81,59656.097352.11,55157.02,24462.8
C. boyciana 16,48753.810,87152.448.357.550.41,61253.696851.61,55057.32,05360.5
C. ciconia 16,21253.810,87452.648.357.551.51,60854.096851.71,55057.21,77959.5
Average16,10755.110,87354.049.558.254.11,60255.297252.21,55557.71,67761.3

Protein-coding genes and codon usage

The total length of 13 PCGs in the E. garzetta mitogenome was 11,225 bp, and most of the PCGs were separated by one or more tRNAs (Figure 1). The gene sizes and structures were not significantly different from those of other avian species (Yamamoto ; Haring ; Yang ; Kan ; Zhang ). There is a C insertion at position 174 in ND3, and this insertion was also found in some species of Palaeognathae, e.g., NC_002784, NC_002778 and NC_002782 (Härlid ) and Neognathae, e.g., NC_011307 and NC_010962 (Zhang ). Other analyses have proposed that the insertion is not C at position 174 but A at position 175, as reported in the mitogenomes of Otis tarda (Gruiformes: Otididae, NC_014046) (Yang ) and Trachemys scripta(Testudoformes: Emydidae) (Russell and Beckenbach, 2008). The function of this extra C or A in ND3 and its phylogenetic implications are not well known (Russell and Beckenbach, 2008), but the effect of this insertion on gene expression can be removed by RNA alternative splicing or a frameshift (Mindell ). The average A+T value of 13 PCGs in E. garzetta is 53.10% (Table 3). Except for ND1, the other PCGs had positive AT-skew (0.016 ∼ 0.563) and negative GC-skew (−0.295 ∼ −0.733), indicating the occurrence of more A and C than T and G (Table 3). The nucleotide compositions of three codons in PCGs were estimated for 11 species (Table 4). The results showed that the smallest and greatest variations occurred in the second (A 0.5%, G 0.3%, C 0.6%, T 0.5%) and third (A 4.4%, G 3.0%, C 5.5%, T 3.7%) codons, respectively. The second codon is generally considered to have undergone maximum selective pressure, followed by the first and third codons and other non-coding regions. Different selective pressures result in different nucleotide variability (Zhong ). Table 4 also shows that the G content of the third codon (only 4.1%) was the smallest of the three codons. A similar phenomenon has also been found in mammalian mitogenomes (Reyes ; Gibson ).
Table 3

Base composition for protein-coding genes found in mtDNA of E. garzetta.

GeneLength (bp)Proportion of nucleotides (%)AT SkewGC Skew

ACGTA+T
ND197826.3834.4612.6826.4852.86−0.002−0.462
ND2103932.6333.5910.1123.6856.310.159−0.537
COX1155128.2430.1116.3825.2753.510.056−0.295
COX268431.4331.4314.1822.9554.380.156−0.378
ATP816832.1438.695.9523.2155.350.161−0.733
ATP668430.1236.849.9423.1053.220.132−0.575
COX378428.5731.7615.4324.2352.800.082−0.346
ND335226.7036.0811.3625.8552.550.016−0.521
ND4L29729.9735.3511.4523.2353.200.127−0.511
ND4137831.4936.219.6522.6454.130.163−0.579
ND5181531.9035.4310.8521.8253.720.188−0.531
CYTB114327.4737.1012.6022.8350.300.092−0.493
ND647737.5341.9310.0610.4848.010.563−0.613
Average30.3535.3111.5922.7553.100.146−0.506
Table 4

Nucleotide compositon of the 13 protein-coding genes.

Species1st codon position2nd codon position3rd codon potion



A%G%C%T%A%G%C%T%A%G%C%T%
P. leucorodia 29.720.230.020.120.112.329.338.341.43.840.414.4
P. minor 29.720.229.920.220.012.429.238.441.34.040.014.7
T. aethiopicus 29.520.429.720.320.012.429.638.041.04.140.814.1
N. nippon 29.520.430.719.420.012.429.338.339.25.443.312.1
A. novaehollandiae 30.120.130.419.420.112.229.738.040.63.944.111.4
E. eulophotes 30.320.030.619.120.012.429.737.940.64.043.911.5
E. garzetta 30.020.130.519.420.112.229.738.040.54.043.711.8
I. cinnamomeus 30.819.428.821.020.112.329.438.243.22.439.614.8
N. nycticorax 30.520.030.019.520.212.329.338.240.44.639.915.1
C. boyciana 29.720.531.118.719.812.429.838.038.84.645.111.5
C. ciconia 29.720.531.118.719.712.529.838.039.14.444.512.0
Range1.31.12.32.30.50.30.60.54.43.05.53.7
Average30.020.230.319.620.012.329.538.140.64.142.313.0
The start and stop codons for the PCGs of the E. garzettamitogenome are shown in Table 1. COIII and DN4 terminated with an incomplete stop codon (T). The use of an incomplete stop codon (T) is common in avian (Härlid ; Haring ; Yang ; Zhang ) and mammalian (Wolstenholme, 1992; Arnason ; Gibson ; Bi ; Chen ; Song ) mitogenomes, and can form a complete UAA terminal signal by posttranscriptional polyadenylation (Ojala ; Boore, 2004). The ND6 gene was located in the L-strand and its base composition was very different from the other 12 PCGs (Table 3) so it was excluded from the codon usage analysis. Twelve E. garzetta PCGs consisted of 3,626 codons, excluding termination codons (Table S2). The usage frequencies of 21 amino acids ranged from 0.69% (Cys) to 17.9% (Leu). Except for Leu, the most frequently used amino acids were Ile (11.47%), Thr (9.93%) and Ala (7.73%), which was similar with those of other ardeid species (Zhang ).

Ribosomal and transfer RNA genes

Animal mitogenomes contain small (srRNA) and large (lrRNA) subunits of rRNA (Wu ; Gibson ; Kan ; Krajewski ; Bi ; Chen ; Zhang ; Gao ), and E. garzetta was no exception (Figure 1). The A+T content for srRNAand lrRNA was 50.8% and 54.7%, respectively, and these values were relatively small among the 11 mitogenomes (Table 2). Based on the respective anticodons and secondary structures, 22 tRNA genes were identified and their sizes ranged from 67 bp (tRNA) to 74 bp (tRNA UUR, tRNA, tRNAUCN, tRNA). Twenty tRNAs can fold into canonical cloverleaf secondary structures, while tRNA-Val and tRNA-Ser (AGY) lost the DHU (dihydrouracil) arms. The cloverleaf structures of tRNA-Val and tRNA-Ser (AGY) were identified by comparing them with counterparts in the E. eulophotes mitogenome (NC_009736). In vertebrate mitogenomes, tRNA-Ser (AGY) generally cannot fold into the canonical cloverleaf secondary structure (Härlid ; Shi ; Wu ; Yang ; Gao ). Although the gene sizes and anticodon nucleotides agreed with those described for other vertebrates, there were some atypical pairings in the stem regions, such as A-A, A-C, U-C and U-U wobbles. Generally, the tRNA cloverleaf structure contained 7 bp in the aminoacyl stem, 5 bp in the TΨC and anticodon stems, and 4 bp in the D-stem. However, some tRNAs, e.g., tRNA-Phe, tRNA-Leu (CUN) and tRNA-Ile, lacked one or two bp in the T-stem, anticodon stem or D-stem.

Non-coding regions

The non-coding region (the control region, mtCR) of the E. garzetta mitogenome was determined as1,812 bp in length and located between tRNA and tRNA (Table 1, Figure 1). The mtCR controls the replication and transcription of animal mitogenomes (Shadel and Clayton, 1997; Taanman, 1999). Based on the nucleotide composition, the mtCR region of E. garzetta contains three domains: a 5′-peripheral domain (Domain I), a central conserved domain (Domain II) and a 3′-peripheral domain (Domain III), an organization that was similar to that of other birds (Southern ; Saccone ; Randi ; Roques ; Wang ; Yang ; Zhang ; Figure 2).
Figure 2

Schematic representation of the control region in the mitogenome of E. garzetta. The first box represents the extended termination-associated sequences (ETAS1 and ETAS2). Boxes F, E, D and C represent the conserved sequence boxes in the central domain. CSB – conserved sequence block, CSB-like – a sequence similar to CSB, LSP and HSP – light-strand and heavy-strand transcription promoters, respectively, and Rs – tandem repeats in the control region.

In Domain I (nt 1–328), two putative extended termination-associated sequence blocks (ETAS1 and ETAS2) were recognized and two putative termination-associated sequences (TAS, conserved palindromic motifs 5′-TACAT-3′ and 5′-TATAT-3′) that act as a signal to terminate synthesis of the control region (Saccone ; Randi and Lucchini, 1998; Yamamoto ; Haring ; Roques ) were found in ETAS1. In some birds and mammals, there is a C structure located close to the 5′-peripheral domain of Domain I that can potentially form a stable goose hairpin structure (Quinn and Wilson, 1993; Douzery and Randi, 1997; Sbisà ; Randi and Lucchini, 1998); this structure consists of a stem with seven complementary ‘C’s/‘G’s and a loop containing a TCCC motif (Dufresne ; Yang ). This structure is speculated to be related to H-strand termination (Dufresne ). The hairpin structure cannot be formed in any of the available ardeid mitogenomes because the interrupted poly-C sequences in Domain I of four species (A. novaehollandiaeNC_008551, E. eulophotes NC_009736, N. nycticora NC_015807 and E. garzetta NC_023981) are not followed by a G stretch and Domain I of I. cinnamomeus has no poly-C sequence (Zhang ). A sequence block similar to the conserved sequence block (CSB1) was found in Domain I (Figure 2) and similar structures have been observed in other avian mitogenomes (Desjardins and Morais, 1990; Quinn and Wilson, 1993; Randi and Lucchini, 1998; Kan ; Zhang ). In Domain II (nt 329–794), four conserved sequence boxes (F, E, D and C) were detected (Figure 2) after aligning with reported counterparts in birds and mammals (Walberg and Clayton, 1981; Southern ; Desjardins and Morais, 1990; Quinn and Wilson, 1993; Randi and Lucchini, 1998; Roques ; Kan ; Yang ; Zhang ). Domain III (nt 795–1812) comprised a conserved sequence block (CSB-1) that regulates mtDNA replication (Figure 2). A poly(C) sequence located upstream of the CSB1 was assumed to represent the origin of H-strand replication (OH) (Walberg and Clayton, 1981; Figure 2). A poly (T) sequence located downstream of the CSB1 was also observed in the mtCR of other birds (NC_008551, NC_009736; NC_015807; Kan ; Zhang ). The bidirectional light- and heavy-strand transcription promoters (LSP/HSP) described in other birds (L’abbé ; Randi and Lucchini, 1998; Ritchie and Lambert, 2000; Kan ; Zhang ) also existed in Domain III of E. garzetta. In addition, long tandem repeats were found at the 3′ end of Domain III and could be divided into two regions: the first region (nt 977 to 1399) contained three types of tandem repeats: 5′-TACTTTAAAGCACTAAAA-3′ (6×18 bp), 5′-TTTCATTAAAAATATACTATACCCTTCATGAAC-3′ (5×33 bp), and 5′-TGTATCCTTATATCTTTATGT TACCTTTAC-3′ (4×30 bp) while the second region (nt 1406 to 1804) comprised two types of tandem repeats: 5′-TAAACAA-3′ (26×7 bp) and 5′-CAAACAA-3′ (30×7 bp). The existence of repetitive sequences contributed to the large size of the mtCR and the high content of A. Similar tandem repeats (CAAA or CAAACAA) were found in species of Charadriiformes (NC_003712, NC_003713, NC_007978, NC_018548, NC_017601, NC_024069; Wenink ) and Gruiformes (Yang ), and in C. boyciana in Ciconiiformes (Yamamoto ). These repetitive sequences have been speculated to result from the pause of H-strand replication and subsequent slipped mispairing (Fumagalli ). The presence of similar conserved repeat sequences in different animal groups (Douzery and Randi, 1997; Nesbø ) has led some researchers to propose that these tandem repeats may have an important role in regulating mitogenome replication and transcription (Delarbre ; Delport ).

Phylogenomic relationships of 11 species in Ciconiiformes

Mitochondrial sequences provide valuable information for tracing the history of gene rearrangements and phylogenetic reconstructions (Härlid ; Braband ; Oh ; Yang ; Cerasale ). The availability of an increasing number of complete avian mitogenomes has allowed the construction of phylogenetic trees with better resolution, the results of which show better agreement with morphological and nuclear marker studies (Zhang and Wake, 2009; Pacheco ). The phylogenetic tree that included E. garzetta and ten other species in Ciconiiformes (Table S1) was constructed using complete mitogenome sequences, with A. platyrhynchos(EU009397) and B. canadensis (NC_007011) as outgroups. Since some investigators have preferred to use PCGs in tree construction (Härlid ; Gibson ; Shen ; Zhang ), we also ran an analysis with 13 PCGs to assess the congruence between these two strategies. The results showed that although several regions (tRNAs, CR, rRNAs and ND6) presented some problems in the analysis, e.g., difficulties in alignment, numerous gaps, potential saturation and heterogeneous base composition (Gardner ; Sullivan and Joyce, 2005; Krajewski ; Oh ), the topologies of the phylogenetic trees generated by the two strategies were the same (Figure 3).
Figure 3

Bayesian tree based on the complete mitochondrial genome data and 13 PCGs with the GIR+I+G model. The horizontal length of each branch corresponds to the substitution rates estimated with the model. Anas platyrhynchos and Branta canadensis were used as outgroups. Numbers on the branches are the bootstrap values for Bayesian posterior probability.

The phylogenetic relationships among species/genera within the three families examined here were consistent with the conclusions of previous investigations (Sheldon ; Chang ; Zhang ). The monophyly of the Ardeidae, Threskiorothidae and Ciconiidae was strongly confirmed (posterior probabilities = 1.00; Figure 3). In the Ardeidae, I. cinnamomeus was the basal clade and Egretta more closely related to Ardea than to Nycticorax. In Threskiornithidae, Platalea was more closely related to Threskiornis than to Nipponia. The relationships revealed by the phylogenetic trees were also supported by the pairwise distances among mitogenomes (Table S3). With regard to the evolutionary relationships among the three families, our results supported a closer relationship between Threskiorothidae and Ciconiidae than between Threskiorothidae and Ardeidae, a conclusion similar to that based on amino acid data from 12 PCGs (Zhang ), but different from that of Hackett and Pacheco . Since the topologies of molecular phylogenetic trees often vary with the markers and taxa used (Zwickl and Hillis, 2002), divergent evolutionary relationships have often been suggested for the families of Ciconiiformes (Gibb ; Hackett ; Pacheco ; Zhang ; this study). More complete mitogenome data for the Ardeidae (and other families in Ciconiiformes) are urgently needed for detailed molecular systematic analyses in this order. The mitogenome sequence data presented here represent a contribution to this long-term goal.
  65 in total

1.  An extra nucleotide is not translated in mitochondrial ND3 of some birds and turtles.

Authors:  D P Mindell; M D Sorenson; D E Dimcheff
Journal:  Mol Biol Evol       Date:  1998-11       Impact factor: 16.240

2.  Organization and evolution of the mitochondrial DNA control region in the avian genus Alectoris.

Authors:  E Randi; V Lucchini
Journal:  J Mol Evol       Date:  1998-10       Impact factor: 2.395

3.  Mammalian mitochondrial D-loop region structural analysis: identification of new conserved sequences and their functional and evolutionary implications.

Authors:  E Sbisà; F Tanzariello; A Reyes; G Pesole; C Saccone
Journal:  Gene       Date:  1997-12-31       Impact factor: 3.688

4.  Sequence and gene organization of the chicken mitochondrial genome. A novel gene order in higher vertebrates.

Authors:  P Desjardins; R Morais
Journal:  J Mol Biol       Date:  1990-04-20       Impact factor: 5.469

5.  Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes.

Authors:  N T Perna; T D Kocher
Journal:  J Mol Evol       Date:  1995-09       Impact factor: 2.395

6.  The mitochondrial control region of Cervidae: evolutionary patterns and phylogenetic content.

Authors:  E Douzery; E Randi
Journal:  Mol Biol Evol       Date:  1997-11       Impact factor: 16.240

7.  Determination of the complete nucleotide sequence and haplotypes in the D-loop region of the mitochondrial genome in the oriental white stork, Ciconia boyciana.

Authors:  Y Yamamoto; K Murata; H Matsuda; T Hosoda; K Tamura; J Furuyama
Journal:  Genes Genet Syst       Date:  2000-02       Impact factor: 1.517

8.  The mitochondrial genome of the onychophoran Opisthopatus cinctipes (Peripatopsidae) reflects the ancestral mitochondrial gene arrangement of Panarthropoda and Ecdysozoa.

Authors:  Anke Braband; Stephen L Cameron; Lars Podsiadlowski; Savel R Daniels; Georg Mayer
Journal:  Mol Phylogenet Evol       Date:  2010-05-20       Impact factor: 4.286

9.  The mitochondrial genome of the Cinnamon Bittern, Ixobrychus cinnamomeus (Pelecaniformes: Ardeidae): sequence, structure and phylogenetic analysis.

Authors:  Liqin Zhang; Li Wang; Vinita Gowda; Ming Wang; Xifeng Li; Xianzhao Kan
Journal:  Mol Biol Rep       Date:  2012-06-15       Impact factor: 2.316

10.  Site specific rates of mitochondrial genomes and the phylogeny of eutheria.

Authors:  Karl M Kjer; Rodney L Honeycutt
Journal:  BMC Evol Biol       Date:  2007-01-25       Impact factor: 3.260

View more
  2 in total

1.  Two mitochondrial genomes in Alcedinidae (Ceryle rudis/Halcyon pileata) and the phylogenetic placement of Coraciiformes.

Authors:  Xiaomin Sun; Ruoping Zhao; Ting Zhang; Jie Gong; Meidong Jing; Ling Huang
Journal:  Genetica       Date:  2017-08-08       Impact factor: 1.082

2.  Turdoides affinis mitogenome reveals the translational efficiency and importance of NADH dehydrogenase complex-I in the Leiothrichidae family.

Authors:  Indrani Sarkar; Prateek Dey; Sanjeev Kumar Sharma; Swapna Devi Ray; Venkata Hanumat Sastry Kochiganti; Renu Singh; Padmanabhan Pramod; Ram Pratap Singh
Journal:  Sci Rep       Date:  2020-10-01       Impact factor: 4.379

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.