| Literature DB >> 26273219 |
Yi Zou1, Mei-Dong Jing1, Xiao-Xin Bi1, Ting Zhang1, Ling Huang1.
Abstract
Many phylogenetic questions in the Ciconiiformes remain unresolved and complete mitogenome data are urgently needed for further molecular investigation. In this work, we determined the complete mitogenome sequence of the little egret (Egretta garzetta). The genome was 17,361 bp in length and the gene organization was typical of other avian mtDNA. In protein-coding genes (PCGs), a C insertion was found in ND3, and COIII and ND4 terminated with incomplete stop codons (T). tRNA-Val and tRNA-Ser (AGY) were unable to fold into canonical cloverleaf secondary structures because they had lost the DHU arms. Long repetitive sequences consisting of five types of tandem repeats were found at the 3' end of Domain III in the control region. A phylogenetic analysis of 11 species of Ciconiiformes was done using complete mitogenome data and 12 PCGs. The tree topologies obtained with these two strategies were identical, which strongly confirmed the monophyly of Ardeidae, Threskiorothidae and Ciconiidae. The phylogenetic analysis also revealed that Egretta was more closely related to Ardea than to Nycticorax in the Ardeidae, and Platalea was more closely related to Threskiornis than to Nipponia in the Threskiornithidae. These findings contribute to our understanding of the phylogenetic relationships of Ciconiiformes based on complete mitogenome data.Entities:
Keywords: Egretta garzetta; mitochondrial genome; phylogenomics
Year: 2015 PMID: 26273219 PMCID: PMC4530654 DOI: 10.1590/S1415-4757382220140203
Source DB: PubMed Journal: Genet Mol Biol ISSN: 1415-4757 Impact factor: 1.771
Figure 1Gene organization of the E. garzetta mitogenome. ND1–6 refers to NADH dehydrogenase subunits 1–6, COI–III refer to cytochrome c oxidase subunits 1–3, ATP6 and ATP8 refer to ATPase subunits 6 and 8, and Cyt b refers to cytochrome b. Twenty-two tRNA genes are designated by single-letter amino acid codes.
Organization of the E. garzetta mitochondrial genome.
| Gene | Position | Size (bp) | Spacer (+)/Overlap (−) | Strand | Codon | ||
|---|---|---|---|---|---|---|---|
|
|
| ||||||
| From | To | Start | Stop | ||||
| tRNA-Phe | 1 | 69 | 69 | 0 | H | ||
| 12s-rRNA | 70 | 1040 | 971 | 0 | H | ||
| tRNA-Val | 1041 | 1111 | 71 | 0 | H | ||
| 16s-rRNA | 1112 | 2718 | 1607 | 0 | H | ||
| tRNA-Leu (UUR) | 2719 | 2792 | 74 | 8 | H | ||
|
| 2801 | 3778 | 978 | 7 | H | ATG | AGA |
| tRNA-Ile | 3786 | 3856 | 71 | 11 | H | ||
| tRNA-Gln | 3868 | 3937 | 70 | 0 | L | ||
| tRNA-Met | 3938 | 4005 | 68 | 0 | H | ||
|
| 4006 | 5044 | 1039 | 0 | H | ATG | TAG |
| tRNA-Trp | 5045 | 5116 | 72 | 2 | H | ||
| tRNA-Ala | 5119 | 5186 | 68 | 10 | L | ||
| tRNA-Asn | 5197 | 5270 | 74 | 3 | L | ||
| tRNA-Cys | 5274 | 5340 | 67 | −1 | L | ||
| tRNA-Tyr | 5340 | 5411 | 72 | 13 | L | ||
|
| 5425 | 6975 | 1551 | −9 | H | GTG | AGG |
| tRNA-Ser (UCN) | 6967 | 7040 | 74 | 2 | L | ||
| tRNA-Asp | 7043 | 7111 | 69 | 1 | H | ||
|
| 7113 | 7796 | 684 | 1 | H | ATG | TAA |
| tRNA-Lys | 7798 | 7867 | 70 | 1 | H | ||
|
| 7869 | 8036 | 168 | −10 | H | ATG | TAA |
|
| 8027 | 8710 | 684 | −1 | H | ATG | TAA |
|
| 8710 | 9493 | 784 | 0 | H | ATG | T |
| tRNA-Gly | 9494 | 9562 | 69 | 0 | H | ||
|
| 9563 | 9914 | 352 | 2 | H | ATT | TAA |
| tRNA-Arg | 9917 | 9985 | 69 | 1 | H | ||
|
| 9987 | 10283 | 297 | −7 | H | ATG | TAA |
|
| 10277 | 11654 | 1378 | 0 | H | ATG | T |
| tRNA-His | 11655 | 11724 | 70 | 0 | H | ||
| tRNA-Ser (AGY) | 11725 | 11792 | 68 | −1 | H | ||
| tRNA-Leu (CUN) | 11792 | 11863 | 72 | 0 | H | ||
|
| 11864 | 13678 | 1815 | 10 | H | ATG | AGA |
|
| 13689 | 14831 | 1143 | 3 | H | ATG | TAA |
| tRNA-Thr | 14835 | 14904 | 70 | 11 | H | ||
| tRNA-Pro | 14916 | 14987 | 72 | 8 | L | ||
|
| 14996 | 15472 | 477 | 3 | L | ATG | AGA |
| tRNA-Glu | 15476 | 15549 | 74 | 0 | L | ||
| Control region | 15550 | 17361 | 1812 | 0 | H | ||
Position numbering starts with the 5′ position of the Control region;
Genes transcribed from the L or H strand;
Start and stop codons of protein-coding genes;
Protein-coding genes overlapping with tRNA genes end with an incomplete stop codon.
Genomic characteristics of 11 avian mtDNAs.
| Species | Heavy-strand | 12 Protein-coding genes | LrRNA gene | SrRNA gene | tRNA gene | Control region | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
| ||||||||||
| Length (bp) | AT% | Length (bp) | AT% (all) | AT% (1st) | AT% (2nd) | AT% (3rd) | Length (bp) | AT% | Length (bp) | AT% | Length (bp) | AT% | Length (bp) | AT% | |
|
| 15,585 | 55.3 | 10,874 | 54.9 | 49.4 | 58.7 | 56.8 | 1,599 | 56.4 | 974 | 53.2 | 1,567 | 58.7 | 1,140 | 56.1 |
|
| 15,784 | 55.4 | 10,875 | 55.0 | 49.4 | 58.7 | 56.6 | 1,599 | 56.2 | 975 | 53.2 | 1,552 | 58.9 | 1,352 | 56.6 |
|
| 15,826 | 55.2 | 10,874 | 54.6 | 50.2 | 57.7 | 55.4 | 1,598 | 55.1 | 973 | 52.2 | 1,567 | 59.1 | 1,382 | 57.8 |
|
| 15,567 | 54.0 | 10,874 | 53.2 | 49.3 | 58.7 | 51.7 | 1,603 | 55.5 | 977 | 52.2 | 1,552 | 57.5 | 1,160 | 59.1 |
|
| 16,354 | 55.4 | 10,875 | 53.4 | 49.8 | 57.7 | 52.0 | 1,606 | 55.4 | 970 | 51.4 | 1,555 | 57.2 | 1,922 | 67.1 |
|
| 16,421 | 55.0 | 10,870 | 53.3 | 48.9 | 57.7 | 53.0 | 1,605 | 54.8 | 971 | 51.6 | 1,550 | 56.3 | 1,997 | 64.7 |
|
| 16,245 | 55.0 | 10,873 | 53.4 | 49.8 | 57.8 | 53.0 | 1,607 | 54.7 | 971 | 50.8 | 1,553 | 56.2 | 1,812 | 65.3 |
|
| 16,027 | 57.1 | 10,873 | 56.1 | 51.0 | 58.8 | 58.5 | 1,591 | 55.9 | 971 | 53.9 | 1,555 | 58.9 | 1,609 | 65.3 |
|
| 16,665 | 56.1 | 10,873 | 54.8 | 50.1 | 58.9 | 55.8 | 1,596 | 56.0 | 973 | 52.1 | 1,551 | 57.0 | 2,244 | 62.8 |
|
| 16,487 | 53.8 | 10,871 | 52.4 | 48.3 | 57.5 | 50.4 | 1,612 | 53.6 | 968 | 51.6 | 1,550 | 57.3 | 2,053 | 60.5 |
|
| 16,212 | 53.8 | 10,874 | 52.6 | 48.3 | 57.5 | 51.5 | 1,608 | 54.0 | 968 | 51.7 | 1,550 | 57.2 | 1,779 | 59.5 |
| Average | 16,107 | 55.1 | 10,873 | 54.0 | 49.5 | 58.2 | 54.1 | 1,602 | 55.2 | 972 | 52.2 | 1,555 | 57.7 | 1,677 | 61.3 |
Base composition for protein-coding genes found in mtDNA of E. garzetta.
| Gene | Length (bp) | Proportion of nucleotides (%) | AT Skew | GC Skew | ||||
|---|---|---|---|---|---|---|---|---|
|
| ||||||||
| A | C | G | T | A+T | ||||
| ND1 | 978 | 26.38 | 34.46 | 12.68 | 26.48 | 52.86 | −0.002 | −0.462 |
| ND2 | 1039 | 32.63 | 33.59 | 10.11 | 23.68 | 56.31 | 0.159 | −0.537 |
| COX1 | 1551 | 28.24 | 30.11 | 16.38 | 25.27 | 53.51 | 0.056 | −0.295 |
| COX2 | 684 | 31.43 | 31.43 | 14.18 | 22.95 | 54.38 | 0.156 | −0.378 |
| ATP8 | 168 | 32.14 | 38.69 | 5.95 | 23.21 | 55.35 | 0.161 | −0.733 |
| ATP6 | 684 | 30.12 | 36.84 | 9.94 | 23.10 | 53.22 | 0.132 | −0.575 |
| COX3 | 784 | 28.57 | 31.76 | 15.43 | 24.23 | 52.80 | 0.082 | −0.346 |
| ND3 | 352 | 26.70 | 36.08 | 11.36 | 25.85 | 52.55 | 0.016 | −0.521 |
| ND4L | 297 | 29.97 | 35.35 | 11.45 | 23.23 | 53.20 | 0.127 | −0.511 |
| ND4 | 1378 | 31.49 | 36.21 | 9.65 | 22.64 | 54.13 | 0.163 | −0.579 |
| ND5 | 1815 | 31.90 | 35.43 | 10.85 | 21.82 | 53.72 | 0.188 | −0.531 |
| CYTB | 1143 | 27.47 | 37.10 | 12.60 | 22.83 | 50.30 | 0.092 | −0.493 |
| ND6 | 477 | 37.53 | 41.93 | 10.06 | 10.48 | 48.01 | 0.563 | −0.613 |
| Average | 30.35 | 35.31 | 11.59 | 22.75 | 53.10 | 0.146 | −0.506 | |
Nucleotide compositon of the 13 protein-coding genes.
| Species | 1st codon position | 2nd codon position | 3rd codon potion | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| ||||||||||
| A% | G% | C% | T% | A% | G% | C% | T% | A% | G% | C% | T% | |
|
| 29.7 | 20.2 | 30.0 | 20.1 | 20.1 | 12.3 | 29.3 | 38.3 | 41.4 | 3.8 | 40.4 | 14.4 |
|
| 29.7 | 20.2 | 29.9 | 20.2 | 20.0 | 12.4 | 29.2 | 38.4 | 41.3 | 4.0 | 40.0 | 14.7 |
|
| 29.5 | 20.4 | 29.7 | 20.3 | 20.0 | 12.4 | 29.6 | 38.0 | 41.0 | 4.1 | 40.8 | 14.1 |
|
| 29.5 | 20.4 | 30.7 | 19.4 | 20.0 | 12.4 | 29.3 | 38.3 | 39.2 | 5.4 | 43.3 | 12.1 |
|
| 30.1 | 20.1 | 30.4 | 19.4 | 20.1 | 12.2 | 29.7 | 38.0 | 40.6 | 3.9 | 44.1 | 11.4 |
|
| 30.3 | 20.0 | 30.6 | 19.1 | 20.0 | 12.4 | 29.7 | 37.9 | 40.6 | 4.0 | 43.9 | 11.5 |
|
| 30.0 | 20.1 | 30.5 | 19.4 | 20.1 | 12.2 | 29.7 | 38.0 | 40.5 | 4.0 | 43.7 | 11.8 |
|
| 30.8 | 19.4 | 28.8 | 21.0 | 20.1 | 12.3 | 29.4 | 38.2 | 43.2 | 2.4 | 39.6 | 14.8 |
|
| 30.5 | 20.0 | 30.0 | 19.5 | 20.2 | 12.3 | 29.3 | 38.2 | 40.4 | 4.6 | 39.9 | 15.1 |
|
| 29.7 | 20.5 | 31.1 | 18.7 | 19.8 | 12.4 | 29.8 | 38.0 | 38.8 | 4.6 | 45.1 | 11.5 |
|
| 29.7 | 20.5 | 31.1 | 18.7 | 19.7 | 12.5 | 29.8 | 38.0 | 39.1 | 4.4 | 44.5 | 12.0 |
| Range | 1.3 | 1.1 | 2.3 | 2.3 | 0.5 | 0.3 | 0.6 | 0.5 | 4.4 | 3.0 | 5.5 | 3.7 |
| Average | 30.0 | 20.2 | 30.3 | 19.6 | 20.0 | 12.3 | 29.5 | 38.1 | 40.6 | 4.1 | 42.3 | 13.0 |
Figure 2Schematic representation of the control region in the mitogenome of E. garzetta. The first box represents the extended termination-associated sequences (ETAS1 and ETAS2). Boxes F, E, D and C represent the conserved sequence boxes in the central domain. CSB – conserved sequence block, CSB-like – a sequence similar to CSB, LSP and HSP – light-strand and heavy-strand transcription promoters, respectively, and Rs – tandem repeats in the control region.
Figure 3Bayesian tree based on the complete mitochondrial genome data and 13 PCGs with the GIR+I+G model. The horizontal length of each branch corresponds to the substitution rates estimated with the model. Anas platyrhynchos and Branta canadensis were used as outgroups. Numbers on the branches are the bootstrap values for Bayesian posterior probability.