| Literature DB >> 27827992 |
Shi-Chun Chen1, Xiao-Qing Wang2, Pin-Wu Li3, Xiang Hu4, Jin-Jun Wang5, Ping Peng6.
Abstract
There are numerous gene rearrangements and transfer RNA gene absences existing in mitochondrial (mt) genomes of Aleyrodidae species. To understand how mt genomes evolved in the family Aleyrodidae, we have sequenced the complete mt genome of Aleurocanthus camelliae and comparatively analyzed all reported whitefly mt genomes. The mt genome of A. camelliae is 15,188 bp long, and consists of 13 protein-coding genes, two rRNA genes, 21 tRNA genes and a putative control region (GenBank: KU761949). The tRNA gene, trnI, has not been observed in this genome. The mt genome has a unique gene order and shares most gene boundaries with Tetraleurodes acaciae. Nineteen of 21 tRNA genes have the conventional cloverleaf shaped secondary structure and two (trnS₁ and trnS₂) lack the dihydrouridine (DHU) arm. Using ARWEN and homologous sequence alignment, we have identified five tRNA genes and revised the annotation for three whitefly mt genomes. This result suggests that most absent genes exist in the genomes and have not been identified, due to be lack of technology and inference sequence. The phylogenetic relationships among 11 whiteflies and Drosophila melanogaster were inferred by maximum likelihood and Bayesian inference methods. Aleurocanthus camelliae and T. acaciae form a sister group, and all three Bemisia tabaci and two Bemisia afer strains gather together. These results are identical to the relationships inferred from gene order. We inferred that gene rearrangement plays an important role in the mt genome evolved from whiteflies.Entities:
Keywords: Aleurocanthus camelliae; genome evolution; mitochondrial genome; rearrangement; whitefly
Mesh:
Substances:
Year: 2016 PMID: 27827992 PMCID: PMC5133843 DOI: 10.3390/ijms17111843
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1The mitochondrial genomes of Aleurocanthus camelliae. The transcriptional orientation is indicated with arrows. Protein-coding genes, ribosomal RNA genes and transfer RNA genes are shown in bright colors. All of them in the map follow standard abbreviations. tRNA genes for the two serine and two leucine tRNAs: S1 = AGN, S2 = UCN, L1 = CUN and L2 = UUR. The putative control region is indicated in black.
Annotation and gene organization of the mitochondrial genome of Aleurocanthus camelliae.
| Gene a | Region | Size | Inc b | AT% | AT-Skew c | GC-Skew | Start Codon | Stop Codon | Anticodon |
|---|---|---|---|---|---|---|---|---|---|
|
| 1–1539 | 1539 | 1 | 64.39 | −0.322 | 0.263 | ATG | TAA | - |
|
| 1542–1606 | 65 | 2 | 70.77 | −0.087 | 0.474 | - | - | TAA |
|
| 1607–2270 | 664 | 0 | 68.67 | −0.197 | 0.202 | ATT | T- e | - |
|
| 2271–2340 | 70 | 0 | 71.43 | 0.000 | −0.100 | - | - | CTT |
|
| 2347–2408 | 62 | 6 | 66.13 | −0.171 | 0.333 | - | - | GTC |
|
| 2418–2564 | 147 | 9 | 62.59 | −0.217 | 0.273 | ATA | TAA | - |
|
| 2558–3208 | 651 | −7 | 71.27 | −0.388 | 0.241 | ATT | TAG | - |
|
| 3233–3293 | 61 | 24 | 72.13 | 0.000 | 0.059 | - | - | TTC |
|
| 3301–3363 | 63 | 7 | 76.19 | 0.292 | 0.067 | - | - | GAA |
|
| 3364–5023 | 1660 | 0 | 73.13 | 0.137 | −0.166 | ATA | T- | - |
|
| 5024–5084 | 61 | 0 | 81.97 | 0.040 | 0.455 | - | - | GTG |
|
| 5086–6366 | 1281 | 1 | 69.95 | −0.007 | −0.179 | ATA | TAG | - |
|
| 6363–6659 | 297 | −4 | 76.77 | −0.018 | −0.246 | ATG | TAA | - |
|
| 6661–6724 | 64 | 1 | 76.56 | −0.102 | 0.333 | - | - | TGT |
|
| 6723–6784 | 62 | −2 | 72.58 | 0.111 | −0.059 | - | - | TGG |
|
| 6816–7250 | 435 | 31 | 70.57 | −0.270 | 0.328 | ATA | TAA | - |
|
| 7250–8383 | 1134 | −1 | 64.90 | −0.351 | 0.186 | ATG | TAA | - |
|
| 8384–8439 | 56 | 0 | 80.36 | −0.067 | 0.273 | - | - | TGA |
|
| 8450–9359 | 910 | 10 | 72.86 | 0.005 | −0.198 | ATT | T- | - |
|
| 9360–9422 | 63 | 0 | 84.13 | 0.132 | 0.200 | - | - | TAG |
|
| 9423–10612 | 1190 | 0 | 77.23 | 0.134 | −0.055 | - | - | - |
|
| 10707–11435 | 729 | 94 | 75.99 | 0.022 | 0.074 | - | - | - |
|
| 11483–11548 | 66 | 47 | 78.79 | 0.115 | 0.143 | - | - | ATT |
|
| 11553–11623 | 71 | 4 | 87.32 | 0.000 | −0.111 | - | - | TTG |
|
| 11617–11680 | 64 | −7 | 73.44 | −0.021 | 0.059 | - | - | TAC |
|
| 11678–11742 | 65 | −3 | 73.85 | −0.042 | 0.059 | - | - | TCG |
|
| 11744–11805 | 62 | 1 | 75.81 | −0.021 | 0.200 | - | - | TGC |
|
| 11806–12159 | 354 | 0 | 72.60 | −0.276 | 0.052 | ATT | TAA | - |
|
| 12161–12222 | 63 | 1 | 82.54 | −0.038 | 0.273 | - | - | TCC |
|
| 12223–13009 | 786 | 0 | 66.92 | −0.198 | 0.115 | ATG | TAA | - |
|
| 13025–13086 | 62 | 15 | 61.29 | −0.263 | 0.083 | - | - | TCT |
| CR d | 13087–13961 | 875 | 0 | 66.86 | −0.050 | 0.117 | - | - | - |
|
| 13962–14028 | 67 | 0 | 77.61 | 0.077 | −0.067 | - | - | CAT |
|
| 14029–15003 | 975 | 0 | 68.72 | −0.287 | 0.279 | ATA | TAA | - |
|
| 15002–15064 | 63 | −2 | 82.54 | 0.077 | 0.091 | - | - | TCA |
|
| 15063–15124 | 60 | −2 | 86.67 | 0.154 | 0.500 | - | - | GTA |
|
| 15125–15187 | 63 | 0 | 92.06 | 0.138 | 0.200 | - | - | GCA |
a Genes located in the different strand from that of cox1 are underlined; b inc = intergenic nucleotides, indicating gap nucleotides (positive value) and overlapping nucleotides (negative value) of two adjacent genes; c AT-skew = (A − T)/(A + T), GC-skew = (G − C)/(G + C); d CR = control region (putative); e “T-” indicates the protein-coding gene terminate translation with single nucleotide “T”.
Nucleotide composition of the mitochondrial genome of Aleurocanthus camelliae.
| Region | A% | C% | G% | T% | AT% | AT-Skew | GC-Skew |
|---|---|---|---|---|---|---|---|
| Full length | 31.02 | 12.38 | 16.72 | 39.87 | 70.90 | −0.125 | 0.149 |
| PCGs | 29.27 | 14.12 | 16.64 | 39.96 | 69.49 | −0.154 | 0.082 |
| 1st codon | 35.12 | 13.20 | 20.62 | 31.05 | 66.37 | 0.061 | 0.219 |
| 2nd codon | 18.81 | 17.70 | 15.12 | 48.37 | 68.08 | −0.440 | −0.078 |
| 3rd codon | 33.88 | 11.47 | 14.18 | 40.47 | 74.02 | −0.089 | 0.106 |
| tRNA genes | 39.51 | 9.75 | 13.04 | 37.71 | 77.19 | 0.023 | 0.145 |
| rRNA genes | 41.90 | 11.67 | 11.57 | 34.86 | 76.61 | 0.092 | −0.004 |
Codon usage for the 13 mitochondrial protein-coding genes of Aleurocanthus camelliae.
| Codon | N | % | RSCU | Codon | N | % | RSCU | Codon | N | % | RSCU |
|---|---|---|---|---|---|---|---|---|---|---|---|
| UAA (*) | 8 | 0.22 | 1.60 | AAA (K) | 79 | 2.19 | 1.30 | AGA (S) | 76 | 2.11 | 1.63 |
| UAG (*) | 2 | 0.06 | 0.40 | AAG (K) | 43 | 1.19 | 0.71 | AGC (S) | 23 | 0.64 | 0.49 |
| GCA (A) | 62 | 1.72 | 1.56 | CUA (L) | 91 | 2.52 | 1.54 | AGG (S) | 22 | 0.61 | 0.47 |
| GCC (A) | 23 | 0.64 | 0.58 | CUC (L) | 16 | 0.44 | 0.27 | AGU (S) | 55 | 1.52 | 1.18 |
| GCG (A) | 14 | 0.39 | 0.35 | CUG (L) | 42 | 1.16 | 0.71 | UCA (S) | 83 | 2.30 | 1.78 |
| GCU (A) | 60 | 1.66 | 1.51 | CUU (L) | 87 | 2.41 | 1.48 | UCC (S) | 18 | 0.50 | 0.39 |
| UGC (C) | 17 | 0.47 | 0.74 | UUA (L) | 178 | 4.93 | 1.33 | UCG (S) | 17 | 0.47 | 0.36 |
| UGU (C) | 29 | 0.80 | 1.26 | UUG (L) | 90 | 2.49 | 0.67 | UCU (S) | 80 | 2.22 | 1.71 |
| GAC (D) | 22 | 0.61 | 0.70 | AUA (M) | 225 | 6.23 | 1.54 | ACA (T) | 76 | 2.11 | 1.62 |
| GAU (D) | 41 | 1.14 | 1.30 | AUG (M) | 67 | 1.86 | 0.46 | ACC (T) | 19 | 0.53 | 0.40 |
| GAA (E) | 57 | 1.58 | 1.37 | AAC (N) | 48 | 1.33 | 0.63 | ACG (T) | 6 | 0.17 | 0.13 |
| GAG (E) | 26 | 0.72 | 0.63 | AAU (N) | 104 | 2.88 | 1.37 | ACU (T) | 87 | 2.41 | 1.85 |
| UUC (F) | 46 | 1.27 | 0.26 | CCA (P) | 26 | 0.72 | 1.11 | GUA (V) | 85 | 2.35 | 1.30 |
| UUU (F) | 304 | 8.42 | 1.74 | CCC (P) | 18 | 0.50 | 0.77 | GUC (V) | 21 | 0.58 | 0.32 |
| GGA (G) | 62 | 1.72 | 1.39 | CCG (P) | 13 | 0.36 | 0.55 | GUG (V) | 49 | 1.36 | 0.75 |
| GGC (G) | 20 | 0.55 | 0.45 | CCU (P) | 37 | 1.02 | 1.57 | GUU (V) | 106 | 2.94 | 1.63 |
| GGG (G) | 55 | 1.52 | 1.23 | CAA (Q) | 29 | 0.80 | 1.14 | UGA (W) | 68 | 1.88 | 1.36 |
| GGU (G) | 42 | 1.16 | 0.94 | CAG (Q) | 22 | 0.61 | 0.86 | UGG (W) | 32 | 0.89 | 0.64 |
| CAC (H) | 22 | 0.61 | 0.86 | CGA (R) | 18 | 0.50 | 1.60 | UAC (Y) | 48 | 1.33 | 0.65 |
| CAU (H) | 29 | 0.80 | 1.14 | CGC (R) | 5 | 0.14 | 0.44 | UAU (Y) | 99 | 2.74 | 1.35 |
| AUC (I) | 48 | 1.33 | 0.28 | CGG (R) | 12 | 0.33 | 1.07 | - | - | - | - |
| AUU (I) | 291 | 8.06 | 1.72 | CGU (R) | 10 | 0.28 | 0.89 | - | - | - | - |
* represents stop codon.
Figure 2Putative secondary structures of the tRNA genes identified in this study. Bars indicate Watson-Crick base pairings, and dots between G and U pairs mark canonical base pairings appearing in RNA. Seven tRNA genes identified in this study of the mitochondrial genomes of Aleurodicus dugesii (Ad), Neomaskellia andropogonis (Na) and Tetraleurodes acaciae (Ta) with species name abbreviation below gene name.
Figure 3Arrangement of mitochondrial genes in the family Aleyrodidae and the hypothetical ancestral insect. Circular genomes have been arbitrarily linearized for ease of comparison. Gene names are the standard abbreviations used in the present study. Transfer RNA genes are designated by the single-letter amino acid codes, and red letters represent the tRNAs identified in this study. Genes which are underlined are encoded on the different strand from that of cox1. White boxes represent transfer RNA genes and bright colors represent 13 protein-coding genes and 2 ribosomal RNA genes.
Figure 4Phylogeny from Aleyrodidae mitochondrial genome sequences. Numbers above the branches show maximum likelihood (ML) bootstrap support values and Bayesian (BI) posterior probabilities for the phylogenies. Only support above 50% is shown. All mitochondrial genomes of clade A contain the rearrangement of cox3-trnG-nad3.
PCR primers for amplification of the mitochondrial genome of Aleurocanthus camelliae.
| Region | Primer Name | Primer Sequence (5′–3′) | References |
|---|---|---|---|
|
| C1-J-2195 | TTGATTTTTTGGTCATCCAGAAGT | [ |
| N5-N7793 | TTAGGTTGRGATGGNYTAGG | [ | |
|
| AC-N5J | CAGCAGTTACTAGAGTTGATGAGT | In this study |
| AC-CBN | TCTCCACAAACAATAAAACAAGC | In this study | |
|
| CBF1 | TATGTACTACCATGAGGACAAATATC | [ |
| AC-16SN | GTCTAAGTCTTAAATAAACTCTGC | In this study | |
|
| 16Sar | CGCCTGTTTAACAAAAACAT | [ |
| 16Sbr | CCGGTCTGAACTCAGATCACGT | [ | |
|
| AC-16SJ | TTAGAGGAACCTGCTTAGTAATC | In this study |
| AC-12SN | GCAGAGTTTATTTAAGACTTAGAC | In this study | |
|
| 12SRNA-F | AAACTAGGATTAGATACCCTATTAT | [ |
| 12SRNA-R | AAGAGCGACGGGCGATGTGT | [ | |
|
| AC-12SJ | CCCTGATACAAAAGGTACAAGTCT | In this study |
| AC-C1N | CCCATAATAGCAAATACAATCCCT | In this study |
GenBank accession numbers and mitochondrial genome sizes of the species included in phylogenetic analysis in this study.
| Species | Genome Size | Acc. Number | References |
|---|---|---|---|
| 15,210 bp | KJ778614 | [ | |
|
| 15,632 bp | JQ906700 | [ |
|
| 15,322 bp | AY521259 | [ |
|
| 15,300 bp | KR819174 | [ |
|
| 14,968 bp | KF734668 | [ |
|
| 15,080 bp | AY521262 | [ |
|
| 14,496 bp | AY572539 | [ |
|
| 15,388 bp | AY572538 | [ |
|
| 15,723 bp | AY521251 | [ |
|
| 18,414 bp | AY521265 | [ |
|
| 19,517 bp | DMU37541 | [ |