| Literature DB >> 26459858 |
Mingyong Tang1, Zhiwen Chen2, Corrinne E Grover3, Yumei Wang4, Shuangshuang Li5, Guozheng Liu6, Zhiying Ma7, Jonathan F Wendel8, Jinping Hua9.
Abstract
BACKGROUND: The mitochondrial genome from upland cotton, G. hirsutum, was previously sequenced. To elucidate the evolution of mitochondrial genomic diversity within a single genus, we sequenced the mitochondrial genome from Sea Island cotton (Gossypium barbadense L.).Entities:
Mesh:
Substances:
Year: 2015 PMID: 26459858 PMCID: PMC4603758 DOI: 10.1186/s12864-015-1988-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Genome map of Gossypium barbadense mitochondrial genome. The map shows both the gene map (outer circle) and repeat map (inner map). Genes exhibited on the inside of outer circle are transcribed in a clockwise direction, while genes on the outside of outer circle are transcribed in a reverse direction. The inner circle reveals the distribution of repeats in G. barbadense mt genome with curved lines and ribbons connecting pairs of repeats and width proportional to repeat size. The red ribbons represent > = 1 Kb repeats and the blue lines represent repeats between 100 bp to 1 Kb. The numbers give genome coordinates in kilobases
General features of mitochondrial genomes of G. barbadense and G. hirsutum
| Genome features |
|
|
|---|---|---|
| Total genome size (bp) | 677,434 | 621,884 |
| GC content (%) | 44.98 | 44.95 |
| Total repeated sequences (bp)a | 236,070 | 132,305 |
| Percentage of genome (%) | (34.85) | (21.27) |
|
| 12,118 | 10,013 |
|
| 5.778 | 5,583 |
| unclassified LTR-retrotransposon | 11,954 | 9,662 |
| unclassified retroelement | 3,011 | 2,591 |
|
| 1,076 | 1,590 |
| unclassified transposable element | 6,928 | 5,791 |
| unclassified | 195,205 | 97,075 |
| Chloroplast-like sequences (bp) | 5,383 | 6,833 |
| Percentage (%) | (0.80) | (1.10) |
| Coding sequences (bp)b | 36,365 | 31,721 |
| Percentage (%) | (5.37) | (5.10) |
| Gene contentb | 75 | 68 |
| Protein-coding genesc | 40 | 36 |
| rRNA | 6 | 4 |
| tRNA | 29 | 28 |
aRepeats > 100 bp
bAll copies of duplicated genes but not pseudogenes are included
cThe extra protein-coding genes in G. barbadense are atp1, mttB, nad4 and nad9
Genes identified in the G. barbadense mitochondrial genome
| 1. Complex I genes | 6. Cytochrome c biogenesis genes | 11. rRNA genes |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7. Ribosomal proteins, LSU |
|
|
|
|
|
|
|
|
|
| 12. tRNA-coding genes |
|
|
|
|
|
|
|
|
|
|
|
|
| 8. Ribosomal proteins, SSU |
|
|
|
|
|
| |
| 2. Complex II genes |
|
| |
|
|
|
| |
|
|
|
| |
| 3. Complex III gene |
|
| |
|
|
|
| |
| 4. Complex IV genes |
|
| |
|
| 9. Maturase gene |
| |
|
|
|
| |
|
| 10. protein translocation system subunit gene |
| |
| 5. Complex V genes |
|
| |
|
|
|
| |
|
|
| ||
|
|
| ||
|
|
| ||
|
|
| ||
|
|
|
A total of 40 protein-coding genes, 6 rRNA-coding genes and 29 tRNA-coding genes were identified (excluding pseudogenes), in addition to 3 pseudogenes and 10 chloroplast-derived sequences. Genes present in duplicate are denoted with a hyphenated number (e.g., −1 or −2). a represents the second gene copy to G. barbadense
Frequency distribution of repeat lengths in the mitogenomes of G. hirsutum and G. barbadense
| Size (bp) | Number | Total length of repeats (bp) | Coverage (%) | |||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| 20–39 | 192 | 55 | 10,747 | 4,281 | 1.7 | 0.6 |
| 40–59 | 69 | 83 | 9,667 | 10,139 | 1.6 | 1.5 |
| 60–79 | 35 | 29 | 9,567 | 5,183 | 1.5 | 0.8 |
| 80–99 | 11 | 12 | 8,365 | 2,194 | 1.3 | 0.3 |
| 100–1,000 | 32 | 24 | 18,368 | 12,670 | 3.0 | 1.9 |
| >10 K | 4 | 4 | 117,300 | 223,400 | 18.9 | 33.0 |
Fig. 2Dot matrix analyses between G. barbadense and G. hirsutum, C. papaya, A. thaliana (individually) by whole genomic alignment. The blue and red lines refer inverted and direct syntenic regions, respectively
Fig. 3Syntenic blocks larger than 10 Kb between G. barbadense and G. hirsutum with curved ribbons connecting pairs of syntenic blocks and width proportional to blocks size. The numbers give genome coordinates in kilobases
Fig. 4Distribution of Ks values between two cotton mitochondrial genomes and C. papaya
dS and dN/dS values of 35 genes between two cotton mitochondrial genomes and C. papaya
| Genes | dS | dS | dN/dS | dN/dS |
|---|---|---|---|---|
|
|
|
|
| |
|
| 0.0311 | 0.0273 | 0.4950 | 0.5641 |
|
| 0.0222 | 0.0222 | 0.2589 | 0.2589 |
|
| 0.0671 | 0.0671 | 0.5962 | 0.6574 |
|
| 0.0320 | 0.0320 | 0.3857 | 0.3563 |
|
| 0.0960 | 0.0960 | 0.2296 | 0.2296 |
|
| 0.0177 | 0.0177 | 0.8689 | 0.8689 |
|
| 0.0087 | 0.0087 | 3.1102 | 3.1102 |
|
| 0.0148 | 0.0148 | 0.9031 | 0.9031 |
|
| 0.0673 | 0.0673 | 0.2110 | 0.2110 |
|
| 0.0648 | 0.0648 | 0.2121 | 0.2121 |
|
| 0.0527 | 0.0527 | 0.1506 | 0.1506 |
|
| 0.0528 | 0.0472 | 0.2680 | 0.2993 |
|
| 0.0451 | 0.0453 | 0.3832 | 0.3039 |
|
| 0.0663 | 0.0663 | 0.2157 | 0.2157 |
|
| 0.0555 | 0.0555 | 0.8377 | 0.8377 |
|
| 0.0919 | 0.0919 | 0.6743 | 0.6743 |
|
| 0.0649 | 0.0649 | 0.8732 | 0.8732 |
|
| 0.0570 | 0.0570 | 0.4892 | 0.4892 |
|
| 0.0162 | 0.0162 | 1.4726 | 1.4726 |
|
| 0.0659 | 0.0659 | 0.4163 | 0.4163 |
|
| 0.0529 | 0.0501 | 0.5313 | 0.6058 |
|
| 0.0306 | 0.0306 | 1.1463 | 1.1463 |
|
| 0.1099 | 0.1099 | 1.0637 | 1.0637 |
|
| 0.0513 | 0.0513 | 1.1205 | 1.1205 |
|
| 0.0176 | 0.0176 | 1.5716 | 1.5716 |
|
| 0.0362 | 0.0362 | 0.5901 | 0.5901 |
|
| 0.0398 | 0.0398 | 0.8391 | 0.8391 |
|
| 0.0828 | 0.0828 | 0.6046 | 0.6046 |
|
| 0.0752 | 0.0752 | 0.1628 | 0.1628 |
|
| 0.0814 | 0.0814 | 0.5694 | 0.5694 |
|
| 0.0718 | 0.0718 | 0.7649 | 0.7649 |
|
| 0.0367 | 0.0367 | 0.9328 | 0.9328 |
|
| 0.0216 | 0.0216 | 0.3991 | 0.3991 |
|
| 0.0851 | 0.0851 | 0.2572 | 0.2572 |
|
| 0.0134 | 0.0134 | 0.3387 | 0.3387 |
Pseudogenes in 41 mitochondrial genomes sequenced
| No. | Species | Pseudogenes |
|---|---|---|
| 1 |
|
|
| 2 |
|
|
| 3 |
|
|
| 4 |
|
|
| 5 |
|
|
| 6 |
|
|
| 7 |
|
|
| 8 |
|
|
| 9 |
|
|
| 10 |
|
|
| 11 |
|
|
| 12 |
|
|
| 13 |
|
|
| 14 |
|
|
| 15 |
|
|
| 16 |
|
|
| 17 |
|
|
| 18 |
|
|
| 19 |
|
|
| 20 |
|
|
| 21 |
|
|
| 22 |
|
|
| 23 |
|
|
| 24 |
|
|
| 25 |
|
|
| 26 |
|
|
| 27 |
|
|
| 28 |
|
|
| 29 |
|
|
| 30 |
|
|
| 31 |
|
|
| 32 |
|
|
| 33 |
|
|
| 34 |
|
|
| 35 |
|
|
| 36 |
|
|
| 37 |
|
|
| 38 |
|
|
| 39 |
|
|
| 40 |
|
|
| 41 |
|
|
apossible pseudogenes, (2) two copies of pseudogenes, cp pseudogenes of chloroplast origin and Gossypium species are given in bold
Fig. 5Distribution of pseudogenes in 41 mitochondrial genomes sequenced of land plants. The scale of pseudogene number is depicted on the side, and ranges from 0 copies to 2 copies, with 0.5 representing possible pseudogenes. The numbers 1–41 on the x-axis correspond to the number given to each mitochondrial genome (see Table 5)
Fig. 6The putative origin mechanism of rps3-2 and R2. a A structural comparison of rps3 and rps3-2 is shown in the top panel. Exons are depicted as black bars, introns as straight lines, and the striped box indicates the exonic sequence lost in rps3-2. Locations for each of the exons are given in parentheses b The lower panel illustrates the possible formation mechanism of rps3-2 and R2. A2: Chr. 5 represents chromosome 5 in the G. arboreum genome and MT represents mitochondrial genome of G. barbadense, with the top MT graph indicating the arrangement before transferring and the bottom indicating the arrangement after intracellular transferring. The red rectangle and red bar indicate the transferred sequences from the mitochondrial genome to the nuclear genome, respectively. The blue bar represents the flanking sequences transferred along with the latter half of exon 2, depicted again as a striped box. Included in these graphs are the bordering regions between rp3 and R2′ (28,235 bp) and between rps3-2 and R2 (26,936 bp)
Fig. 7Alignment of rp3 from the G. barbadense mitochondrial genome and the corresponding transfer found in the G. arboreum nuclear genome. Here, 1–755 bp in the alignment represents the 5′ flanking regions of rps3, and 756–1,298 bp consists of exon2-2 from the intact and transferred rps3 copy, respectively
Fig. 8The loss of tRNAs in 37 plant mitochondrial genomes. The phylogenetic tree was constructed based on nucleotide sequences of 17 mitochondrial genes including nad1, nad2, nad3, nad4, nad4L, nad5, nad6, nad9, cob, cox1, cox2, cox3, atp1, atp4, atp6, atp8 and atp9 using maximum likelihood (ML) method with the model GTR + G + I in MEGA5.05 [59]. The amino acids at the nodes represent the corresponding tRNA gene losses from mitochondrial genomes, and the arrow stands for tRNA loss in S. latifolia