| Literature DB >> 23572386 |
David Roy Smith1, Patrick J Keeling.
Abstract
Recently, it was shown that gene conversion between the ends of linear mitochondrial chromosomes can cause telomere expansion and the duplication of subtelomeric loci. However, it is not yet known how widespread this phenomenon is and how significantly it has impacted organelle genome architecture. Using linear mitochondrial DNAs and mitochondrial plasmids from diverse eukaryotes, we argue that telomeric recombination has played a major role in fashioning linear organelle chromosomes. We find that mitochondrial telomeres frequently expand into subtelomeric regions, resulting in gene duplications, homogenizations, and/or fragmentations. We suggest that these features are a product of subtelomeric gene conversion, provide a hypothetical model for this process, and employ genetic diversity data to support the idea that the greater the effective population size the greater the potential for gene conversion between subtelomeric loci.Entities:
Keywords: gene duplication; inverted repeat; mitochondrial DNA; nucleotide diversity; plasmid; telomere
Mesh:
Substances:
Year: 2013 PMID: 23572386 PMCID: PMC3673629 DOI: 10.1093/gbe/evt059
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Architecture of Linear Organelle Genomes with Palindromic Telomeres
| Species | Genome Size (kb) | Proteins | Telomere Size (kb) | Subtelomeric Gene Duplication and/or Homogenization | GenBank Accession |
|---|---|---|---|---|---|
| Mitochondrial DNA | |||||
| Apicomplexans | |||||
| | 5.9 | 3 | 0.13 | Yes | AB499085 |
| | 6 | 3 | 0.16 | No | NC_009902 |
| | 5.8 | 3 | 0.06 | No | AB499086 |
| | 5.9 | 3 | 0.08 | Yes | AB499087 |
| | 5.9 | 3 | 0.08 | No | NT_167255 |
| | 8.2 | 3 | 1.6 | Yes | AB499091 |
| | 6.0 | 3 | 0.05 | No | AB499090 |
| | 5.9 | 3 | 0.1 | Yes | AB499089 |
| Ciliates | |||||
| | 51.7 | >30 | 6.3 | Yes | NC_015981 |
| | 69.8 | >30 | 1.5–2.5 | Yes | JN383843 |
| | 47.7 | >30 | 2.9 | Yes | NC_008337 |
| | 47.5 | >30 | 3.3 | Yes | NC_008338 |
| | 47 | >30 | 2.9 | Yes | NC_008339 |
| | 47.3 | >30 | 2.9 | Yes | NC_000862 |
| | 47.6 | >30 | 2.8 | Yes | AF396436 |
| Cnidarians | |||||
| | 29 | 15 | 0.8–1.1 | Yes | JN642329–44 |
| | 16.9 | 15 | 0.42 | Yes | NC_008446 |
| | 17 | 15 | 0.42 | Yes | HQ694729 |
| | ∼17 | 15 | >0.3 | Yes | HQ694730 |
| | ∼17 | 13 | >0.2 | Yes | NC_016465 |
| | 15.9 | 13 | 0.2–0.4 | Yes | NC_011220–1 |
| | 16.3 | 13 | 1.5 | Yes | NC_010214 |
| | ∼16 | 13 | >0.3 | Yes | NC_016463 |
| Fungi | |||||
| | 32.8 | 14 | 1.9 | No | DQ376035 |
| | 29.8 | 15 | 0.7 | No | NC_014337 |
| | 39.2 | 14 | 5 | Yes | EF536359 |
| | 29.6 | 14 | 1.2 | No | AF402142 |
| | 22.9 | 14 | 0.3–1.2 | No | NC_013660 |
| Green algae | |||||
| | 15.8 | 8 | 0.5 | No | EU306622 |
| | 13 | 7 | 0.9 | No | EF645804 |
| | 16.2 | 7 | 1.3 | No | AY062933–4 |
| | 16.1 | 7 | 1.3 | No | GU108480–1 |
| Heterokonts | |||||
| | 41 | 33 | 2.2 | Yes | NC_002571 |
| | 48.7 | 57 | 15.6 | Yes | NC_014338 |
| Mitochondrial plasmid (plasmid name) | |||||
| Amoebozoan | |||||
| | 14.5 | ≥2 | 1.5–1.8 | Yes | D29637 |
| Fungi | |||||
| | 5.1 | 1 | 0.6 | No | X15982 |
| | 8 | 2 | 0.7 | Yes | NC_004935 |
| | 6.8 | 2 | 0.3 | Yes | X15648 |
| | 10.3 | 2 | 0.4 | Yes | NC_010425 |
| | 8.2 | 2 | 1.1 | Yes | L40494 |
| | 11.5 | 2 | 0.05 | No | NC_015334 |
| | 6 | 1 | 0.7 | No | X63909 |
| | 7.1 | 2 | 0.4 | Yes | X55361 |
| | 7.1 | 2 | 0.4 | Yes | NC_000843 |
| | 8.6 | 2 | 1.4 | Yes | X52106 |
| | 7.2 | 2 | 0.5 | Yes | Y11606 |
| | 9.9 | 2 | 0.4 | No | AF126285 |
| | 8.4 | 2 | 1 | Yes | X60707 |
| Land plants | |||||
| | 10.4 | ≥2 | 0.4 | No | Y10854 |
| | 11.6 | ≥2 | 0.3 | Yes | AB073400 |
| | 6.4 | ≥1 | 0.2–1.5 | Yes | X02451 |
| | 5.5 | ≥1 | 0.2–1.5 | Yes | J01426 |
aNumber of protein-coding genes based on GenBank accession. Number may change as annotations improve. We tried to ignore intronic and hypothetical ORFs, but for some species, particularly ciliates, it was difficult to distinguish them from standard ORFs. Duplicate genes were counted only once.
bTelomere size includes genes and gene fragments that have been incorporated into the palindromic repeat. Values should be considered approximations because in many cases the extreme ends of the genome are not yet sequenced.
cSubtelomeric gene duplication is when a protein-, rRNA-, or tRNA-coding sequence is completely or partially incorporated into the terminal palindromic repeats (telomeres) and thus present twice (or more) within the genome: once at each end of a chromosome or chromosomes. Subtelomeric gene homogenization is when a coding region located within, or close to, the terminal palindromic repeats spreads its sequence to other coding or noncoding regions adjacent to the telomeres. See figure 1 for examples. Homogenization events ≥ 5 nt were considered. The genes involved in subtelomeric duplication and/or homogenization and the length of the homogenization tracts are listed in supplementary table S1, Supplementary Material online.
dThe mitochondrial genomes from these species are fragmented into more than one chromosome: Alatina moseri (8), H. magnipapillata (2), P. parva (2), and P. piriformis (2). Genome size is based on the concatenation of all chromosomes.
FExamples of subtelomeric gene duplication and homogenization. Linear mitochondrial genomes and plasmids typically have palindromic telomeres (charcoal), which often contain coding regions (gold). In certain cases, a gene will overlap the telomeric and nontelomeric regions (e.g., cox1 from the Hydra oligactis mtDNA), causing one section of the gene (ψ) to be duplicated at each end of the chromosome (gold) and another section to be present only once (dark orange). Occasionally, segments of two different genes when they each border a telomere can become homogenized, as observed for the N-termini of the DNApol and RNApol genes from the Pichia kluyveri mitochondrial plasmid. See table 1 for more examples of subtelomeric gene duplication and homogenization within linear organelle DNAs. Note: genomes and telomeres are not to scale. Genes within the genomic core (light gray) are not shown. Chromosome lengths (in kilobases) and plasmid names are labeled on the left of the chromosomes. Gene arrows show the transcriptional polarity.
FHypothetical model for telomere expansion via gene conversion. (A) Linear organelle chromosome with identical palindromic telomeres (charcoal) on the left (L) and right (R) ends and four genes (w, x, y, and z) (orange) in the genome core (light gray)—Gene arrows show the transcriptional polarity. (B) Two copies of the same linear organelle chromosome aligned in opposite orientations. Recombination (X) between the left and right telomeres of the different chromosomes causes the branch migration of a Holliday junction across the telomere/subtelomere border (dotted line), resulting in the formation of heteroduplex DNA. Heteroduplex is resolved by sequence correction against either strand (in this case the sequence from the right subtelomere), resulting in expansion of the palindromic repeat into noncoding regions of the genome core. (C) Overtime, the same processes described in (B) result in the expansion of the telomeres into coding regions within the genome core and the almost-complete integration of gene z into the chromosome ends. (D) Eventually, telomeric recombination and gene conversion cause multiple genes to be integrated into the telomeres and gene homogenization between the C-termini of the x and w genes.
Within-Species Mitochondrial DNA Genetic Diversity
| Species | Lineage | Genetic Diversity | Subtelomeric Gene Duplication and/or Homogenization | |
|---|---|---|---|---|
| Silent Sites | Telomeres | |||
| Cnidarian | 0.079 | 0.044 | Yes | |
| Cnidarian | 0.024 | 0.033 | Yes | |
| Apicomplexan | 0.021 | 0.040 | Yes | |
| Ciliate | 0.013 | NA | Yes | |
| Ciliate | 0.012 | NA | Yes | |
| Green algae | 0.009 | 0.008 | No | |
| Apicomplexan | 0.005 | <0.001 | No | |
| Fungi | <0.001 | <0.001 | No | |
| Fungi | <0.001 | <0.001 | No | |
| Green algae | <0.001 | 0.018 | No | |
Note.—NA, not available. Genetic diversity is the average pairwise number of nucleotide differences per site. Silent sites include synonymous, noncoding, and/or intronic positions. Values for A. moseri, B. bovis, C. parapsilosis, C. reinhardtii, and P. parva come from the literature (Smith and Lee 2008, 2011; Smith and Keeling 2012; Smith et al. 2012; Valach et al. 2012). Those for the other species were calculated using available mitochondrial genome data—see Materials and Methods and supplementary table S2, Supplementary Material online, for details.