| Literature DB >> 23204304 |
Ibrar Ahmed, Patrick J Biggs, Peter J Matthews, Lesley J Collins, Michael D Hendy, Peter J Lockhart.
Abstract
A characteristic feature of eukaryote and prokaryote genomes is the co-occurrence of nucleotide substitution and insertion/deletion (indel) mutations. Although similar observations have also been made for chloroplast DNA, genome-wide associations have not been reported. We determined the chloroplast genome sequences for two morphotypes of taro (Colocasia esculenta; family Araceae) and compared these with four publicly available aroid chloroplast genomes. Here, we report the extent of genome-wide association between direct and inverted repeats, indels, and substitutions in these aroid chloroplast genomes. We suggest that alternative but not mutually exclusive hypotheses explain the mutational dynamics of chloroplast genome evolution.Entities:
Mesh:
Year: 2012 PMID: 23204304 PMCID: PMC3542561 DOI: 10.1093/gbe/evs110
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FColocasia esculenta var. RR chloroplast genome (GenBank accession: JN105690). Brown lines in the outer circle represent the LSC and SSC regions, cyan lines represent the IRs, whereas inner green lines show AT and blue lines show GC percentage throughout the cp genome.
Comparison among Total Size (bp) and Sizes of the LSC, SSC, and Two IR Regions in Taro and Other Aroid Chloroplast Genomes
| Species | GenBank ID | Genome Size | LSC | SSC | IR |
|---|---|---|---|---|---|
| JN105689 | 162,424 | 89,670 (55.21) | 22,208 (13.67) | 25,273 (31.12) | |
| JN105690 | 162,546 | 89,817 (55.26) | 22,075 (13.58) | 25,327 (31.16) | |
| NC010109 | 165,955 | 89,906 (54.17) | 13,603 (8.20) | 31,223 (37.63) | |
| JN160603 | 168,788 | 91,222 (54.04) | 14,056 (8.33) | 31,755 (37.63) | |
| JN160604 | 169,337 | 92,015 (54.34) | 13,956 (8.24) | 31,683 (37.42) | |
| JN160605 | 168,704 | 91,454 (54.21) | 13,394 (7.94) | 31,930 (37.85) |
Note.—Percentage proportions of the LSC, SSC, and IRs are given in parenthesis.
FCircos plot of taro (Colocasia esculenta) var. RR showing the relationship between short repeats within the chloroplast genome and distribution of indels and substitutions in pairwise comparisons of taro var. RR cp genome with other aroid cp genomes. All data in the histogram tracks are shown in nonoverlapping 250 bp bins, with the taro var. RR genome taken as a reference for the coordinate space. Tracks from the outermost to innermost show: taro var. RR chloroplast ideogram (LSC in purple, SSC in red, and IRs in cyan); genome annotation on the positive and negative strand (genes in green; tRNAs in blue and rRNAs in purple); five circular plots showing comparisons between C. esculenta var. RR with 1) C. esculenta var. GP, 2) Lemna minor, 3) Spirodela polyrhiza, 4) Wolffia australiana, and 5) Wolffiella lingulata. For each genome comparison, the number of indels in each 250 bp bin is shown in orange (scale of 0–10), and the number of substitutions is shown in blue (scale of 0–160). Across these five plots, the light green coloring indicates the coding regions. The figure center shows the results of Reputer mapping using the taro var. RR chloroplast genome. Two ends of a red line mark the two locations of the forward (direct) repeats, whereas those of a green line mark the two locations of the reverse (inverted) repeats on the genome. In this part of the figure, the large IRs are not plotted, as they would obscure a large part of the figure. Number of repeats shown in the diagram is 667, with a size range from 15 to 48 bp (average repeat size: 16 bp).
Comparisons among the Pairwise Alignments (Colocasia esculenta var. RR taken as a Reference) to Calculate the Correlations between 1) Repeats and Substitutions, 2) Insertion-Deletions (indels) and Substitutions, and 3) Repeats and Indels
| Comparison | |||||
|---|---|---|---|---|---|
| Repeats and substitutions | |||||
| Correlation between repeats and substitutions ( | 0.245 | 0.391 | 0.416 | 0.424 | 0.491 |
| Significance of correlation ( | 6.44 | 10.81 | 11.657 | 11.92 | 14.37 |
| Coefficient of determination ( | 0.060 | 0.152 | 0.173 | 0.180 | 0.241 |
| Insertion–deletions (indels) and substitutions | |||||
| Correlation between indels and substitutions ( | 0.391 | 0.220 | 0.245 | 0.323 | 0.387 |
| Significance of correlation ( | 10.82 | 5.75 | 6.43 | 8.71 | 10.69 |
| Coefficient of determination ( | 0.153 | 0.048 | 0.060 | 0.105 | 0.150 |
| Repeats and indels | |||||
| Correlation between repeats and indels ( | 0.640 | 0.168 | 0.178 | 0.224 | 0.212 |
| Significance of correlation ( | 21.20 | 4.33 | 4.59 | 5.87 | 5.51 |
| Coefficient of determination ( | 0.409 | 0.028 | 0.032 | 0.050 | 0.045 |
Note.—The alignments compared closely related (var. RR to var. GP) and distantly related (var. RR to W. lingulata, W. australiana, L. minor, and S. polyrhiza) aroid chloroplast genomes. The alignments were partitioned into 651 nonoverlapping bins of 250 bp size each to calculate these correlations.
***All correlations were highly significant at 0.001α and 649 degree of freedom.
FResults showing (a) the number of mean non-zero data points used to calculate the substitution density and (b) the values of substitution density in 125 bp sequence adjacent to the ILPs for the parental alignment as well as two of its partitions, A and B (Partition A contains ILPs associated with SSR indels in coding and noncoding regions, while partition B contains ILPs associated with large indels in coding and noncoding regions). Lower than 150 values for non-zero data points at >25 bp distance in (a) represents that taking an average for 1,000 random iterations, lesser than 150 ILPs are 125 bp apart from each other in all three types of comparisons.
Summary Statistics for BWA Mapping of 75 Base, Paired-End Reads Obtained from the Colocasia esculenta var. RR and var. GP Morphotypes to the Lemna minor Chloroplast Genome and to Their Assembled Chloroplast Genomes
| Parameter | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RR1 | RR2 | RPE | GR1 | GR2 | GPE | RR1 | RR2 | RPE | GR1 | GR2 | GPE | |
| Genome coverage (%) | 68.5 | 68.1 | 85 | 68.5 | 68.7 | 85.6 | 99.99 | 100 | 100 | 100 | 100 | 100 |
| Average coverage depth | 129 | 128 | 337 | 319 | 317 | 825 | 296 | 294 | 593 | 665 | 659 | 1,338 |
| Maximum coverage depth | 674 | 623 | 1,531 | 1,984 | 1,853 | 5,194 | 641 | 656 | 1,020 | 1,940 | 2,021 | 3,304 |
Note.—The acronyms RR1, RR2, and RPE represent mapping with the read 1, read 2, and paired-end (reads 1 and 2 taken together) reads obtained from the var. RR morphotype. Similarly, GR1, GR2, and GPE represent mapping with the read 1, read 2, and paired-end reads obtained from the var. GP morphotype.