| Literature DB >> 35741799 |
Shenglong Kan1,2, Xuezhu Liao1,2, Zhiqiang Wu1,2.
Abstract
There is a paradox in the plant mitochondrial genome, that is, the genic region evolves slowly while the intergenic region evolves rapidly. Thus, the intergenic regions of the plant mitochondrial genome are difficult to align across different species, even in closely related species. Here, to character the mechanism of this paradox, we identified interspecific variations in the Ginkgo biloba, Oryza sativa, and Arabidopsis thaliana mitochondrial and plastid genome at a genome-wide level. The substitution rate of synonymous sites in genic regions was similar to the substitution rate of intergenic regions, while the substitution rate of nonsynonymous sites in genic regions was lower than that in intergenic regions, suggesting the mutation inputs were the same among different categories within the organelle genome, but the selection pressure varied. The substitution rate of single-copy regions was higher than that of IR (inverted repeats) in the plastid genome at an intraspecific level. The substitution rate of single-copy regions was higher than that of repeats in the G. biloba and A. thaliana mitochondrial genomes, but lower in that of O. sativa. This difference may be related to the length and distribution of repeats. Copy number variations that existed in the G. biloba and O. sativa mitochondrial genomes were confirmed. This study reveals the intraspecific variation pattern of organelle genomes at a genome-wide level, and that copy number variations were common in plant mitochondrial genomes.Entities:
Keywords: copy number variation; mutation rate; organelle; seed plants; single-nucleotide polymorphisms
Mesh:
Substances:
Year: 2022 PMID: 35741799 PMCID: PMC9222611 DOI: 10.3390/genes13061036
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Intraspecific variations in mitogenomes s. SNPs: single nucleotide polymorphisms; MAF: minor allele frequency.
| Sequence Type | Sites | SNPs | SNPs per Site | SNP MAF | Indels | Indels per Site | Indel MAF | |
|---|---|---|---|---|---|---|---|---|
|
| Protein Coding | 34,407 | 28 | 0.0008 | 0.4606 | 0 | 0.0000 | NA |
| Nonsynonymous | 25,519 | 20 | 0.0008 | 0.4850 | 0 | 0.0000 | NA | |
| Synonymous | 8888 | 8 | 0.0009 | 0.3996 | 0 | 0.0000 | NA | |
| rRNA | 5006 | 6 | 0.0012 | 0.1819 | 1 | 0.0002 | 0.1447 | |
| tRNA | 1746 | 2 | 0.0011 | 0.5000 | 0 | 0.0000 | NA | |
| Pseudogene | 0 | 0 | NA | NA | 0 | 0.0000 | NA | |
| Intron | 39,183 | 25 | 0.0006 | 0.3971 | 13 | 0.0003 | 0.2202 | |
| Intergenic | 266,202 | 242 | 0.0009 | 0.3917 | 125 | 0.0005 | 0.1930 | |
| Single-copy region | 318,455 | 261 | 0.0008 | 0.4369 | 127 | 0.0004 | 0.1984 | |
| Repeat | 28,089 | 42 | 0.0015 | 0.1351 | 12 | 0.0004 | 0.1611 | |
| Total | 346,544 | 303 | 0.0009 | 0.3951 | 139 | 0.0004 | 0.1952 | |
|
| Protein Coding | 30,747 | 17 | 0.0006 | 0.0000 | 3 | 0.0001 | 0.0370 |
| Nonsynonymous | 23,009 | 10 | 0.0004 | 0.0000 | 3 | 0.0001 | 0.0370 | |
| Synonymous | 7738 | 7 | 0.0009 | 0.0000 | 0 | 0.0000 | NA | |
| rRNA | 5324 | 53 | 0.0100 | 0.0189 | 26 | 0.0049 | 0.1026 | |
| tRNA | 1471 | 0 | 0.0000 | NA | 0 | 0.0000 | NA | |
| Pseudogene | 1561 | 0 | 0.0000 | NA | 0 | 0.0000 | NA | |
| Intron | 99,594 | 215 | 0.0022 | 0.1094 | 60 | 0.0006 | 0.0827 | |
| Intergenic | 351,823 | 331 | 0.0009 | 0.1652 | 92 | 0.0003 | 0.1712 | |
| Single-copy region | 225,262 | 518 | 0.0023 | 0.1462 | 145 | 0.0006 | 0.1296 | |
| Repeat | 265,258 | 98 | 0.0004 | 0.0355 | 36 | 0.0001 | 0.1306 | |
| Total | 490,520 | 616 | 0.0013 | 0.1286 | 181 | 0.0004 | 0.1298 | |
|
| Protein Coding | 31,272 | 48 | 0.0015 | 0.0000 | 1 | 0.0000 | 0.0000 |
| Nonsynonymous | 23,381 | 28 | 0.0012 | 0.0000 | 1 | 0.0000 | 0.0000 | |
| Synonymous | 7891 | 20 | 0.0025 | 0.0000 | 0 | 0.0000 | NA | |
| rRNA | 5222 | 4 | 0.0008 | 0.0000 | 0 | 0.0000 | NA | |
| tRNA | 1689 | 0 | 0.0000 | NA | 0 | 0.0000 | NA | |
| Pseudogene | 1256 | 5 | 0.0040 | 0.0000 | 0 | 0.0000 | NA | |
| Intron | 28,422 | 74 | 0.0026 | 0.0000 | 13 | 0.0005 | 0.0538 | |
| Intergenic | 299,947 | 1170 | 0.0039 | 0.0015 | 206 | 0.0007 | 0.0222 | |
| Single-copy region | 326,203 | 1100 | 0.0034 | 0.0016 | 173 | 0.0005 | 0.0202 | |
| Repeat | 41,605 | 201 | 0.0048 | 0.0000 | 47 | 0.0011 | 0.0378 | |
| Total | 367,808 | 1301 | 0.0035 | 0.0014 | 220 | 0.0006 | 0.0240 |
Figure 1Scatter plot of length and number of SNPs for different categories within mitogenome. (a) Ginkgo biloba, (b) Oryza sativa, (c) Arabidopsis thaliana. The red dotted line indicates the relationship between length and the number of SNPs in intergenic region. Nonsynonymous and synonymous indicates nonsynonymous sites and synonymous sites of the coding regions of protein-coding genes, respectively.
Figure 2Percentage of transition and transversion rates within the genome of organelles in three species. The mt indicates mitogenome; pt indicates plastid genome.
Intraspecific variations in plastid genomes. SNPs: single nucleotide polymorphisms.
|
|
|
| ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sequence Type | Sites | SNPs | SNPs per Site | Indels | Indels per Site | Sites | SNPs | SNPs per Site | Indels | Indels per Site | Sites | SNPs | SNPs per Site | Indels | Indels per Site | |
| Total | Protein Coding | 66,348 | 42 | 0.0006 | 5 | 0.0001 | 58,818 | 60 | 0.001 | 11 | 0.0002 | 79,227 | 839 | 0.0106 | 19 | 0.0002 |
| Nonsynonymous | 50,043 | 31 | 0.0006 | 5 | 0.0001 | 44,295 | 36 | 0.0008 | 11 | 0.0002 | 60,108 | 376 | 0.0063 | 14 | 0.0002 | |
| Synonymous | 16,305 | 11 | 0.0007 | 0 | 0 | 14,524 | 24 | 0.0017 | 0 | 0 | 19,119 | 463 | 0.0242 | 5 | 0.0003 | |
| rRNA | 8925 | 4 | 0.0004 | 0 | 0 | 9182 | 0 | 0 | 0 | 0 | 8929 | 1 | 0.0001 | 2 | 0.0002 | |
| tRNA | 10,778 | 5 | 0.0005 | 3 | 0.0003 | 9346 | 1 | 0.0001 | 2 | 0.0002 | 10,233 | 70 | 0.0068 | 20 | 0.002 | |
| Pseudogene | 0 | 0 | NA | 0 | NA | 883 | 1 | 0.0011 | 0 | 0 | 0 | 0 | NA | 0 | 0 | |
| Intron | 6523 | 3 | 0.0005 | 4 | 0.0006 | 10,615 | 5 | 0.0005 | 28 | 0.0026 | 12,711 | 217 | 0.0171 | 77 | 0.0061 | |
| Intergenic | 64,414 | 98 | 0.0015 | 49 | 0.0008 | 45,681 | 64 | 0.0014 | 69 | 0.0015 | 43,378 | 1203 | 0.0277 | 577 | 0.0133 | |
| Total | 156,988 | 152 | 0.001 | 61 | 0.0004 | 134,525 | 131 | 0.001 | 110 | 0.0008 | 154,478 | 2330 | 0.0151 | 695 | 0.0045 | |
| LSC | Protein Coding | 53,262 | 25 | 0.0005 | 4 | 0.0001 | 42,708 | 54 | 0.0013 | 5 | 0.0001 | 43,401 | 537 | 0.0124 | 14 | 0.0003 |
| Nonsynonymous | 40,232 | 19 | 0.0005 | 4 | 0.0001 | 32,181 | 33 | 0.001 | 5 | 0.0002 | 32,720 | 190 | 0.0058 | 9 | 0.0003 | |
| Synonymous | 13,030 | 6 | 0.0005 | 0 | 0 | 10,527 | 21 | 0.002 | 0 | 0 | 10,681 | 347 | 0.0325 | 5 | 0.0005 | |
| rRNA | 0 | 0 | NA | 0 | NA | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| tRNA | 6300 | 5 | 0.0008 | 3 | 0.0005 | 4559 | 1 | 0.0002 | 2 | 0.0004 | 6057 | 69 | 0.0114 | 19 | 0.0031 | |
| Pseudogene | 0 | 0 | NA | 0 | NA | 223 | 1 | 0.0045 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Intron | 5774 | 3 | 0.0005 | 4 | 0.0007 | 5798 | 5 | 0.0009 | 27 | 0.0047 | 7823 | 196 | 0.0251 | 69 | 0.0088 | |
| Intergenic | 24,993 | 58 | 0.0023 | 43 | 0.0017 | 18,122 | 57 | 0.0031 | 67 | 0.0037 | 17,960 | 1043 | 0.0581 | 505 | 0.0281 | |
| Total | 99,254 | 91 | 0.0009 | 54 | 0.0005 | 80,592 | 118 | 0.0015 | 101 | 0.0013 | 84,170 | 1845 | 0.0219 | 607 | 0.0072 | |
| IR | Protein Coding | 2892 | 2 | 0.0007 | 0 | 0 | 7542 | 0 | 0 | 0 | 0 | 22,542 | 9 | 0.0004 | 1 | 0 |
| Nonsynonymous | 2169 | 2 | 0.0009 | 0 | 0 | 5672 | 0 | 0 | 0 | 0 | 17,249 | 9 | 0.0005 | 1 | 0.0001 | |
| Synonymous | 723 | 0 | 0 | 0 | 0 | 1870 | 0 | 0 | 0 | 0 | 5293 | 0 | 0 | 0 | 0 | |
| rRNA | 8925 | 4 | 0.0004 | 0 | 0 | 9182 | 0 | 0 | 0 | 0 | 8929 | 1 | 0.0001 | 2 | 0.0002 | |
| tRNA | 4324 | 0 | 0 | 0 | 0 | 4707 | 0 | 0 | 0 | 0 | 4096 | 1 | 0.0002 | 1 | 0.0002 | |
| Pseudogene | 0 | 0 | NA | 0 | NA | 660 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Intron | 749 | 0 | 0 | 0 | 0 | 3830 | 0 | 0 | 0 | 0 | 3808 | 0 | 0 | 0 | 0 | |
| Intergenic | 27,502 | 14 | 0.0005 | 2 | 0.0001 | 24,859 | 0 | 0 | 1 | 0 | 22,082 | 3 | 0.0001 | 5 | 0.0002 | |
| Total | 35,467 | 20 | 0.0006 | 2 | 0.0001 | 41,598 | 0 | 0 | 1 | 0 | 52,528 | 14 | 0.0003 | 9 | 0.0002 | |
| SSC | Protein Coding | 10,194 | 15 | 0.0015 | 1 | 0.0001 | 8568 | 6 | 0.0007 | 6 | 0.0007 | 13,284 | 293 | 0.0221 | 4 | 0.0003 |
| Nonsynonymous | 7645 | 10 | 0.0013 | 1 | 0.0001 | 6448 | 3 | 0.0005 | 6 | 0.0009 | 10,139 | 177 | 0.0175 | 4 | 0.0004 | |
| Synonymous | 2549 | 5 | 0.002 | 0 | 0 | 2120 | 3 | 0.0014 | 0 | 0 | 3145 | 116 | 0.0369 | 0 | 0 | |
| rRNA | 0 | 0 | NA | 0 | NA | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| tRNA | 154 | 0 | 0 | 0 | 0 | 80 | 0 | 0 | 0 | 0 | 80 | 0 | 0 | 0 | 0 | |
| Pseudogene | 0 | 0 | NA | 0 | NA | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Intron | 0 | 0 | NA | 0 | NA | 987 | 0 | 0 | 1 | 0.001 | 1080 | 21 | 0.0194 | 8 | 0.0074 | |
| Intergenic | 11,919 | 26 | 0.0022 | 4 | 0.0003 | 2700 | 7 | 0.0026 | 1 | 0.0004 | 3336 | 157 | 0.0471 | 67 | 0.0201 | |
| Total | 22,267 | 41 | 0.0018 | 5 | 0.0002 | 12,335 | 13 | 0.0011 | 8 | 0.0006 | 17,780 | 471 | 0.0265 | 79 | 0.0044 | |
Figure 3Sequencing depth of mitogenome. (a) Ginkgo biloba individual 1, (b) G. biloba individual 2, (c) G. biloba individual 3, (d) Oryza sativa individual 1, (e) O. sativa individual 2, (f) O. sativa individual 3. Pink color indicates the repeat sequences in mitogenomes.