| Literature DB >> 32790831 |
Hanhan Xia1, Wei Zhao2,3, Yong Shi4,5,6, Xiao-Ru Wang2,3, Baosheng Wang4,5,6.
Abstract
Short tandem repeats (STRs) contribute to structural variation in plant mitochondrial genomes, but the mechanisms underlying their formation and expansion are unclear. In this study, we detected high polymorphism in the nad7-1 region of the Pinus tabuliformis mitogenome caused by the rapid accumulation of STRs and rearrangements over a few million years ago. The STRs in nad7-1 have a 7-bp microhomology (TAG7) flanking the repeat array. We then scanned the mitogenomes of 136 seed plants to understand the role of microhomology in the formation of STR and mitogenome evolution. A total of 13,170 STRs were identified, and almost half of them were associated with microhomologies. A substantial amount (1,197) of microhomologies was long enough to mediate structural variation, and the length of microhomology is positively correlated with the length of tandem repeat unit. These results suggest that microhomology may be involved in the formation of tandem repeat via microhomology-mediated pathway, and the formation of longer duplicates required greater length of microhomology. We examined the abundance of these 1,197 microhomologies, and found 75% of them were enriched in the plant mitogenomes. Further analyses of the 400 prevalent microhomologies revealed that 175 of them showed differential enrichment between angiosperms and gymnosperms and 186 differed between angiosperms and conifers, indicating lineage-specific usage and expansion of microhomologies. Our study sheds light on the sources of structural variation in plant mitochondrial genomes and highlights the importance of microhomology in mitochondrial genome evolution.Entities:
Keywords: zzm321990 Pinuszzm321990 ; microhomology; mitochondrial genome; short tandem repeat; structural variation
Year: 2020 PMID: 32790831 PMCID: PMC7643612 DOI: 10.1093/gbe/evaa172
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.Sequence structure of 19 haplotypes identified in nad7-1 of P. tabuliformis. (A) The two conserved blocks are indicated by a black rectangle; seven motifs are indicated by pentagons with colors corresponding to those used in (B); the highly variable motif Rp is indicated by gray pentagons with the length labeled. The superscript latter "a" and "b" indicated two different Rp motifs with same length; and the TAG7 microhomology is indicated by a blue open triangle. Three insertions specific to a subset of motifs are indicated by yellow, light blue, and black arrows. A large sequence block (DUP) duplicated in H11–19 is outlined in red. (B) Alignment of the seven motifs.
. 2.Tandem repeats associated with end homology in plant species. (A) Frequency of tandem repeats associated with end homology in angiosperm and gymnosperm species. In each box, horizontal lines from top to bottom refer to the first quartile, median, and third quartile. Each red, green, and blue dot represents an angiosperm, conifer, and gymnosperm (excluding conifers), respectively. (B) Length of end homology against repeat units. Correlations were evaluated by Pearson’s correlation coefficient tests. (C) Distribution of end homology lengths. Values of ≥6 bp suggest microhomology-mediated tandem duplication.
. 3.Differential enrichment of microhomologies in mitogenomes of angiosperms, gymnosperms, and conifers. (A) Microhomology “TAAAGGT” (TAG7); (B) “ATATACG”; (C) “AGCAAGC”; (D) microhomology “AGTCTTC.” Each red, green, and blue dot represents a representative angiosperm, conifer, and nonconifer gymnosperm, respectively. P values of Mann Whitney U tests after Benjamini–Hochberg correction for ECH values of angiosperm vs. gymnosperm and angiosperm vs. conifer are shown in each boxplot. The top, middle, and bottom horizontal lines in each boxplot indicate the first quartile, median, and third quartile, respectively.