| Literature DB >> 36140788 |
Haoqi Wang1,2, Xuezhu Liao2,3, Luke R Tembrock4, Zuoren Yang1,5, Zhiqiang Wu2,3.
Abstract
DNA originating from organellar genomes are regularly discovered in nuclear sequences during genome assembly. Nevertheless, such insertions are sometimes omitted during the process of nuclear genome assembly because the inserted DNA is assigned to organellar genomes, leading to a systematic underestimation of their frequency. With the rapid development of high-throughput sequencing technology, more inserted fragments from organelle genomes can now be detected. Therefore, it is necessary to be aware of the insertion events from organellar genomes during nuclear genome assembly to properly attribute the impact and rate of such insertions in the evolution of nuclear genomes. Here, we investigated the impact of intracellular gene transfer (IGT) from the plastome to the nuclear genome using genome assemblies that were refined through time with technological improvements from two model species, Arabidopsis thaliana and Oryza sativa. We found that IGT from the plastome to the nuclear genome is a dynamic and ongoing process in both A. thaliana and O. sativa, and mostly occurred recently, as the majority of transferred sequences showed over 95% sequence similarity with plastome sequences of origin. Differences in the plastome-to-nuclear genome IGT between A. thaliana and O. sativa varied among the different assembly versions and were associated with the quality of the nuclear genome assembly. IGTs from the plastome to nuclear genome occurred more frequently in intergenic regions, which were often associated with transposable elements (TEs). This study provides new insights into intracellular genome evolution and nuclear genome assembly by characterizing and comparing IGT from the plastome into the nuclear genome for two model plant species.Entities:
Keywords: NUPTs (nuclear plastid DNAs); genome assembly index; genome evolution; intracellular gene transfer; plastid
Mesh:
Substances:
Year: 2022 PMID: 36140788 PMCID: PMC9498363 DOI: 10.3390/genes13091620
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Basic information on A. thaliana genomic data.
| Version | Name | Time | Ecotype | Assembly | Sequencing Tech | BUSCO |
|---|---|---|---|---|---|---|
| phytozome13 | Phy13 | 2013 | Columbia | Athaliana_167.fa.gz | Next-generation sequencing | 99.30% |
| tair10 | T10 | 2014 | Columbia | GCA_000001735.2 | Next-generation sequencing | 99.30% |
| almost complete | GWH | 2021 | Columbia | GWHBDNP00000000.1 | ONT and PacBio and Hi-C | 99.40% |
| no-gap | Col | 2021 | Columbia | Col-CEN | ONT and PacBio HiFi long-read | 99.40% |
Basic information on O. sativa genomic data.
| Version | Name | Time | Strain | Assembly | Sequencing Tech | BUSCO |
|---|---|---|---|---|---|---|
| draft sequence | 9311 | 2002 | Indica | GCA_000004675.2 | Whole-genome shotgun sequencing | 96.30% |
| near complete | SH498 | 2017 | Indica | GCA_002151415.1 | PacBio | 98.50% |
| no-gap | MH63 | 2021 | Indica | GCA_001623365.2 | PacBio Sequel II | 98.70% |
| no-gap | ZS97 | 2021 | Indica | GCA_001623345.3 | PacBio Sequel II | 98.70% |
Figure 1The length ratio and number of transferred plastome sequences in nuclear chromosomes in O. sativa and A. thaliana with 95% and higher identity. (a) Length proportion of plastome transfers to nuclear chromosomes in O. sativa across four different size categories. (b) The number of plastome sequences transferred to nuclear chromosomes in O. sativa across four different size categories. (c) Length proportion of plastome transfers to nuclear chromosomes in A. thaliana across four different size categories. (d) The number of plastome sequences transferred to nuclear chromosomes in A. thaliana across four different size categories. The bar charts represent the relative length of all transferred fragments to the total length of each nuclear chromosome as a percent, assembly versions are ordered along the x axis from oldest to newest.
Figure 2The GC and TE (transposable element) content in flanking sequences around plastome transferred sequences. (a,c) The GC content of flanking sequences by size category and assembly version in O. sativa and A. thaliana, respectively. (b,d) The number of TEs in flanking sequences by size category and assembly version in O. sativa and A. thaliana, respectively.
Figure 3Differences in plastome-to-nuclear genome IGT between different assembly versions in O. sativa and A. thaliana. (a) The length proportion of A. thaliana plastome inserts to each genic region in the category with identity ≥ 95%. (b) The number of A. thaliana plastome inserts to each genic region in the category with identity ≥ 95%. (c) The length proportion of O. sativa plastome inserts to each genic region in the category with identity ≥ 95%. (d) The number of O. sativa plastome inserts to each genic region in the category with identity ≥ 95%. Exon refers to the exon region of an annotated gene, intron refers to the intron region of an annotated gene, and inter refers to the intergenic regions between annotated genes. (e) TE content of O. sativa plastome inserts to each nuclear genic region in the category with identity ≥ 95%. (f) TE content of A. thaliana plastome inserts to each nuclear genic region in the category with identity ≥ 95%.