| Literature DB >> 36247578 |
Jianguang Liang1, Rui Chen2,3, Fantao Zhang4, Qian Wang2, Yingxia Yang2, Mingjie Lv2, Shuangyong Yan3, Shan Gao5.
Abstract
Background: Plant chloroplast DNA (cpDNA) typically has a circular structure, including a large single-copy region (LSC), a small single-copy region (SSC) and two inverted repeats (IR1 and IR2). The organization of these four elementary regions LSC-IR1-SSC-IR2 is highly conserved across all plant cpDNAs. Very few structural variations (SVs) occurring at the elementary-region level have been reported.Entities:
Keywords: SDSA; asymmetric recombination; inverted repeat; structural variation; transposon-like element
Year: 2022 PMID: 36247578 PMCID: PMC9559570 DOI: 10.3389/fpls.2022.929352
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Figure 1Full-length chloroplast genome of Dongxiang wild rice. According to the structure LSC-IR1-SSC-IR2, the genome sequence of circular cpDNA was clockwise linearized, starting at the first three nts “CCC” of LSC (indicated by a red arrow). In total, 4 rRNA, 30 tRNA, 76 protein-coding genes and 6 ORFs are annotated. The genes encoded on the forward strand and reverse strands are indicated by symbols written in black and red text, respectively. The genome sequences and the gene annotations are provided in the supplementary file 1 and 2, respectively. LSC, large single-copy region; SSC, small single-copy region; IR1, inverted repeat 1; IR2, inverted repeat 2. * The gene rps12 has two exons and the second exon has two copies in IR1 and IR2.
Annotated genes in cpDNAs.
| Category | Group | Genes |
|---|---|---|
| Self-replication$ | Large subunit of ribosomal proteins (8) |
|
| Small subunit of ribosomal proteins (11) |
| |
| DNA-dependent RNA polymerase (4) |
| |
| Ribosomal RNA genes (4) | rRNA 16S, 23S, 4.5S, 5S | |
| Transfer RNA genes (30) | 30 tRNA genes (6 contain an intron) | |
| Genes for photosynthesis$ | Subunits of NADH-dehydrogenase (11) |
|
| Subunits of photosystem I (5) |
| |
| Subunits of photosystem II (15) |
| |
| Subunits of cytochrome b/f complex (6) |
| |
| Subunits of ATP synthase (6) |
| |
| Subunit of rubisco (1) |
| |
| Others$ | Maturase K |
|
| Envelope membrane protein |
| |
| Subunit of Acetyl-CoA-carboxylase |
| |
| C-type cytochrome synthesis gene |
| |
| Protease |
| |
| Unknown function$ | Conserved hypothetical ORFs (4) |
|
| Only present in Rice | Genes (3) and ORFs (6) |
|
$76 protein-coding genes are present in Onobrychis cpDNAs. In the second column, the number in parentheses after the group name is the gene number of this group. In the third column, the number of asterisks after the gene names indicates the number of introns contained in the genes. The duplicated genes in two IRs are counted once. Three genes accD, ycf1 and ycf2 are absent in rice cpDNAs, while three genes infA, rpl22 and rps16 are absent in Onobrychis cpDNAs. In addition, ORF63, 70, 91, 106, 133, and 249 were predicted in rice cpDNAs. The sequences of all these genes are provided in the .
Figure 2Large structural variations in cpDNAs and their formation. Plant chloroplast DNA (cpDNA) typically has a circular structure, including a large single-copy region (LSC), a small single-copy region (SSC) and two inverted repeats (IR1 and IR2). The organization of these four elementary regions LSC-IR1-SSC-IR2 is highly conserved across all plant cpDNAs. (A) A large inversion resulted in a reverse orientation of 26 genes (from psbK to rbcL) in the LSCs and the loss/acquisition of rps16 and accD (in red color). Another large inversion resulted in a reverse orientation of 10 genes (in green color). The genomic positions of the genes were annotated according to the Nipponbare cpDNA (RefSeq: NC_001320) and the Onobrychis gaubae cpDNA (GenBank: LC647182). (B) A large inversion of SSC results in a reverse orientation of SSC (denoted as ssc). According to the orientation of SSC, cpDNAs can be classified into the forward SSC (SSC-F) and reverse SSC (SSC-R) type cpDNAs, which were also designated as wild-type cpDNAs (denoted as wild) and mutants (denoted as mutant1). The frequent inversion of a whole SSC was named as SSC switching. Wildr and mutant1r represents cpDNAs that have reverse complimentary sequences to wild and mutant1, respectively. LSC and SSC are reverse complimentary to lsc and ssc, respectively. As IR1 is identical to IR2, both LSC-IR1-ssc-IR2 and LSC-IR2-ssc-IR1 represent the same sequence (C) The recombination between mutant1r and wild resulted in a cpDNA (mutant2) that lost a IR and another (mutant3) that acquired a IR. Although the IR−lacking cpDNA (mutant2) was not detected in the present study, it had been previously reported in Onobrychis spp (Moghaddam et al., 2022). While the 319-bp deleted segment was at the 3’ end of the IR1 the 22810-bp insert includes five nucleotides “ATTTT” plus a 2000-bp segment of LSC (denoted as i1) and a whole IR1. The 319-bp lacking IR1 is denoted as IR1d. A high-score segment pair (HSP) that may be involved as recombination sites (indicated by *) was detected between the 5’ end of the 319-bp deleted segment (the below sequence) and that of i1 (the above sequence). Junction reads (indicated by #) were used to validate these structural variations (Materials and methods). The sequence (in black box) synthesized during recombination contain an insert of five nucleotides “ATTTT” (indicated by asterisks below) that resulted in a repeat TTGAATTTTGATT.