| Literature DB >> 33976800 |
Fengming Ren1,2, Liqiang Wang3, Ying Li1,4, Wei Zhuo2, Zhichao Xu1,4, Haojie Guo5, Yan Liu2, Ranran Gao1, Jingyuan Song1,4.
Abstract
The increasingly wide application of chloroplast (cp) genome super-barcode in taxonomy and the recent breakthrough in cp genetic engineering make the development of new cp gene resources urgent and significant. Corydalis is recognized as the most genotypes complicated and taxonomically challenging plant taxa in Papaveraceae. However, there currently are few reports about cp genomes of the genus Corydalis. In this study, we sequenced four complete cp genomes of two endangered lithophytes Corydalis saxicola and Corydalis tomentella in Corydalis, conducted a comparison of these cp genomes among each other as well as with others of Papaveraceae. The cp genomes have a large genome size of 189,029-190,247 bp, possessing a quadripartite structure and with two highly expanded inverted repeat (IR) regions (length: 41,955-42,350 bp). Comparison between the cp genomes of C. tomentella, C. saxicola, and Papaveraceae species, five NADH dehydrogenase-like genes (ndhF, ndhD, ndhL, ndhG, and ndhE) with psaC, rpl32, ccsA, and trnL-UAG normally located in the SSC region have migrated to IRs, resulting in IR expansion and gene duplication. An up to 9 kb inversion involving five genes (rpl23, ycf2, ycf15, trnI-CAU, and trnL-CAA) was found within IR regions. The accD gene was found to be absent and the ycf1 gene has shifted from the IR/SSC border to the SSC region as a single copy. Phylogenetic analysis based on the sequences of common CDS showed that the genus Corydalis is quite distantly related to the other genera of Papaveraceae, it provided a new clue for recent advocacy to establish a separate Fumariaceae family. Our results revealed one special cp genome structure in Papaveraceae, provided a useful resources for classification of the genus Corydalis, and will be valuable for understanding Papaveraceae evolutionary relationships.Entities:
Keywords: Corydalis saxicola; Corydalis tomentella; Papaveraceae; chloroplast genome; taxonomic study
Year: 2021 PMID: 33976800 PMCID: PMC8093665 DOI: 10.1002/ece3.7312
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
FIGURE 1The habitat of C. saxicola and C. tomentella. (a) The distant view of steep cliff growing C. saxicola; (b) the close shot of C. saxicola; (c) the close shot of C. tomentella. The yellow arrows indicated the Corydalis plants
FIGURE 2Schematic representation of the chloroplast genomes of C. tomentella. The map contains four rings. From the center going outward, the first circle shows forward and reverse repeats connected with red and green arcs, respectively. The next circle shows tandem repeats marked with short bars. The third circle shows microsatellite sequences identified by MISA. The fourth circle is drawn using drawgenemap and shows the gene structure of the plastome. The genes are colored on the basis of their functional categories. Genes inside and outside of the circle are transcribed in clockwise and counterclockwise directions, respectively. IR, inverted repeat; LSC, large single copy; SSC, small single copy. The red rectangles indicated the nine gens (ndhF, ndhD, ndhL, ndhG, ndhE, psaC, ccsA, rpl32, and trnL‐UAG) normally located in the SSC region have migrated to IRs; the green rectangles indicated the reversed segment involving five genes (rpl23, ycf2, ycf15, trnl‐CAU, and trnL‐CAA)
Summary of chloroplast genome features of C. tomentella and C. saxicola
| Species | Voucher No. | Genbank No. | Total | Length (bp) | GC content (%) | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| IR | LSC | SSC | Total | IR | LSC | SSC | ||||
|
| MHJ1 | MT093187 | 190,247 | 41,955 | 96,701 | 9,636 | 40.3 | 42.2 | 39.2 | 35.4 |
| MHJ2 | MT077878 | 190,198 | 42,002 | 96,530 | 9,664 | 40.2 | 42.2 | 39.0 | 35.4 | |
|
| YHL1 | MT077877 | 189,155 | 42,350 | 94,744 | 9,711 | 40.2 | 42.2 | 39.1 | 35.1 |
| YHL2 | MT077879 | 189,029 | 42,164 | 94,993 | 9,708 | 40.3 | 42.2 | 39.1 | 35.1 | |
List of genes in the two Corydalis chloroplast genomes
| Group of genes | Gene names | Number of genes |
|---|---|---|
| Photosystem I |
| 5 (2) |
| Photosystem II |
| 14 |
| Cytochrome b/f complex |
| 6 |
| ATP synthase |
| 6 |
| NADH‐dehydrogenase |
| 11 (6) |
| RubisCO large subunit |
| 1 |
| DNA dependent RNA polymerase |
| 4 |
| Small subunit of ribosome |
| 12 (2) |
| Large subunit of ribosome |
| 9 (3) |
| Proteins of unknown function |
| 5 (2) |
| Other genes |
| 5 (1) |
| Transfer RNAs | 37 tRNAs( | 37/38 |
| Ribosomal RNAs |
| 4 (4) |
One or two asterisks followed genes indicate the number of contained introns, respectively. (×2) indicates the number of the repeat unit is 2. The numbers in parenthesis at the line of “Number” indicate the total number of repeated genes.
FIGURE 3Sequence identity plot comparison of the C. tomentella and C. saxicola cp genomes. Gray arrows and thick black lines above the alignment indicate genes with their orientation and the position of the inverted repeats (IRs), respectively. A cutoff of 70% identity was used for the plots, and the Y‐scale represents the percent identity ranging from 50% to 100%
FIGURE 4Sequence identity plot comparison of the cp genomes of C. tomentella, C. saxicola, P. somniferum, P. rhoeas, and C. hymenoides. Gray arrows and thick black lines above the alignment indicate genes with their orientation and the position of the inverted repeats (IRs), respectively. A cutoff of 70% identity was used for the plots, and the Y‐scale represents the percent identity ranging from 50% to 100%
FIGURE 5Comparison of the borders of LSC, SSC, and IR regions among the eight chloroplast genomes. Number above the gene features indicates the distance between the ends of genes and the border sites. Ψ: pseudogenes
Interspersed repeat sequences and tandem repeat sequences of C. saxicola and C. tomentella
| Species | Voucher No. | SSR | Interspersed repeat sequences | ||||
|---|---|---|---|---|---|---|---|
| Total | Mono SSR | Total | T | F | P | ||
|
| MHJ1 | 172 | 100 | 111 | 61 | 39 | 11 |
| MHJ2 | 174 | 102 | 112 | 62 | 39 | 11 | |
|
| YHL1 | 171 | 96 | 132 | 82 | 23 | 27 |
| YHL2 | 170 | 96 | 133 | 83 | 26 | 24 | |
Abbreviations: F, Forward repeats; P, palindromic repeats; T, tandem repeats.
FIGURE 6ML tree of C. saxicola and C. tomentella and its relative species based on common protein coding sequences