| Literature DB >> 36045717 |
Abstract
Asparagus cochinchinensis is a traditional Chinese medicinal plant. The chloroplast (cp) genome study on A. cochinchinensis is poorly understood. In this research, we collected the data from the cp genome assembly and gene annotation of A. cochinchinensis, followed by further comparative analysis with six species in the genus Asparagus. The cp genome of A. cochinchinensis showed a circular quadripartite structure in the size of 157,095 bp, comprising a large single-copy (LSC), a small single-copy (SSC), and two inverted repeat (IR) regions. A total of 137 genes were annotated, consisting of 86 protein-coding genes, 8 ribosomal RNAs, 38 transfer RNAs, and 5 pseudo-genes. Forty scattered repetitive sequences and 247 simple sequence repeats loci were marked out. In addition, A/T-ending codons were shown to have a basis in the codon analysis. A cp genome comparative analysis revealed that a similar gene composition was detected in the IR and LSC/SSC regions with Asparagus species. Based on the complete cp genome sequence in Asparagaceae, the result showed that A. cochinchinensis was closely related to A. racemosus by phylogenetic analysis. Therefore, our study providing A. cochinchinensis genomic resources could effectively contribute to the phylogenetic analysis and molecular identification of the genus Asparagus.Entities:
Keywords: Asparagus; Asparagus cochinchinensis; chloroplast genome; molecular identification; phylogenetic analysis
Year: 2022 PMID: 36045717 PMCID: PMC9372710 DOI: 10.1515/biol-2022-0098
Source DB: PubMed Journal: Open Life Sci ISSN: 2391-5412 Impact factor: 1.311
Figure 1The cp genome map of A. cochinchinensis. Genes are located outside of the circle and inside of the circle, which are transcribed clockwise and counterclockwise, respectively. The outermost ring represented the genome sequence; the coding gene is expressed in the green frame; tRNA is expressed in the purple frame; rRNA is expressed in the orange frame; the green ring inside shows the coverage depth, and the depth of reverse repeat area is generally twice as deep as other areas; the darker gray are corresponding to GC content in the inner ring. The areas are marked in the circle, including LSC, SSC, IRa, and IRb regions.
The list of gene categories annotated in the cp genome of A. cochinchinensis
| Category | Gene group | Gene name |
|---|---|---|
| Photosynthesis | Subunits of photosystem I |
|
| Subunits of photosystem II |
| |
| Subunits of NADH dehydrogenase |
| |
| Subunits of cytochrome b/f complex |
| |
| Subunits of ATP synthase |
| |
| Large subunit of rubisco |
| |
| Subunits photochlorophyllide reductase |
| |
| Self-replication | Proteins of large ribosomal subunit |
|
| Proteins of small ribosomal subunit |
| |
| Subunits of RNA polymerase |
| |
| Ribosomal RNAs |
| |
| Transfer RNAs |
| |
| Other genes | Maturase |
|
| Protease |
| |
| Envelope membrane protein |
| |
| Acetyl-CoA carboxylase |
| |
| c-type cytochrome synthesis gene |
| |
| Translation initiation factor |
| |
| other |
| |
| Genes of unknown function | Conserved hypothetical chloroplast ORF |
|
Gene*: gene included one intron; Gene**: gene included two introns; #Gene: pseudogene; Gene (2): gene associated with copy number >1, and the copy number is marked in brackets.
The exon and intron length in A. cochinchinensis cp gene
| Gene | Position | Exon I (bp) | Intron I (bp) | Exon II (bp) | Intron II (bp) | Exon III (bp) |
|---|---|---|---|---|---|---|
|
| LSC | 37 | 2,579 | 35 | ||
|
| LSC | 40 | 858 | 209 | ||
|
| LSC | 24 | 676 | 48 | ||
|
| LSC | 144 | 854 | 411 | ||
|
| LSC | 436 | 751 | 1,625 | ||
|
| LSC | 126 | 749 | 229 | 721 | 152 |
|
| LSC | 35 | 527 | 50 | ||
|
| LSC | 39 | 596 | 35 | ||
|
| IRa | 126 | — | 229 | 544 | 26 |
|
| LSC | 71 | 665 | 292 | 801 | 252 |
|
| LSC | 6 | 754 | 642 | ||
|
| LSC | 8 | 734 | 481 | ||
|
| LSC | 9 | 971 | 399 | ||
|
| IRb | 390 | 664 | 432 | ||
|
| IRb | 777 | 699 | 756 | ||
|
| IRb | 229 | — | 26 | 544 | 126 |
|
| IRb | 37 | 941 | 35 | ||
|
| IRb | 38 | 815 | 35 | ||
|
| SSC | 558 | 1,122 | 540 | ||
|
| IRa | 38 | 815 | 35 | ||
|
| IRa | 37 | 941 | 35 | ||
|
| IRa | 777 | 699 | 756 | ||
|
| IRa | 390 | 664 | 432 | ||
|
| IRa | 38 | 815 | 35 | ||
|
| IRa | 37 | 941 | 35 |
Distribution of repeat sequences in cp genes of A. cochinchinensis
| ID | Repeat I start | Repeat II start | Type | Size (bp) | Distance |
| Gene | Region |
|---|---|---|---|---|---|---|---|---|
| 1 | 59,144 | 59,174 | F | 63 | –3 | 8.75 × 10−23 |
| LSC; LSC |
| 2 | 29,804 | 29,804 | P | 56 | 0 | 1.34 × 10−24 | IGS | LSC; LSC |
| 3 | 59,192 | 59,228 | F | 54 | –3 | 1.43 × 10−17 |
| LSC; LSC |
| 4 | 129,395 | 129,410 | F | 54 | –3 | 1.43 × 10−17 |
| SSC; SSC |
| 5 | 59,157 | 59,187 | F | 50 | –2 | 6.04 × 10−17 |
| LSC; LSC |
| 6 | 90,608 | 90,629 | F | 49 | –1 | 3.22 × 10−18 |
| IRb; IRb |
| 7 | 90,608 | 151,725 | P | 49 | –1 | 3.22 × 10−18 |
| IRb; IRa |
| 8 | 90,629 | 151,746 | P | 49 | –1 | 3.22 × 10−18 |
| IRb; IRa |
| 9 | 151,725 | 151,746 | F | 49 | –1 | 3.22 × 10−18 |
| IRa; IRa |
| 10 | 59,162 | 59,228 | F | 48 | –3 | 4.09 × 10−14 |
| LSC; LSC |
| 11 | 39,152 | 41,376 | F | 47 | –3 | 1.53 × 10−13 |
| LSC; LSC |
| 12 | 59,140 | 59,236 | F | 40 | –2 | 4.03 × 10−11 |
| LSC; LSC |
| 13 | 44,079 | 100,383 | F | 39 | –2 | 1.53 × 10−10 |
| LSC; IRb |
| 14 | 44,079 | 141,981 | P | 39 | –2 | 1.53 × 10−10 |
| LSC; IRa |
| 15 | 125,635 | 125,635 | P | 39 | –3 | 5.67 × 10−9 | IGS | SSC; SSC |
| 16 | 127,026 | 127,026 | P | 39 | –3 | 5.67 × 10−9 |
| SSC; SSC |
| 17 | 129,395 | 129,425 | F | 39 | –3 | 5.67 × 10−9 |
| SSC; SSC |
| 18 | 8,775 | 8,775 | P | 38 | 0 | 9.19 × 10−14 | IGS | LSC; LSC |
| 19 | 59,140 | 59,200 | F | 37 | –2 | 2.20 × 10−9 |
| LSC; LSC |
| 20 | 32,182 | 32,201 | P | 35 | –3 | 1.04 × 10−6 | IGS | LSC; LSC |
| 21 | 8,228 | 45,494 | P | 33 | –2 | 4.47 × 10−7 |
| LSC; LSC |
| 22 | 115,811 | 115,811 | R | 33 | –2 | 4.47 × 10−7 | IGS | SSC; SSC |
| 23 | 69,831 | 69,847 | F | 32 | –2 | 1.68 × 10−6 | IGS | LSC; LSC |
| 24 | 8,226 | 35,982 | F | 32 | –3 | 5.04 × 10−5 |
| LSC; LSC |
| 25 | 35,985 | 45,494 | P | 32 | –3 | 5.04 × 10−5 |
| LSC; LSC |
| 26 | 9,961 | 36,985 | F | 31 | –3 | 1.83 × 10−4 |
| LSC; LSC |
| 27 | 90,630 | 90,651 | F | 30 | –2 | 2.36 × 10−5 |
| IRb; IRb |
| 28 | 90,630 | 151,722 | P | 30 | –2 | 2.36 × 10−5 |
| IRb; IRa |
| 29 | 90,651 | 151,743 | P | 30 | –2 | 2.36 × 10−5 |
| IRb; IRa |
| 30 | 44,091 | 100,395 | F | 30 | –3 | 6.60 × 10−4 |
| LSC; IRb |
| 31 | 44,091 | 141,978 | P | 30 | –3 | 6.60 × 10−4 |
| LSC; IRa |
| 32 | 87,962 | 87,985 | F | 30 | –3 | 6.60 × 10−4 | IGS | IRb; IRb |
| 33 | 87,962 | 154,388 | P | 30 | –3 | 6.60 × 10−4 | IGS | IRb; IRa |
| 34 | 87,985 | 154,411 | P | 30 | –3 | 6.60 × 10−4 | IGS | IRb; IRa |
| 35 | 90,606 | 90,648 | F | 30 | –3 | 6.60 × 10−4 |
| IRb; IRb |
| 36 | 906,06 | 151,725 | P | 30 | –3 | 6.60 × 10−4 |
| IRb; IRa |
| 37 | 90,648 | 151,767 | P | 30 | –3 | 6.60 × 10−4 |
| IRb; IRa |
| 38 | 151,722 | 151,764 | F | 30 | –3 | 6.60 × 10−4 |
| IRa; IRa |
| 39 | 154,388 | 154,411 | F | 30 | –3 | 6.60 × 10−4 | IGS | IRa; IRa |
Note: F = forward, P = palindromic, R = reverse, C = complement, IGS = the intergenic spacers; Minimum length: 30 bp; Hamming distance: 3; Soft: vmatch-2.3.0.
Figure 2The type and number of SSR motif in A. cochinchinensis cp genome.
Figure 3The histogram of RSCU value in A. cochinchinensis cp genome. Note: The x-axis represents amino acids, the box below represents all codons corresponding to each amino acid, and the height of the y axis represents the sum of RSCU values for all codons.
Figure 4Boundary difference analysis of the quadripartite structure containing LSC, IRb, SSC, and IRa region of A. cochinchinensis and other five cp genomes of the genus Asparagus.
Figure 5Collinearity analysis of cp genomes among six Asparagus species using Mauve (http://darlinglab.org/mauve). Note: The collinear regions are shown in the same color as rectangular blocks. The rectangles indicate similarities among genomes, and the lines represent a collinear relationship between rectangles. The short squares represent the gene position of each genome. Genome regions are colored as CDs (white), tRNA (green), and rRNA (red).
Figure 6The Ka/Ks ratio of 84 protein-coding genes among the Asparagus cp genomes, including A. cochinchinensis and five other reported species.
Figure 7The PI value of the A. cochinchinensis cp gene. Note: The gene name is shown in abscissa, and the PI value is shown in ordinate.
Figure 8The phylogenetic tree was reconstructed with the entire cp genomes by the ML method, and Allium cyaneum (NC_058219.1) was used as the outgroup.