| Literature DB >> 35468142 |
Kwan-Ho Wong1,2, Bobby Lim-Ho Kong2,3, Tin-Yan Siu1, Hoi-Yan Wu3, Grace Wing-Chiu But2, Pang-Chui Shaw2,3,4, David Tai-Wai Lau1,3.
Abstract
Asparagus species are widely used for medicinal, horticultural, and culinary purposes. Complete chloroplast DNA (cpDNA) genomes of three Asparagus specimens collected in Hong Kong-A. aethiopicus, A. densiflorus 'Myers', and A. cochinchinensis-were de novo assembled using Illumina sequencing. Their sizes ranged from 157,069 to 157,319 bp, with a total guanine-cytosine content of 37.5%. Structurally, a large single copy (84,598-85,350 bp) and a small single copy (18,677-18,685 bp) were separated by a pair of inverted repeats (26,518-26,573 bp). In total, 136 genes were annotated for A. aethiopicus and A. densiflorus 'Myers'; these included 90 mRNA, 38 tRNA, and 8 rRNA genes. Further, 132 genes, including 87 mRNA, 37 tRNA, and 8 rRNA genes, were annotated for A. cochinchinensis. For comparative and phylogenetic analysis, we included NCBI data for four congenerics, A. setaceus, A. racemosus, A. schoberioides, and A. officinalis. The gene content, order, and genome structure were relatively conserved among the genomes studied. There were similarities in simple sequence repeats in terms of repeat type, sequence complementarity, and cpDNA partition distribution. A. densiflorus 'Myers' had distinctive long sequence repeats in terms of their quantity, type, and length-interval frequency. Divergence hotspots, with nucleotide diversity (Pi) ≥ 0.015, were identified in five genomic regions: accD-psaI, ccsA, trnS-trnG, ycf1, and ndhC-trnV. Here, we summarise the historical changes in the generic subdivision of Asparagus. Our phylogenetic analysis, which also elucidates the nomenclatural complexity of A. aethiopicus and A. densiflorus 'Myers', further supports their close phylogenetic relationship. The findings are consistent with prior generic subdivisions, except for the placement of A. racemosus, which requires further study. These de novo assembled cpDNA genomes contribute valuable genomic resources and help to elucidate Asparagus taxonomy.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35468142 PMCID: PMC9037925 DOI: 10.1371/journal.pone.0266376
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Information about the Asparagus specimens deposited at the Shiu-Ying Hu Herbarium.
| Species | Collector no. | Inventory no. | Sheet no. | GPS location |
|---|---|---|---|---|
| K. H. Wong 109 | CUSLSH2801 | CUHK05891 | 22.420786, 114.208312 | |
| K. H. Wong 092 | CUSLSH2773 | CUHK05890 | 22.419994, 114.207354 | |
| K. H. Wong 107 | CUSLSH2799 | CUHK05892 | 22.421524, 114.207135 |
Fig 1Photos of three Asparagus plants collected at the Chinese University of Hong Kong.
A,B: A. aethiopicus. A. Plant climbing under Ficus microcarpa L. f. and twining with Passiflora suberosa L. B. Flowers and cladodes. C,D: A. cochinchinensis. C. Plant straggling on ground. D. Cladodes. E,F,G: A. densiflorus ‘Myers’. E. Plant growing in a concrete pot. F. Flowers and cladodes. G. Fruits and branch apices.
Summary on the cpDNA genome structure of the seven Asparagus species.
| Accession no. | MZ337394 | MZ337395 | MZ424304 | NC_034777 | NC_047472 | NC_035969 | NC_047458 |
|
|
|
|
|
|
|
|
|
| LSC (bp) | 85,246 | 85,350 | 84,598 | 84,999 | 84,989 | 84,928 | 85,311 |
| SSC (bp) | 18,677 | 18,677 | 18,685 | 18,638 | 18,619 | 18,685 | 18,641 |
| IR (bp) | 26,573 | 26,556 | 26,518 | 26,531 | 26,567 | 26,631 | 26,513 |
|
|
|
|
|
|
|
|
|
| mRNA | 90 | 90 | 87 | 88 | 86 | 88 | 90 |
| tRNA | 38 | 38 | 37 | 37 | 36 | 36 | 37 |
| rRNA | 8 | 8 | 8 | 8 | 8 | 8 | 8 |
| Pseudogene (Ψ) | 1 | 1 | 0 | 7 | 1 | 1 | 1 |
| 1-intron gene | 20 | 21 | 21 | 21 | 21 | 20 | 19 |
| 2-introns gene | 2 | 2 | 2 | 2 | 2 | 2 | 2 |
|
|
|
|
|
|
|
|
|
| GC content in LSC (%) | 35.44 | 35.43 | 35.54 | 35.60 | 35.53 | 35.55 | 35.46 |
| GC content in SSC (%) | 31.30 | 31.31 | 31.38 | 31.50 | 31.43 | 31.51 | 31.45 |
| GC content in IR (%) | 42.94 | 42.93 | 42.90 | 42.92 | 42.92 | 42.93 | 42.85 |
a ycf1
b ycf1, ycf15 (x2), ycf68 (x2), infA, rps19.
Fig 2Chloroplast genome map of A. aethiopicus L., A. densiflorus (Kunth) Jessop ‘Myers’, and A. cochinchinensis (Lour.) Merr.
Genes are colour-coded based on their functions shown in the key. Genes located outside of the outer circle are transcribed anticlockwise, while those inside are transcribed clockwise. In the inner circle, the gradient in dark grey represents GC content, whereas light grey represents AT content.
Genes annotated in the complete cpDNA genomes of A. aethiopicus L., A. densiflorus (Kunth) Jessop ‘Myers’, and A. cochinchinensis (Lour.) Merr.
| Gene category | Gene functions | Gene names |
|---|---|---|
| Photosynthesis-related genes | Rubisco |
|
| Photosystem I | ||
| Assembly/ stability of photosystem I | ||
| Photosystem II | ||
| ATP synthase | ||
| Cytochrome b/f complex | ||
| Cytochrome c synthesis |
| |
| NADPH dehydrogenase | ||
| Transcription- and translation-related genes | Transcription | |
| Ribosomal protein | ||
| Translation initiation factor |
| |
| RNA genes | Ribosomal RNA | |
| Transfer RNA | ||
| Miscellaneous group | Maturase |
|
| Inner membrane protein |
| |
| ATP-dependent protease |
| |
| Acetyl-CoA carboxylase |
| |
| Unknown functions |
% Duplicated in inverted repeat regions
* Duplicated in large single copies of A. densiflorus ‘Myers’ and A. cochinchinensis; appeared once in A. aethiopicus
$ Duplicated in large single copies of A. aethiopicus and A. densiflorus ‘Myers’; appeared once in A. cochinchinensis
@ ycf1 was functional in all three species, but the ycf1 pseudogene was absent from A. cochinchinensis
# Duplicated in inverted repeat regions of A. aethiopicus and A. densiflorus ‘Myers’; absent from A. cochinchinensis.
Intron-containing genes in the chloroplast genomes of seven Asparagus species.
| Location | ||||||||
|---|---|---|---|---|---|---|---|---|
| Accession no. | MZ337394 | MZ337395 | MZ424304 | NC_034777 | NC_047472 | NC_035969 | NC_047458 | / |
|
| 0 | 1 | 1 | 1 | 1 | 1 | 0 | LSC |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | LSC |
| 1 | 1 | 1 | 1 | 1 | 1 | ABS | LSC | |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | LSC |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | LSC |
| 2 | 2 | 2 | 2 | 2 | 2 | 2 | LSC | |
|
| 1 | 1 | 1 | 1 | 1 | ABS | 1 | LSC |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | LSC |
|
| 2 | 2 | 2 | 2 | 2 | 2 | 2 | LSC |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | LSC |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | LSC |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | LSC |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | IRA + IRB |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | IRA + IRB |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | IRA + IRB + LSC |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | IRA + IRB |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | IRA + IRB |
|
| 1 | 1 | 1 | 1 | 1 | 1 | 1 | SSC |
0—No intron; 1–1 intron; 2–2 introns; ABS—Gene absent.
A Annotated as clpP1.
B Located in the region 9167–9994 bp; for NC 047458, trnG-UCC, at 36924–36994 bp, had no intron.
C pafI was annotated in A. aethiopicus, A. densiflorus ‘Myers’, and A. cochinchinensis.
Fig 3Simple sequence repeat class distribution.
Fig 4Simple sequence repeat frequency related to sequence complementarity.
Fig 5Simple sequence repeat distribution in the quadripartite cpDNA structure.
The percentages for each region are shown in the middle of each bar. The numbers in brackets are the actual numbers of SSRs distributed in the indicated cpDNA regions.
Fig 6Types of long sequence repeats.
Fig 7Frequency of long sequence repeats in specified length intervals.
Fig 8Large single copy (LSC), small single copy (SSC), and inverted repeat (IR) boundary comparison for the seven Asparagus cpDNA genomes.
Numbers in bold indicate the size of the gene (or gene section) within the specified regions. The numbers next to the dashed arrows indicate distances from the specified junctions. Numbers within the coloured bands indicate the lengths of the respective regions. The direction of gene transcription is presented by the obtuse angles of the pentagons. Ψ, pseudogene. Not to scale.
Fig 9Complete cpDNA genome nucleotide diversity for the seven Asparagus species.
X-axis: window midpoint; Y-axis: nucleotide diversity value (Pi) for each window. Divergence hotspots (Pi > 0.015) are labelled in red above the corresponding position.
Fig 10Maximum likelihood (ML) trees based on Asparagus cpDNA genomes.
Numbers next to the nodes: bootstrap values based on complete cpDNA genomes/LSC/SSC/CDS sequences. The topologies are identical. Bold taxa: the three newly assembled cpDNA genomes.
Fig 11Maximum likelihood (ML) trees based on inverted repeats (IRs) for Asparagus.
Numbers next to the nodes: bootstrap values on IRA and IRB. Bold taxa: the three newly assembled cpDNA genomes.