| Literature DB >> 31666958 |
Song Xue1,2, Ting Shi1, Wenjie Luo1, Xiaopeng Ni1, Shahid Iqbal1, Zhaojun Ni1, Xiao Huang1, Dan Yao1, Zhijun Shen2, Zhihong Gao1.
Abstract
Prunus mume Sieb. et Zucc., P. armeniaca L., and P. salicina L. are economically important fruit trees in temperate regions. These species are taxonomically perplexing because of shared interspecific morphological traits and variation, which are mainly attributed to hybridization. The chloroplast is cytoplasmically inherited and often used for evolutionary studies. We sequenced the complete chloroplast genomes of P. mume, P. armeniaca, and P. salicina using Illumina sequencing followed by de novo assembly. The three chloroplast genomes exhibit a typical quadripartite structure with conserved genome arrangement, structure, and moderate divergence. The lengths of the genomes are 157,815, 157,797, and 157,916 bp, respectively. The length of the large single-copy region (LSC) region is 86,113, 86,283, and 86,122 bp, and the length of the SSC region is 18,916, 18,734, and 19,028 bp; the IR region is 26,393, 26,390, and 26,383 bp, respectively. Each of the three chloroplast genomes encodes 133 genes, including 94 protein-coding, 31 tRNA, and eight rRNA genes. Differential gene analysis for the three species revealed that trnY-ATA is a unique gene in P. armeniaca; in contrast, the gene trnI-TAT is only present in P. mume and P. salicina, though the position of the gene in these chloroplast genomes differs. Further comparative analysis of the complete chloroplast genome sequences revealed that the ORF genes and the sequences of linked regions rps16 and atpA, atpH and atpI, trnc-GCA and psbD, ycf3 and atpB, and rpL32 and ndhD are significantly different and may be used as molecular markers in taxonomic studies. Phylogenetic evolution analysis of the three species suggests that P. mume has a closer genetic relationship to P. armeniaca than to P. salicina.Entities:
Keywords: Evolution; Genome
Year: 2019 PMID: 31666958 PMCID: PMC6804877 DOI: 10.1038/s41438-019-0171-1
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 6.793
Summary statistics for the assembly of five Prunus species chloroplast genomes
| Genome features |
|
|
|
|
|
|---|---|---|---|---|---|
| Genome size (bp) | 157,797 | 157,815 | 157,916 | 157,834 | 157,790 |
| LSC size (bp) | 86,283 | 86,113 | 86,122 | 85,964 | 85,968 |
| SSC size (bp) | 18,734 | 18,916 | 19,028 | 19,084 | 19,060 |
| IR size (bp) | 26,390 | 26,393 | 26,383 | 26,393 | 26,381 |
| Number of genes | 133 (110) | 133 (110) | 133 (110) | 131 (111) | 130 (110) |
| Protein genes [unique] | 94 (80) | 94 (80) | 94 (80) | 86 (78) | 85 (77) |
| tRNA genes [unique] | 31 (26) | 31 (26) | 31 (26) | 37 (29) | 37 (29) |
| rRNA genes [unique] | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) |
| Duplicated genes in IR | 18 | 18 | 18 | 16 | 14 |
| GC content (%) | 36.75 | 36.74 | 36.74 | 37 | 37 |
| GC content in LSC (%) | 34.54 | 34.58 | 34.58 | 35 | 35 |
| GC content in SSC (%) | 30.43 | 30.35 | 30.40 | 30 | 30 |
| GC content in IR (%) | 42.57 | 42.56 | 42.59 | 43 | 43 |
| Total reads | 23,517,590 | 44,218,598 | 23,901,827 | – | – |
| Aligned paired-end reads | 662,524 | 1,107,094 | 824,216 | – | – |
| Assembled reads | 362,494 | 468,122 | 419,364 | – | – |
| Average organelle coverage | 634 | 1059 | 788 | – | – |
| Average insert size (bp) | 296 | 297 | 298 | – | – |
Fig. 1Chloroplast genome maps of three Prunus species.
a P. salicina chloroplast genome. b P. mume chloroplast genome. c P. armeniaca chloroplast genome. Genes shown outside the circle are transcribed clockwise and those inside counterclockwise. Genes belonging to different functional groups are color-coded
Fig. 2Comparison of the borders of LSC, SSC, and IR regions of chloroplast genomes in five Prunus species
List of annotated genes in P. mume, P. armeniaca, and P. salicina chloroplast genomes
| Category | Group of gene | Name of gene |
|---|---|---|
| Photosynthetic | Subunits of photosystem I |
|
| Submits of photosystem II |
| |
| Subunits of NADH dehydrogenase |
| |
| Subunits of cytochrome b/f complex |
| |
| Subunits of ATP synthase |
| |
| Large subunit of rubisco |
| |
| Self-replication | Proteins of large ribosomal subunit |
|
| Proteins of small ribosomal subunit |
| |
| Subunits of RNA polymerase |
| |
| Ribosomal RNAs |
| |
| Transfer RNAs |
| |
| Biosynthesis | Maturase |
|
| Protease |
| |
| Envelope membrane protein |
| |
| Acetyl-CoA carboxylase |
| |
| c-type cytochrome synthesis gene |
| |
| Translation initiation factor |
| |
| Unknown function | Conserved hypothetical chloroplast Reading Frames |
|
Asterisk denotes the trnY-ATA gene is trnY-ATA in P. armeniaca but is trnI-TAT in P. mume and P. salicina
Information on 12 intron-containing genes in the chloroplast genome of Prunus species
| Gene | Location | Exon I (bp) | Intron I (bp) | Exon II (bp) | Intron II (bp) | Exon III (bp) |
|---|---|---|---|---|---|---|
|
| LSC | 38 | 83 | 42 | ||
|
| LSC | 1787 | 31 | 323 | ||
|
| LSC | 126 | 713 | 229 | 764 | 149 |
|
| LSC | 147 | 683 | 466 | ||
|
| IR | 384 | 648 | 469 | ||
|
| IR | 200 | 295 | 110 | ||
|
| LSC | 73 | 805 | 296 | 648 | 222 |
|
| LSC | 380 | 64 | 123 | ||
|
| SSC | 555 | 1147 | 535 | ||
|
| IR | 869 | 588 | 752 | ||
|
| LSC | 455 | 755 | 1613 | ||
|
| SSC | 48 | 77 | 34 |
Summary of repeat sequences and SSRs in five Prunus accessions
| Species |
|
|
|
|
|
|---|---|---|---|---|---|
| Total number | 39 | 34 | 38 | 48 | 49 |
| Forward | 14 | 10 | 12 | 18 | 19 |
| Palindromic | 13 | 13 | 15 | 20 | 20 |
| Reverse | 12 | 11 | 11 | 10 | 10 |
| SSR loci (N) | 59 | 54 | 49 | 57 | 57 |
| P1a locia (N) | 52 | 49 | 44 | 51 | 48 |
| P2b loci (N) | 4 | 4 | 1 | 3 | 2 |
| Pcc loci (N) | 3 | 1 | 4 | 3 | 7 |
| LSC | 51 | 47 | 41 | 49 | 48 |
| SSC | 6 | 5 | 6 | 6 | 7 |
| IRa | 1 | 1 | 1 | 1 | 1 |
| IRb | 1 | 1 | 1 | 1 | 1 |
asingle-nucleotide SSRs
bdouble-nucleotide SSRs
ccomplex-nucleotide SSRs
Fig. 3Length distribution of repeat sequences in Prunus species
Fig. 4Analysis of simple repetitive sequences in five Prunus chloroplast genomes
Fig. 5The complete sequence alignment map of 26 chloroplast genomes of Rosaceae.
The vertical axis indicates sequence alignment similarity of 50–100%. The green color indicates the chloroplast genome LSC region, the yellow color the chloroplast genome IR region, and the blue color the SSC region. Notes: First grade: rps16-atpA, atpH-atpI, trnc-GCA-psbD, ycf3-atpB, and rpL32-ndhD. Second grade: trnH-GTG-matK, PsbZ-PsbB, rbcL-accD, psaI-cemA, psbJ-psbB, psbT-rps3, ndhG-ndhH, and rps15-ycf1. Third grade: trnL-CAT-ycf15, trnN-GTT, trnR-GTT-ndhF, and ndhB
Fig. 6Phylogenetic trees of the Prunus species based on the chloroplast genome by MP.
a Phylogenetic tree constructed using the complete chloroplast genome data. b Phylogenetic tree constructed using coding region data. c Phylogenetic tree constructed using LSC data. d Phylogenetic tree constructed using intron data. e Phylogenetic tree constructed using IR data