| Literature DB >> 27375965 |
Tong-Jian Liu1, Cai-Yun Zhang1, Hai-Fei Yan2, Lu Zhang2, Xue-Jun Ge2, Gang Hao1.
Abstract
Species-rich genus Primula L. is a typical plant group with which to understand genetic variance between species in different levels of relationships. Chloroplast genome sequences are used to be the information resource for quantifying this difference and reconstructing evolutionary history. In this study, we reported the complete chloroplast genome sequence of Primula sinensis and compared it with other related species. This genome of chloroplast showed a typical circular quadripartite structure with 150,859 bp in sequence length consisting of 37.2% GC base. Two inverted repeated regions (25,535 bp) were separated by a large single-copy region (82,064 bp) and a small single-copy region (17,725 bp). The genome consists of 112 genes, including 78 protein-coding genes, 30 tRNA genes and four rRNA genes. Among them, seven coding genes, seven tRNA genes and four rRNA genes have two copies due to their locations in the IR regions. The accD and infA genes lacking intact open reading frames (ORF) were identified as pseudogenes. SSR and sequence variation analyses were also performed on the plastome of Primula sinensis, comparing with another available plastome of P. poissonii. The four most variable regions, rpl36-rps8, rps16-trnQ, trnH-psbA and ndhC-trnV, were identified. Phylogenetic relationship estimates using three sub-datasets extracted from a matrix of 57 protein-coding gene sequences showed the identical result that was consistent with previous studies. A transcript found from P. sinensis transcriptome showed a high similarity to plastid accD functional region and was identified as a putative plastid transit peptide at the N-terminal region. The result strongly suggested that plastid accD has been functionally transferred to the nucleus in P. sinensis.Entities:
Keywords: Chloroplast genome; Gene transfer; Illumina sequencing; Primula sinensis
Year: 2016 PMID: 27375965 PMCID: PMC4928469 DOI: 10.7717/peerj.2101
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 3Bar plot that compares potential marker regions with PICs and genetic diversity (Pi).
PICs were determined as the sum of nucleotide substitutions and indels. Barcode rbcL + matK + ITS was used here as a reference as proposed previously for the barcoding analysis of genus Primula (Yan et al., 2015).
Comparison of the general features of four plastomes in Primulaceae.
| Total size | 150,859 | 151,664 | 155,386 | 156,506 |
| LSC size | 82,064 | 83,444 | 85,229 | 86,078 |
| SSC size | 17,725 | 17,822 | 17,951 | 18,328 |
| IR size | 25,535 | 25,199 | 26,103 | 26,050 |
| Total A content | 31.1 | 31.1 | 31.1 | 31.2 |
| Total T content | 31.7 | 31.8 | 31.8 | 31.8 |
| Total C content | 19 | 18.8 | 18.9 | 18.9 |
| Total G content | 18.2 | 18.2 | 18.2 | 18.2 |
| LSC GC content | 35.2 | 34.9 | 35 | 34.9 |
| SSC GC content | 30.5 | 30.1 | 30.5 | 30.2 |
| IR GC content | 42.8 | 42.9 | 42.9 | 43 |
| Total GC content | 37.2 | 37 | 37.1 | 37.1 |
| Total | 112 | 113 | 113 | 114 |
| Protein encoding | 78 | 79 | 79 | 80 |
| tRNA | 30 | 30 | 30 | 30 |
| rRNA | 4 | 4 | 4 | 4 |
Gene contents in Primula sinensis chloroplast genome (112 genes, two pseudogenes).
| Category | Class | Gene |
|---|---|---|
| DNA-dependent RNA polymerase | ||
| Maturase | ||
| Large ribosomal subunits | ||
| Small ribosomal subunits | ||
| Protease | ||
| Ribosomal RNAs | ||
| Transfer RNAs | ||
| Photosystem I | ||
| Photosystem II | ||
| NAD(P)H dehydrogenase complex | ||
| F-type ATP synthase | ||
| PS I assembly factor | ||
| Cytochrome b6/f complex | ||
| Inner membrane protein | ||
| Cytochrome c biogenesis protein | ||
| Large subunit of Rubisco | ||
| Function unknown | ||
| Subunit of acetyl-CoA-carboxylase | ||
| Translation initiation factor |
Notes:
Represent gene with one intron.
Represent gene with two introns; “x2” represent gene location within IR region; Bold represent genes with alternative splicing.
Figure 1Chloroplast genome map of Primula sinensis (GenBank accession number KU321892).
Genes on the outside of the outer circle indicate the clockwise direction of transcription; those on the inside indicate the counterclockwise direction. The bar graphs on the inner circle reveal GC content in dark grey with the 50% threshold line.
Figure 2Comparison of the SC-IR boundary of four plastomes from Primulaceae.
Number of chloroplast SSRs present in two Primula species.
| Taxon | Length (bp) | GC% | Number of SSRs | |||
|---|---|---|---|---|---|---|
| Homo (> 8 units) | Di (> 4 units) | Tri (> 4 units) | Total | |||
| 150,859 | 37.20% | 148 | 44 | 1 | 193 | |
| 151,664 | 37% | 129 | 46 | 2 | 177 | |
Figure 4Alignment of the putative nuclear accD from Primula sinensis and the plastid accD from Primula poissonii, Androsace bulleyana, Lysimachia coreana and Ardisia polystica.
(A) indicates the putative transit peptide at the N-terminal region of nuclear accD. (B, C, D) indicates three putative motifs, acetyl-CoA binding sites, carboxybiotin binding sites and carboxyltransferase catalytic sites, respectively.
Figure 5Phylogenetic analysis of accD sequences from six families in Ericales using the Maximum likelihood.
The underline indicates the putative n-accD gene of Primula sinensis.
Figure 6The ML tree produced by chloroplast genome sequence of Primula sinensis and other nine species.
Numbers associated with each node are bootstrap values with 500 replicates.