| Literature DB >> 28698879 |
Yang He1, Hongtao Xiao2, Cao Deng3, Gang Fan1, Shishang Qin3, Cheng Peng1.
Abstract
The Coptis chinensis Franch. is an important medicinal plant from the Ranunculales. We used next generation sequencing technology to determine the complete chloroplast genome of C. chinensis. This genome is 155,484 bp long with 38.17% GC content. Two 26,758 bp long inverted repeats separated the genome into a typical quadripartite structure. The C. chinensis chloroplast genome consists of 128 gene loci, including eight rRNA gene loci, 28 tRNA gene loci, and 92 protein-coding gene loci. Most of the SSRs in C. chinensis are poly-A/T. The numbers of mononucleotide SSRs in C. chinensis and other Ranunculaceae species are fewer than those in Berberidaceae species, while the number of dinucleotide SSRs is greater than that in the Berberidaceae. C. chinensis diverged from other Ranunculaceae species an estimated 81 million years ago (Mya). The divergence between Ranunculaceae and Berberidaceae was ~111 Mya, while the Ranunculales and Magnoliaceae shared a common ancestor during the Jurassic, ~153 Mya. Position 104 of the C. chinensis ndhG protein was identified as a positively selected site, indicating possible selection for the photosystem-chlororespiration system in C. chinensis. In summary, the complete sequencing and annotation of the C. chinensis chloroplast genome will facilitate future studies on this important medicinal species.Entities:
Mesh:
Year: 2017 PMID: 28698879 PMCID: PMC5494076 DOI: 10.1155/2017/8201836
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Genome sequencing and assembly of the C. chinensis chloroplast genome.
| Sequencing | Raw data | 2.13 G |
| Raw reads (pair) | 10,640,000 | |
| Read length (bp) | 2 | |
| Clean data | 2.12 G | |
| Clean reads (pair) | 10,624,225 | |
|
| ||
| Assembly | Total size | 168,210 bp |
| Contig num | 43 | |
| Average length | 3,911 bp | |
| GC contents | 38.77% | |
| N50 length | 47,033 bp | |
| Min contig length | 511 bp | |
| Max contig length | 64,014 bp | |
|
| ||
| Gap closing | Total size | 155,484 bp |
| Scaffold number | 1 | |
| GC contents | 38.17% | |
Figure 1Genome schema of the C. chinensis chloroplast genome. Genes on the outer side the circle transcribe clockwise, while genes on the inner side transcribe counterclockwise. Genes from different functional groups are colored with different color.
List of genes in the C. chinensis chloroplast genome. Numbers in the parentheses indicate the copy number in the genome.
| Groups | Name of genes |
|---|---|
| Biosynthesis of cofactors |
|
| Cellular processes |
|
| Conserved hypothetical plastid |
|
| Energy metabolism |
|
| Fatty acid metabolism |
|
| Hypothetical or uncharacterized |
|
| Photosynthesis |
|
| Ribosomal RNA |
|
| Transcription |
|
| Translation |
|
| Transporters |
|
| tRNA |
|
Statistics of chloroplast SSRs detected in 24 species. p1, mononucleotide SSRs; p2, dinucleotide SSRs; p3, trinucleotide SSRs; c, compound SSRs; A, adenine; G, guanine; T, thymine; C, cytosine.
| Species | Total | p1 | p2 | p3 | c | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| All | (A)n | (C)n | (G)n | (T)n | All | (AT)n | (TA)n | ||||
|
| 68 | 61 | 26 | 1 | 1 | 33 | 2 | 1 | 1 | 0 | 5 |
|
| 73 | 67 | 26 | 1 | 1 | 39 | 2 | 1 | 1 | 0 | 4 |
|
| 63 | 55 | 22 | 1 | 1 | 31 | 2 | 1 | 1 | 0 | 6 |
|
| 65 | 59 | 25 | 1 | 1 | 32 | 2 | 1 | 1 | 0 | 4 |
|
| 69 | 62 | 28 | 1 | 1 | 32 | 2 | 1 | 1 | 0 | 5 |
|
| 68 | 60 | 26 | 1 | 1 | 32 | 2 | 2 | 0 | 0 | 6 |
|
| 33 | 27 | 10 | 0 | 1 | 16 | 2 | 2 | 0 | 0 | 4 |
|
| 52 | 48 | 19 | 0 | 0 | 29 | 1 | 0 | 1 | 0 | 3 |
|
| 71 | 59 | 26 | 1 | 0 | 32 | 1 | 0 | 1 | 1 | 10 |
|
| 72 | 62 | 29 | 0 | 0 | 33 | 2 | 1 | 0 | 2 | 6 |
|
| 73 | 66 | 30 | 0 | 0 | 36 | 2 | 1 | 0 | 2 | 3 |
|
| 75 | 71 | 39 | 0 | 0 | 32 | 1 | 1 | 0 | 2 | 1 |
|
| 32 | 28 | 13 | 0 | 0 | 15 | 3 | 3 | 0 | 1 | 0 |
|
| 50 | 39 | 18 | 0 | 1 | 20 | 4 | 0 | 4 | 1 | 6 |
|
| 34 | 21 | 9 | 2 | 0 | 10 | 10 | 4 | 6 | 0 | 3 |
|
| 50 | 44 | 19 | 2 | 0 | 23 | 4 | 0 | 4 | 1 | 1 |
|
| 38 | 36 | 10 | 0 | 0 | 26 | 2 | 2 | 0 | 0 | 0 |
|
| 47 | 38 | 16 | 0 | 0 | 22 | 5 | 3 | 2 | 0 | 4 |
|
| 53 | 42 | 17 | 2 | 0 | 23 | 8 | 3 | 5 | 0 | 3 |
|
| 40 | 38 | 20 | 2 | 1 | 15 | 0 | 0 | 0 | 0 | 1 |
|
| 16 | 16 | 8 | 0 | 0 | 8 | 0 | 0 | 0 | 0 | 0 |
|
| 65 | 57 | 27 | 0 | 0 | 30 | 0 | 0 | 0 | 0 | 8 |
|
| 49 | 41 | 13 | 2 | 0 | 26 | 3 | 2 | 1 | 1 | 4 |
|
| 56 | 47 | 18 | 2 | 0 | 27 | 3 | 2 | 1 | 1 | 5 |
Figure 2Phylogenetic tree and estimated divergence time based on chloroplast genomes from Ranunculales and Magnoliaceae.
Figure 3Multisequence alignments and positive selection of the ndhG gene. Blue branch is the foreground branch used in the branch-site test implemented, while yellow background color indicates the positively selected site (position 104 based on the C. chinensis ndhG protein sequence).