| Literature DB >> 25705213 |
Xiaochen Chen1, Qiushi Li1, Ying Li1, Jun Qian1, Jianping Han1.
Abstract
The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. Our results presented in this paper will facilitate the phylogenetic studies and molecular authentication on Aconitum.Entities:
Keywords: Aconitum; PacBio RS; chloroplast genome; circular consensus sequencing; the third generation sequencing
Year: 2015 PMID: 25705213 PMCID: PMC4319492 DOI: 10.3389/fpls.2015.00042
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of the sequencing data for PacBio SMRT.
| Raw data | Clean data | ||
|---|---|---|---|
| Number of reads | 300,582 | 23,498 | |
| Number of nucleotides (bp) | 673,594,247 | 20,685,462 | |
| Longest read length (bp) | 11,914 | 2,261 | |
| Mean read length (bp) | 2,241 | 880 | |
| Genome coverage % | 100% | ||
| Average depth of coverage | 132× | ||
| Number of contigs | 1 | ||
| Whole cp genome length (bp) | 156,749 | ||
| Run time | 45 min×2 | ||
| Total DNA requirements (ng) | 700 |
General information of the Aconitum barbatum var. puberulum chloroplast genome.
| Length and GC content of the four regions | ||||
|---|---|---|---|---|
| Length (bp) | 156,749 | 87,630 | 16,941 | 26,089 |
| GC content (%) | 38.7 | 36.1 | 32.7 | 43.0 |
| Number of genes | 130 | 84 | 31 | 4 |
| Number of intron-containing | 18 | 12 | 6 | 0 |
The genes with introns in the A. barbatum var. puberulum chloroplast genome and the length of the exons and introns.
| Gene | Location | Exon (bp) | Intron (bp) | Exon (bp) | Intron (bp) | Exon (bp) |
|---|---|---|---|---|---|---|
| LSC | 35 | 2528 | 37 | |||
| LSC | 251 | 887 | 40 | |||
| LSC | 23 | 738 | 48 | |||
| LSC | 407 | 779 | 124 | |||
| LSC | 1612 | 758 | 428 | |||
| LSC | 155 | 746 | 240 | 735 | 124 | |
| LSC | 35 | 495 | 50 | |||
| LSC | 37 | 597 | 39 | |||
| LSC | 246 | 663 | 289 | 833 | 71 | |
| LSC | 6 | 948 | 489 | |||
| LSC | 8 | 704 | 496 | |||
| LSC | 399 | 1069 | 9 | |||
| IR | 434 | 667 | 391 | |||
| IR | 756 | 702 | 777 | |||
| IR | 42 | 937 | 35 | |||
| IR | 38 | 802 | 35 | |||
| SSC | 539 | 1003 | 553 | |||
| LSC | 114 | 232 | 544 | 26 |
The codon–anticodon recognition pattern and codon usage for the A. barbatum var. puberulum chloroplast genome.
| Animo acid | Codon | No. | RSCU | tRNA | Animo acid | Codon | No. | RSCU | tRNA |
|---|---|---|---|---|---|---|---|---|---|
| Phe | UUU | 834 | 1.23 | Tyr | UAU | 714 | 1.59 | ||
| Phe | UUC | 519 | 0.77 | Tyr | UAC | 185 | 0.41 | ||
| Leu | UUA | 765 | 1.77 | Stop | UAA | 36 | 1.29 | ||
| Leu | UUG | 554 | 1.29 | Stop | UAG | 26 | 0.93 | ||
| Leu | CUU | 542 | 1.26 | His | CAU | 488 | 1.52 | ||
| Leu | CUC | 192 | 0.45 | His | CAC | 153 | 0.48 | ||
| Leu | CUA | 356 | 0.83 | Gln | CAA | 640 | 1.5 | ||
| Leu | CUG | 177 | 0.41 | Gln | CAG | 215 | 0.5 | ||
| Ile | AUU | 1017 | 1.46 | Asn | AAU | 901 | 1.54 | ||
| Ile | AUC | 430 | 0.62 | Asn | AAC | 267 | 0.46 | ||
| Ile | AUA | 646 | 0.93 | Lys | AAA | 889 | 1.44 | ||
| Met | AUG | 602 | 1 | Lys | AAG | 345 | 0.56 | ||
| Val | GUU | 507 | 1.45 | Asp | GAU | 809 | 1.6 | ||
| Val | GUC | 157 | 0.45 | Asp | GAC | 200 | 0.4 | ||
| Val | GUA | 527 | 1.51 | Glu | GAA | 924 | 1.45 | ||
| Val | GUG | 208 | 0.59 | Glu | GAG | 348 | 0.55 | ||
| Ser | UCU | 538 | 1.67 | Cys | UGU | 220 | 1.5 | ||
| Ser | UCC | 335 | 1.04 | Cys | UGC | 73 | 0.5 | ||
| Ser | UCA | 390 | 1.21 | Stop | UGA | 22 | 0.79 | ||
| Ser | UCG | 191 | 0.59 | Trp | UGG | 432 | 1 | ||
| Pro | CCU | 406 | 1.52 | Arg | CGU | 348 | 1.38 | ||
| Pro | CCC | 214 | 0.8 | Arg | CGC | 86 | 0.34 | ||
| Pro | CCA | 311 | 1.16 | Arg | CGA | 346 | 1.38 | ||
| Pro | CCG | 138 | 0.52 | Arg | CGG | 114 | 0.45 | ||
| Thr | ACU | 502 | 1.57 | Arg | AGA | 451 | 1.79 | ||
| Thr | ACC | 240 | 0.75 | Arg | AGG | 163 | 0.65 | ||
| Thr | ACA | 393 | 1.23 | Ser | AGU | 372 | 1.15 | ||
| Thr | ACG | 140 | 0.44 | Ser | AGC | 108 | 0.34 | ||
| Ala | GCU | 589 | 1.73 | Gly | GGU | 592 | 1.36 | ||
| Ala | GCC | 220 | 0.65 | Gly | GGC | 177 | 0.41 | ||
| Ala | GCA | 382 | 1.12 | Gly | GGA | 694 | 1.59 | ||
| Ala | GCG | 171 | 0.5 | Gly | GGG | 282 | 0.65 |
Repeated sequences in the A. barbatum var. puberulum chloroplast genome.
| Repeat number | Size (bp) | Type | Location | Repeat unit | Region |
|---|---|---|---|---|---|
| 1 | 52 | F | AGAAAAAGAATTGCAATAGCTAAATGG(A)TGA(G)TGA(C)GCAATATCGGTCAGCCATA | LSC | |
| 2 | 39 | F | CAGAACCGTACATGAGATTTTCACCTCATACGGCTCCTC | LSC, Ira | |
| 3 | 33 | F | IGS( | TAAAC(A)GGAA(G)AGAGAGGGATTCGAACCCTCGG(A)TA | LSC |
| 4 | 32 | F | IGS( | TTT(G)TTCT(A)T(A)CTTGATTTAGATTCTCTAATTCAA | SSC |
| 5 | 39 | P | CAGAACCGTACATGAGATTTTCACCTCATACGGCTCCTC | LSC, Irb | |
| 6 | 33 | P | IGS( | GTAAGAATAAGAACTCAATGGACCTTGCCCCTC | LSC |
| 7 | 30 | P | ACGGAAAGAGAGGGATTCGAACCCTCGGTA | LSC | |
| 8 | 31 | P | IGS( | TCT(G)ATT(A)TC(A)TTATTTCTATATATTCTAATGAT | LSC |
| 9 | 30 | P | TAC(A)CGAGGGTTCGAATCCCTCTCTT(G)TCCG(A)T | LSC | |
| 10 | 30 | T | TAATAGTATATATAG(×2) | LSC | |
| 11 | 34 | T | IGS( | TTTTATTCTATTTATTA(×2) | LSC |
| 12 | 40 | T | IGS | GTTATTGTAGGAGTGAAATC(×2) | LSC |
| 13 | 30 | T | IGS | ATAGTCATTATAATG(×2) | LSC |
| 14 | 36 | T | IGS | TTTCTATTGTTGTATCCA(×2) | LSC |
| 15 | 40 | T | IGS | TTTATTATATAAAATATTAA(×2) | LSC |
| 16 | 42 | T | ycf2(CDS) | AGATAATGAACTATTCAAAGA(2) | IRa,b |
| 17 | 30 | T | IGS | TATTACCTATTATAT(×2) | SSC |