| Literature DB >> 34899783 |
Yifan Yu1,2, Zhen Ouyang1, Juan Guo2, Wen Zeng2, Yujun Zhao2, Luqi Huang1,2.
Abstract
Erigeron breviscapus is a famous medicinal plant. However, the limited chloroplast genome information of E. breviscapus, especially for the chloroplast DNA sequence resources, has hindered the study of E. breviscapus chloroplast genome transformation. Here, the complete chloroplast (cp) genome of E. breviscapus was reported. This genome was 152,164bp in length, included 37.2% GC content and was structurally arranged into two 24,699bp inverted repeats (IRs) and two single-copy areas. The sizes of the large single-copy region and the small single-copy region were 84,657 and 18,109bp, respectively. The E. breviscapus cp genome consisted of 127 coding genes, including 83 protein coding genes, 36 transfer RNA (tRNA) genes, and eight ribosomal RNA (rRNA) genes. For those genes, 95 genes were single copy genes and 16 genes were duplicated in two inverted regions with seven tRNAs, four rRNAs, and five protein coding genes. Then, genomic DNA of E. breviscapus was used as a template, and the endogenous 5' and 3' flanking sequences of the trnI gene and trnA gene were selected as homologous recombinant fragments in vector construction and cloned through PCR. The endogenous 5' flanking sequences of the psbA gene and rrn16S gene, the endogenous 3' flanking sequences of the psbA gene, rbcL gene, and rps16 gene and one sequence element from the psbN-psbH chloroplast operon were cloned, and certain chloroplast regulatory elements were identified. Two homologous recombination fragments and all of these elements were constructed into the cloning vector pBluescript SK (+) to yield a series of chloroplast expression vectors, which harbored the reporter gene EGFP and the selectable marker aadA gene. After identification, the chloroplast expression vectors were transformed into Escherichia coli and the function of predicted regulatory elements was confirmed by a spectinomycin resistance test and fluorescence intensity measurement. The results indicated that aadA gene and EGFP gene were efficiently expressed under the regulation of predicted regulatory elements and the chloroplast expression vector had been successfully constructed, thereby providing a solid foundation for establishing subsequent E. breviscapus chloroplast transformation system and genetic improvement of E. breviscapus.Entities:
Keywords: Erigeron breviscapus; chloroplast genome sequencing; chloroplast regulatory elements; gene assembly; prokaryotic expression
Year: 2021 PMID: 34899783 PMCID: PMC8657942 DOI: 10.3389/fpls.2021.758290
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Sequencing data volume information statistics.
| Sample name | Length (bp) | Raw reads | Raw bases (bp) | Clean reads | Clean bases (bp) | GC (%) | Q20 (%) | Q30 (%) |
|---|---|---|---|---|---|---|---|---|
|
| 150 | 27,679,808 | 4,151,971,200 | 27,467,550 | 4,120,132,500 | 37.73 | 98.55 | 95.37 |
Figure 1Base quality value distribution graph of E. breviscapus sample. The graph reflects the average sequencing quality (Y-axis) of read 1 and read 2 along the Read Cycle (X-axis).
Figure 2Circular map of the E. breviscapus cp genome. There are four rings on the cp genome map. From the center to the outside, the first circle shows the forward and reverse repetitions connected by the red arc and the green arc respectively; The second circle shows the tandem repeat marked with a short bar; The third circle is the microsatellite sequence identified by MISA; the fourth circle was drawn with genemap to show the gene structure on the plastids, Genes shown outside and inside of the fourth circle are transcribed counterclockwise and clockwise, respectively; the colors of genes are distinguished according to their functional classification.
Chloroplast structure statistics of Erigeron breviscapus samples.
| Region | Start | End | Length | A (%) | T (%) | C (%) | G (%) | AT (%) | GC (%) |
|---|---|---|---|---|---|---|---|---|---|
| Total genome | 1 | 152,164 | 152,164 | 31.36 | 31.45 | 18.54 | 18.65 | 62.80 | 37.20 |
| LSC | 1 | 84,657 | 84,657 | 32.29 | 32.65 | 17.25 | 17.80 | 64.94 | 35.06 |
| IRA | 84,658 | 109,356 | 24,699 | 28.37 | 28.50 | 20.73 | 22.40 | 56.87 | 43.13 |
| SSC | 109,357 | 127,465 | 18,109 | 34.97 | 34.03 | 16.33 | 14.67 | 69.00 | 31.00 |
| IRB | 127,466 | 152,164 | 24,699 | 28.50 | 28.37 | 22.40 | 20.73 | 56.87 | 43.13 |
List of genes in the E. breviscapus cp genome.
| Gene group | Gene name |
|---|---|
| NADH dehydrogenase | |
| Other genes | |
| RNA polymerase | |
| Transfer RNAs | |
| Ribosomal proteins (SSU) | |
| Maturase |
|
| Cytochrome b/f complex | |
| Photosystem II | |
| ATP synthase | |
| Photosystem I | |
| RubisCO large subunit |
|
| Ribosomal proteins (LSU) | |
| Translational initiation factor |
|
| Ribosomal RNAs | |
| Protease |
|
| Hypothetical chloroplast reading frames (ycf) |
Figure 3The type, distribution, and presence of simple sequence repeats (SSRs) in E. breviscapus cp genome. (A) Number of different SSR types detected in E. breviscapus cp genome. (B) Frequency of SSRs in the large single copy (LSC), small single copy (SSC), and inverted repeat (IR) regions. (C) Frequency of SSRs in the protein-coding regions, intergenic spacers, and intron.
Regulatory elements of E. breviscapus cp genome.
| Feature | Type | Size (bp) | Transcriptional elements | |
|---|---|---|---|---|
| Eb-trnI | Site of integration | 1,322 | - | |
| Eb-trnA | Site of integration | 936 | - | |
| Eb-Prrn | Promoter+rbcL5'UTR | 135 | −10 consensus sequence | TATATT |
| −35 consensus sequence | TTGACG | |||
| GAA-box | GAA | |||
| SD-like sequence | GGAGG | |||
| Eb-PpsbA | Promoter+psbA5'UTR | 244 | −10 consensus sequence | TATACT |
| −35 consensus sequence | TTGACA | |||
| Ribosome binding site 1 | AAG | |||
| Ribosome binding site 2 | TGATAAT | |||
| AU-box | TAAATAAA | |||
| Eb-TpsbA | 3'UTR | 138 | Stem-loop | |
| Eb-TrbcL | 3'UTR | 309 | Stem-loop | |
| Eb-Trps16 | 3'UTR | 303 | Stem-loop | |
| Eb-IEE | IEE+rbcL5'UTR | 67 | Stem-loop | - |
| S-box | AAGTCAA | |||
| SD-like sequence | GGAGG | |||
Figure 4PCR amplification of chloroplast regulatory elements of E. breviscapus.
Figure 5The expression of aadA gene in Escherichia coli. (A): Escherichia coli containing empty vector, (B–E): Escherichia coli containing recombinant vector.
Figure 6The expression level of EGFP gene in E. coli. Data are presented as mean±SE (n=3). The asterisk represents a significant difference (*p<0.05).