| Literature DB >> 29844954 |
Piotr Androsiuk1, Jan Paweł Jastrzębski1, Łukasz Paukszto1, Adam Okorski2, Agnieszka Pszczółkowska2, Katarzyna Joanna Chwedorzewska3, Justyna Koc1, Ryszard Górecki1, Irena Giełwanowska1.
Abstract
Colobanthus apetalus is a member of the genus Colobanthus, one of the 86 genera of the large family Caryophyllaceae which groups annual and perennial herbs (rarely shrubs) that are widely distributed around the globe, mainly in the Holarctic. The genus Colobanthus consists of 25 species, including Colobanthus quitensis, an extremophile plant native to the maritime Antarctic. Complete chloroplast (cp) genomes are useful for phylogenetic studies and species identification. In this study, next-generation sequencing (NGS) was used to identify the cp genome of C. apetalus. The complete cp genome of C. apetalus has the length of 151,228 bp, 36.65% GC content, and a quadripartite structure with a large single copy (LSC) of 83,380 bp and a small single copy (SSC) of 17,206 bp separated by inverted repeats (IRs) of 25,321 bp. The cp genome contains 131 genes, including 112 unique genes and 19 genes which are duplicated in the IRs. The group of 112 unique genes features 73 protein-coding genes, 30 tRNA genes, four rRNA genes and five conserved chloroplast open reading frames (ORFs). A total of 12 forward repeats, 10 palindromic repeats, five reverse repeats and three complementary repeats were detected. In addition, a simple sequence repeat (SSR) analysis revealed 41 (mono-, di-, tri-, tetra-, penta- and hexanucleotide) SSRs, most of which were AT-rich. A detailed comparison of C. apetalus and C. quitensis cp genomes revealed identical gene content and order. A phylogenetic tree was built based on the sequences of 76 protein-coding genes that are shared by the eleven sequenced representatives of Caryophyllaceae and C. apetalus, and it revealed that C. apetalus and C. quitensis form a clade that is closely related to Silene species and Agrostemma githago. Moreover, the genus Silene appeared as a polymorphic taxon. The results of this study expand our knowledge about the evolution and molecular biology of Caryophyllaceae.Entities:
Keywords: Caryophyllaceae; Colobanthus apetalus; Genome features; Next-generation sequencing; Phylogenetics; plastid genome
Year: 2018 PMID: 29844954 PMCID: PMC5970550 DOI: 10.7717/peerj.4723
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
GenBank accession numbers and references for cp genomes used in this study.
Species list arranged alphabetically.
| Species | Accession number | Length | Reference |
|---|---|---|---|
|
| 151,733 bp | ||
|
| 154,478 bp | ||
|
| 151,228 bp | in this study | |
|
| 151,276 bp | ||
|
| 152,320 bp | ||
|
| 150,224 bp | ||
|
| 148,081 bp | ||
|
| 147,208 bp | ||
|
| 147,896 bp | ||
|
| 151,736 bp | ||
|
| 151,639 bp | ||
|
| 151,632 bp | ||
|
| 151,583 bp |
Figure 1Gene map of the Colobanthus apetalus chloroplast genome.
Genes drawn inside the circle are transcribed clockwise, and those outside are transcribed counterclockwise (indicated by arrows). Differential functional gene groups are color-coded. GC content variations is shown in the middle circle.
List of genes present in chloroplast genome of Colobanthus apetalus.
Genes list arranged alphabetically.
| Category | Group of gene | Name of genes |
|---|---|---|
| Photosynthesis | Photosystem I | |
| Photosystem II | ||
| Cytochrome complex | ||
| ATP synthase | ||
| NADH dehydrogenase | ||
| Large subunit of RUBISCO | ||
| DNA replication and protein synthesis | Ribosomal RNA | |
| Small subunit ribosomal proteins | ||
| Large subunit ribosomal proteins | ||
| RNA polymerase subunits | ||
| Transfer RNA | ||
| Other genes | Conserved chloroplast ORF | |
| Other proteins |
Notes.
genes associated with Photosystem I
Figure 2The distribution, type, and presence of SSRs in cp genome of Colobanthus apetalus and Colobanthus quitensis.
(A) Presence of SSRs in the LSC, SSC, and IR regions in C. apetalus cp genome; (B) presence of SSRs in the LSC, SSC, and IR regions in C. quitensis cp genome; (C) presence of SSRs in the protein-coding regions, intergenic spacers, and introns in cp genome of C. apetalus (C. a) and C. quitensis (C. q).
Figure 3Nucleotide and amino acid substitutions in cp genome of C. apetalus when compared with C. quitensis plastid genome.
N—number of changed amino acids, N—number of changed nucleotides in particular group of genes: (A) genes for photosynthesis; (B) self-replication genes; (C) other genes.
Summary of chloroplast genome characteristic of C. apetalus and C. quitensis.
| Genome features | ||
|---|---|---|
| Size (bp) | 151,228 | 151,276 |
| LSC length (bp) | 83,380 | 83,462 |
| SSC length (bp) | 17,206 | 17,208 |
| IR length (bp) | 25,321 | 25,303 |
| Number of genes | 112 | 112 |
| Protein-coding genes | 78 | 78 |
| tRNA genes | 30 | 30 |
| rRNA genes | 4 | 4 |
| Number of genes duplicated in IR | 19 | 19 |
| Overall GC content (%) | 36.65 | 36.7 |
Figure 4Border position of LSC, SSC, and IR regions for C. apetalus and C. quitensis.
Genes are indicated by boxes and the gaps between the genes and the boundaries are indicated by number of bases unless the gene coincides with the boundary. Extension of the genes are also indicated above the boxes.
Figure 5Phylogeny of Colobanthus apetalus and other 11 representatives of Caryophyllaceae based on sequences of sheared 76 protein-coding genes using maximum likelihood method.