| Literature DB >> 31683955 |
Myong-Suk Cho1, Jin Hyeong Kim2, Chang-Seok Kim3, José A Mejías4, Seung-Chul Kim5.
Abstract
Prickly sow thistle, Sonchus asper (L.) Hill, and common sow thistle, Sonchus oleraceus L., are noxious weeds. Probably originating from the Mediterranean region, they have become widespread species. They share similar morphology and are closely related. However, they differ in their chromosome numbers and the precise relationship between them remains uncertain. Understanding their chloroplast genome structure and evolution is an important initial step toward determining their phylogenetic relationships and analyzing accelerating plant invasion processes on a global scale. We assembled four accessions of chloroplast genomes (two S. asper and two S. oleraceus) by the next generation sequencing approach and conducted comparative genomic analyses. All the chloroplast genomes were highly conserved. Their sizes ranged from 151,808 to 151,849 bp, containing 130 genes including 87 coding genes, 6 rRNA genes, and 37 tRNA genes. Phylogenetic analysis based on the whole chloroplast genome sequences showed that S. asper shares a recent common ancestor with S. oleraceus and suggested its likely involvement in a possible amphidiploid origin of S. oleraceus. In total, 79 simple sequence repeats and highly variable regions were identified as the potential chloroplast markers to determine genetic variation and colonization patterns of Sonchus species.Entities:
Keywords: Sonchus asper; Sonchus oleraceus; chloroplast genome evolution; comparative genomic analyses; hybridization; invasive species
Mesh:
Year: 2019 PMID: 31683955 PMCID: PMC6895928 DOI: 10.3390/genes10110881
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Pictures of Sonchus asper (prickly sow thistle) and Sonchus oleraceus (common sow thistle). Sonchus asper (A). plant; (C). achene; (E,G). rounded leaf base auricle and prickly leaf margin. Sonchus oleraceus (B). plant; (D). achene; (F,H). pointed leaf base auricle and lobed leaf margin. Photo credits: (C,D) (Jose Mejías); (A,B,E–H) (Jin Hyeong Kim and Myong-Suk Cho).
Figure 2Merged gene map of four weedy Sonchus chloroplast genomes of two accessions each of S. asper and S. oleraceus that were sequenced in this study. The genes inside and outside of the circle are transcribed in the clockwise and counterclockwise directions, respectively. Genes belonging to different functional groups are shown in different colors. The thick lines indicate the extent of the inverted repeats (IRA and IRB) that separate the genomes into small single copy (SSC) and large single copy (LSC) regions. Large inversion and smaller inversion nested within the large inversion that is unique in Asteraceae are indicated with black lines outside the gene map.
Summary of the complete chloroplast genome characteristics of five accessions of two weedy species of Sonchus: S. asper and S. oleraceus.
| Characteristics |
|
| |||
|---|---|---|---|---|---|
| GenBank Accession Number/Locality ID | MK371015 | MK371016 | MG878405 | MK371006 | MH908962 |
| Total length (bp) | 151,849 | 151,849 | 151,808 | 151,808 | 151,849 |
| LSC size (bp) | 84,156 | 84,156 | 84,157 | 84,113 | 84,156 |
| SSC size (bp) | 24,738 | 24,738 | 24,707 | 24,739 | 24,738 |
| IR size (bp) | 18,217 | 18,217 | 18,237 | 18,217 | 18,217 |
| Number of genes | 130 (17 duplicated in IR) | 130 (17 duplicated in IR) | 130 (17 duplicated in IR) | 130 (17 duplicated in IR) | 130 (17 duplicated in IR) |
| Number of protein coding genes | 80 (+7 in IR) | 80 (+7 in IR) | 80 (+7 in IR) | 80 (+7 in IR) | 80 (+7 in IR) |
| Number of tRNA genes | 30 (+7 in IR) | 30 (+7 in IR) | 30 (+7 in IR) | 30 (+7 in IR) | 30 (+7 in IR) |
| Number of rRNA genes | 3 (+3 in IR) | 3 (+3 in IR) | 3 (+3 in IR) | 3 (+3 in IR) | 3 (+3 in IR) |
| GC content of whole genome (frequency/%) | 57,133/37.6% | 57,134/37.6% | 57,130/37.6% | 57,130/37.6% | 57,134/37.6% |
cpDNA: chloroplast DNA; LSC: large single copy region; SSC: small single copy region; IR: inverted repeat; GC: guanine-cytosine.
Figure 3Comparison of the border positions of the large single copy (LSC), small single copy (SSC), and inverted repeat (IR) regions among five weedy Sonchus chloroplast genomes representing S. asper and S. oleraceus. Gene names are indicated in the boxes and their lengths in the corresponding regions are displayed above the boxes. Ψ indicates a pseudogene.
Genes present in the complete chloroplast genomes of five accessions of two weedy species of Sonchus: S. asper and S. oleraceus.
| Category | Gene Name |
|---|---|
| Photosystem I | |
| Photosystem II | |
| Cytochrome b6/f complex | |
| Cytochrome C synthesis | |
| ATP synthase | |
| RuBisCO | |
| NADH oxidoreductase | |
| Large subunit ribosomal proteins | |
| Small subunit ribosomal proteins | |
| RNA polymerase | |
| Translation initiation factor | |
| Others | |
| Unknown function genes (conserved reading frames) | |
| Ribosomal RNAs | |
| Transfer RNAs |
a Gene containing a single intron. b Gene containing two introns. c Two gene copies in IRs. d Trans-splicing gene
Figure 4Simple sequence repeat (SSR) number per distribution and repeat type of five accessions of weedy S. asper and S. oleraceus chloroplast genomes. (A) Variation in the numbers of SSRs detected in five chloroplast genomes of weedy S. asper and S. oleraceus. (B) Number of SSR motifs in different repeat motifs of each weedy Sonchus chloroplast genome.
Figure 5Repeat numbers per repeat type and repeat length of five chloroplast genomes of two Sonchus species. (A) Variation in the distribution of forward, reverse, complement, and palindromic repeats in each chloroplast genomes of five weedy Sonchus genomes. (B) Number of different repeat lengths of each weedy Sonchus chloroplast genomes.
Figure 6DNA sequence polymorphisms of three Sonchus chloroplast genomes (S. webbii, S. asper, and S. oleraceus) calculated using a sliding window analysis of 1000 bases and 200 base step sizes using DNAsp. Ten most divergent regions are suggested as mutation hotspots and potential chloroplast markers for Sonchus species.
Figure 7Comparison of five chloroplast genomes of two weedy Sonchus species, S. asper and S. oleraceus plotted against S. webbii by mVISTA. Sequence identity is portrayed with a cut-off of 50% identity. The Y-scale axis represents the percent identity within 50%–100%. Grey arrows indicate genes with their orientation and position. Genome regions are color-coded as green blocks for the conserved coding genes (exon), aqua blue blocks for the conserved non-coding sequences in intergenic regions (CNS), and orange blocks for introns. Thick lines below the alignment indicate the quadripartite regions of genomes; the LSC region is in dark blue, IR regions are in pink, and the SSC region is in peach. Black bordered white peaks that are shown in genome regions indicate the divergent regions with sequence variation among Sonchus species.
Figure 8Phylogenetic position and relationships of weedy Sonchus sow thistles among Asteraceae species based on whole chloroplast genome sequences inferred from maximum likelihood analysis by IQ-TREE. Numbers above nodes are bootstrap values with 1000 replicates. Newly sequenced four chloroplast genomes representing S. asper and S. oleraceus in this study are red colored on the phylogenetic tree.