| Literature DB >> 32283660 |
Yanli Xiong1, Yi Xiong1, Jun He2, Qingqing Yu1, Junming Zhao1, Xiong Lei1, Zhixiao Dong1, Jian Yang1, Yan Peng1, Xinquan Zhang1, Xiao Ma1.
Abstract
Trifolium L., which belongs to the IR lacking clade (IRLC), is one of the largest genera in the Leguminosae and contains several economically important fodder species. Here, we present whole chloroplast (cp) genome sequencing and annotation of two important annual grasses, Trifolium alexandrinum (Egyptian clover) and T. resupinatum (Persian clover). Abundant single nucleotide polymorphisms (SNPs) and insertions/deletions (In/Dels) were discovered between those two species. Global alignment of T. alexandrinum and T. resupinatum to a further thirteen Trifolium species revealed a large amount of rearrangement and repetitive events in these fifteen species. As hypothetical cp open reading frame (ORF) and RNA polymerase subunits, ycf1 and rpoC2 in the cp genomes both contain vast repetitive sequences and observed high Pi values (0.7008, 0.3982) between T. alexandrinum and T. resupinatum. Thus they could be considered as the candidate genes for phylogenetic analysis of Trifolium species. In addition, the divergence time of those IR lacking Trifolium species ranged from 84.8505 Mya to 4.7720 Mya. This study will provide insight into the evolution of Trifolium species.Entities:
Keywords: IR lacking; Trifolium; chloroplast genome; divergence time; rearrangement; repetitive events
Year: 2020 PMID: 32283660 PMCID: PMC7238141 DOI: 10.3390/plants9040478
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Gene maps of T. alexandrinum (A), T. resupinatum (B) cp genomes and pictures of T. alexandrinum and T. resupinatum were shown right in the gene maps. Pictures were cited from the open website http://plants-of-styria.uni-graz.at. Genes drawn inside and outside of the circle are transcribed clockwise and counterclockwise, respectively. Genes belonging to different functional groups are color coded. The darker gray color and lighter gray color in the inner circle corresponds to the GC content and the AT content, respectively.
Comparison of the fifteen Trifolium species.
| Species | Genome Length(bp) | GC Content (%) | Gene Density | tRNA | rRNA | mRNA | Genes | Genes with Introns | GenBank Number | |
|---|---|---|---|---|---|---|---|---|---|---|
| Repetitive % | cp Genome | CDS | ||||||||
|
| 148545/2.85% | 34.09 | 37.05 | 7.54 × 10−4 | 31 | 6 | 75 | 112 | 13 | MN857160 |
|
| 149026/2.69% | 33.80 | 36.64 | 7.31 × 10−4 | 37 | 6 | 66 | 109 | 5 | MN857161 |
|
| 144763/20.71% | 34.83 | 37.10 | 7.60 × 10−4 | 30 | 4 | 76 | 110 | 16 | NC011828 |
|
| 142595/12.83% | 34.87 | 37.34 | 7.78 × 10−4 | 30 | 4 | 77 | 111 | 15 | NC476730.1 |
|
| 121178/NA * | 34.63 | 36.94 | 7.43 × 10−4 | 28 | 4 | 58 | 90 | 11 | KJ788290 |
|
| 132120/20.70% | 34.53 | 36.96 | 8.10 × 10−4 | 31 | 4 | 72 | 107 | 16 | KC894706.1 |
|
| 125834/0.71% | 34.98 | 36.70 | 8.82 × 10−4 | 31 | 5 | 75 | 111 | 18 | NC025745.1 |
|
| 126970/5.60% | 34.86 | 36.81 | 8.51 × 10−4 | 30 | 4 | 74 | 108 | 15 | KC894708.1 |
|
| 125740/1.05% | 35.24 | 36.83 | 8.75 × 10−4 | 31 | 5 | 74 | 110 | 17 | NC025743.1 |
|
| 126149/0.78% | 34.90 | 36.70 | 8.72 × 10−4 | 30 | 5 | 75 | 110 | 17 | NC025744.1 |
|
| 126149/5.60% | 37.20 | 35.82 | 8.80 × 10−4 | 30 | 4 | 77 | 111 | 16 | NC_024034 |
|
| 134831/7.86% | 34.33 | 35.97 | 8.08 × 10−4 | 31 | 4 | 74 | 109 | 17 | KJ788286 |
|
| 135049/5.98% | 33.97 | 35.71 | 8.15 × 10−4 | 30 | 4 | 76 | 110 | 15 | KJ788287 |
|
| 133780/4.64% | 36.34 | 35.91 | 8.00 × 10−4 | 29 | 4 | 74 | 107 | 17 | KJ788289 |
|
| 138194/10.55% | 36.31 | 35.81 | 7.89 × 10−4 | 31 | 4 | 74 | 109 | 17 | KJ788291 |
* The cp genome annotation of Trifolium pratense is incomplete, so the percentage of repetitive cannot be calculated [13].
List of genes annotated in the cp genomes of T. alexandrinum and T. resupinatum.
| Category | Function | Name of Genes | |||||
|---|---|---|---|---|---|---|---|
| Self-replication (31) | Ribosomal RNA Genes |
|
|
|
| ||
| Transfer RNA genes |
|
|
|
|
|
| |
|
|
|
|
|
|
| ||
|
|
|
|
|
|
| ||
|
|
|
|
|
| |||
|
|
|
| |||||
| Ribosomal proteins (10) | Small subunit of ribosome (SSU) |
|
|
|
|
|
|
|
|
|
|
| ||||
| Transcription (12) | Large subunit of ribosome (LSU) |
|
|
|
|
| |
|
|
| ||||||
| RNA polymerase subunits |
|
|
| ||||
| Photosynthesis related genes (46) | Large subunit of Rubisco |
| |||||
| Subunits of Photosystem I |
|
|
|
|
| ||
| Subunits of Photosystem II |
|
|
|
|
|
| |
|
|
|
|
|
|
| ||
|
|
| ||||||
| Subunits of ATP synthase |
|
|
|
|
| ||
| Cytochrome b/f complex |
|
|
|
|
|
| |
| C-type cytochrome synthesis gene |
| ||||||
| Subunits of NADH dehydrogenase |
|
|
|
| |||
|
|
|
|
|
| |||
| Other genes (7) | Maturase |
| |||||
| Protease | |||||||
| Chloroplast envelope membrane protein |
| ||||||
| Subunit of acetyl-CoA | |||||||
| Hypothetical open reading frames |
|
| |||||
Note: *, Genes containing a single intron; **, Genes containing two introns; (ale), Genes that are particular for T. alexandrinum; */ale, Genes that only have an intron in T. alexandrinum.
Figure 2Repeating events of shared genes of T. alexandrinum and T. resupinatum. (A) Shared length of repeats and repeated sequences of T. alexandrinum and T. resupinatum; (B) Repeat type predicted in T. alexandrinum and T. resupinatum and (C) Listing of shared repetitive sequences with more than 30 bp.
Figure 3Transversion (Tv), transition (Tn) and Insert/Deletion (In/Del) were showed in intergenic and genic regions.
Figure 4The synonymous/synonymous substitution rates (Ka/Ks) calculated using 59 shared genes in T. alexandrinum and T. resupinatum.
Figure 5The nucleotide diversity (Pi) computed using 96 common genes of T. alexandrinum and T. resupinatum. Genes with Ka/Ks values more than one were red coded.
Figure 6Synteny comparison of fifteen Trifolium chloroplast genomes with the reference of T. resupinatum using Mauve. Rectangular blocks with the same color indicate collinear regions.
Figure 7(A) BEAST chronogram of the 25 Leguminosae species based on the 41 common protein-coding genes and the phylogenetic trees constructed using rpoC2 gene (B) and ycf1 gene (C, ycf1 gene was absent in T. hybridum). Geologic timescale was obtained from TIMETREE, time is shown in millions of years (MYA). Min, Mimosaceae; Cas, Caesalpinioideae. The over 50% bootstrap support values were showed in each main branch.