| Literature DB >> 36064865 |
Nurbanu Tayşi1, Yasin Kaymaz1, Duygu Ateş1, Hatice Sari2, Cengiz Toker2, M Bahattin Tanyolaç3.
Abstract
Lens is a member of the Papilionoideae subfamily of Fabaceae and is generally used as a source of vegetable protein as part of human diets in many regions worldwide. Chloroplast (cp) genomes are highly active genetic components of plants and can be utilized as molecular markers for various purposes. As one of the wild lentil species, the Lens ervoides cp genome has been sequenced for the first time in this study using next-generation sequencing. The de novo assembly of the cp genome resulted in a single 122,722 bp sequence as two separate coexisting structural haplotypes with similar lengths. Results indicated that the cp genome of L. ervoides belongs to the inverted repeat lacking clade. Several noteworthy divergences within the coding regions were observed in ndhB, ndhF, rbcL, rpoC2, and ycf2 genes. Analysis of relative synonymous codon usage showed that certain genes, psbN, psaI, psbI, psbE, psbK, petD, and ndhC, preferred using biased codons more often and therefore might have elevated expression and translation efficiencies. Overall, this study exhibited the divergence level between the wild-type and cultured lentil cp genomes and pointed to certain regions that can be utilized as distinction markers for various goals.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36064865 PMCID: PMC9445179 DOI: 10.1038/s41598-022-17877-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1(A) Comparison of two possible L. ervoides cp genome scaffold orientations (Path 1—left panel; Path 2—right panel) to L. culinaris cp genome sequence using nucmer. Mummerplot analysis demonstrates similar sequence blocks with the same orientation as purple diagonal lines while inverted reverse sequence blocks as backward diagonal lines in blue colour. (B) Gene order comparison of Lens cp genomes using MAUVE software. Blocks of the same colour show the same gene orientation. Boxes above the line represent the clockwise gene sequence, and boxes below the line represent gene sequences in the opposite direction. Co-existing alternative structural haplotypes (scaffolds graph 1 and 2) of L. ervoides and comparison of these structures with L. culinaris. (C) Block similarity alignments of both genomes after manually setting the starting point of L. ervoides assembly path 1 according to L. culinaris.
Comparison of L. ervoides cp genome and L. culinaris cp genome deposited to the GenBank features.
| Species | ||
|---|---|---|
| Chloroplast genome size (bp) | 122,722 | 122,967 |
| Total number of genes | 112 | 107 |
| Protein coding genes | 77 | 73 |
| tRNA genes | 31 | 29 |
| rRNA genes | 4 | 4 |
| Pseudogene | – | 1 |
| Overall GC content (%) | 34.44 | 34.4 |
Figure 2Gene maps of L. ervoides and L. culinaris. Genes inside the circle transcript clockwise, and genes outside the circle transcript counterclockwise. Genes with the same functional trait are shown in the same colour group. The dark grey area in the inner circle indicates the GC content of the corresponding genes, and the light grey area indicates the AT content.
List of chloroplast genomes gene content and functional classification in L. ervoides.
| Category | Functional groups | Genes |
|---|---|---|
| Self-replication | Large subunit of ribosomal proteins | |
| Small subunit of ribosomal proteins | ||
| DNA-dependent RNA polymerase | ||
| Ribosomal RNA genes | ||
| Transfer RNA genes | ||
| Genes for photosynthesis | Photosystem I | |
| Photosystem II | ||
| RUBISCO | ||
| Subunits of ATP synthase | ||
| Subunit of NADH-dehydrogenase | ||
| Cytochrome b/f complex | ||
| Other genes | Protease | |
| Maturase | ||
| Envelope membrane protein | ||
| Translation initiation factor | ||
| C-type cytochrome synthesis gene | ||
| Subunit of Acetyl-CoA-carboxylase | ||
| Genes of unknown function | Conserved hypothetical chloroplast |
aGene containing one intron.
bGene containing two introns.
cGene with two exons.
dGene with three exons.
Figure 3(A) mVISTA analysis of two cp genomes. Grey arrows show the direction of gene translation. The y-axis gives percent identity ranging from 50–100% and exon, conserved noncoding sequence (CNS), tRNAs, tRNAs are represented in different colours. (B) Percent dissimilarity (mismatches) between L. ervoides and L. culinaris throughout their cp genomes with 1 kb window and 200 bp sliding step. Repetitive regions determined by miropeats are masked.
Figure 4Type of simple sequence repeats (SSRs) and repeat sequences number in the cp genome of L. culinaris (red) and L. ervoides (blue). (A) Distribution of various SSRs on both cp genomes grouped based on type. (B) The frequencies of SSR types. (C) Various types of repeats grouped based on their size in bp. (D) The distribution of tandem repeats.
Figure 5(A) RSCU values of each codon grouped based on their associated amino acid (single letters on top). (B) The relationship between the percent preferred codon (RSCU > 1.0) usage of each protein coding gene and their MELP index. Ribosomal genes are in green and other genes are read. Only non-ribosomal genes with MELP > 1.0 are labelled with gene names. Blue line represents the linear regression and the grey area surrounding the line shows the confidence interval.
Figure 6Phylogenetic relationship of Papilionoideae species using maximum likelihood (ML) and Jukes Cantor model analysis. (A) Phylogenetic tree of whole cp genomes from various species including IRLC and two outgroups. (B) Phylogenetic tree based on 54 protein coding genes that are common among 14 Papilionoideae cp genomes within IRLC. Arabidopsis thaliana is the out group. Bootstrap supports were calculated from 100 replicates.