| Literature DB >> 36140718 |
Weicai Song1, Zimeng Chen1, Wenbo Shi1, Weiqi Han1, Qi Feng1, Chao Shi1,2, Michael S Engel3, Shuo Wang1.
Abstract
Litsea is a group of evergreen trees or shrubs in the laurel family, Lauraceae. Species of the genus are widely used for a wide range of medicinal and industrial aspects. At present, most studies related to the gene resources of Litsea are restricted to morphological analyses or features of individual genomes, and currently available studies of select molecular markers are insufficient. In this study, we assembled and annotated the complete chloroplast genomes of nine species in Litsea, carried out a series of comparative analyses, and reconstructed phylogenetic relationships within the genus. The genome length ranged from 152,051 to 152,747 bp and a total of 128 genes were identified. High consistency patterns of codon bias, repeats, divergent analysis, single nucleotide polymorphisms (SNP) and insertions and deletions (InDels) were discovered across the genus. Variations in gene length and the presence of the pseudogene ycf1Ψ, resulting from IR contraction and expansion, are reported. The hyper-variable gene rpl16 was identified for its exceptionally high Ka/Ks and Pi values, implying that those frequent mutations occurred as a result of positive selection. Phylogenetic relationships were recovered for the genus based on analyses of full chloroplast genomes and protein-coding genes. Overall, both genome sequences and potential molecular markers provided in this study enrich the available genomic resources for species of Litsea. Valuable genomic resources and divergent analysis are also provided for further research of the evolutionary patterns, molecular markers, and deeper phylogenetic relationships of Litsea.Entities:
Keywords: Litsea; chloroplast genome; genetic relationship; structural variations
Mesh:
Year: 2022 PMID: 36140718 PMCID: PMC9498446 DOI: 10.3390/genes13091550
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1Complete genome map of the chloroplast genome for representative species L. auriculata. The inner gray ring is divided into four areas, clockwise, and they are SSC, IRb, LSC, and IRa. The genes in the outer ring region are transcribed clockwise, while those in the inner ring are transcribed counterclockwise. In addition, this figure also reflects the GC content; the inner ring dark gray indicates the GC content, the light gray reaction AT content. In the lower left is a legend that classifies cp genes according to their functions.
Gene content of the L. moupinensis chloroplast genome.
| Group of Genes | Gene Names | Amount |
|---|---|---|
| Pholosystem I | 5 | |
| Photosystem II | 15 | |
| Cytochrome | 6 | |
| ATP syntliase | 6 | |
| NADH dehydrogenase | 12 | |
| RubisCO large subunit |
| 1 |
| RNA polymerase | 4 | |
| Ribosomal proteins (SSU) | 14 | |
| Ribosomal proteins (LSU) | 9 | |
| Transfer RNAs | 34 | |
| Ribosomal RNAs | 8 | |
| Hypothetical chloroplast | 4 | |
| Other genes | 7 |
* Gene with two copies.
Chloroplast genome features of nine species of Litsea.
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|
| 152,377 | 152,081 | 152,747 | 152,588 | 152,619 | 152,581 | 152,717 | 152,051 | 152,578 |
| 93,535 | 93,138 | 93,631 | 93,552 | 93,569 | 93,550 | 93,583 | 93,093 | 93,540 |
| 18,814 | 18,813 | 18,902 | 18,824 | 18,838 | 18,819 | 18,900 | 18,828 | 18,826 |
| 20,014 | 20,065 | 20,107 | 20,106 | 20,106 | 20,106 | 20,117 | 20,065 | 20,106 |
| 39.2 | 39.2 | 39.2 | 39.2 | 39.2 | 39.2 | 39.1 | 39.2 | 39.2 |
| 37.9 | 37.9 | 38.0 | 37.9 | 38.0 | 37.9 | 37.9 | 37.9 | 38.0 |
| 33.9 | 33.9 | 33.9 | 33.9 | 33.9 | 33.9 | 33.9 | 34.0 | 34.0 |
| 44.4 | 44.4 | 44.4 | 44.4 | 44.4 | 44.4 | 44.4 | 44.4 | 44.4 |
| 128 (113) | 128 (113) | 128 (113) | 128 (113) | 128 (113) | 128 (113) | 128 (113) | 128 (113) | 128 (113) |
| 84 (79) | 84 (79) | 84 (79) | 84 (79) | 84 (79) | 84 (79) | 84 (79) | 84 (79) | 84 (79) |
| 9 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) |
| 36 (30) | 36 (30) | 36 (30) | 36 (30) | 36 (30) | 36 (30) | 36 (30) | 36 (30) | 36 (30) |
| NC_056809 | NC_056810 | NC_056811 | NC_056812 | NC_056813 | NC_056814 | NC_056815 | NC_056816 | NC_056817 |
| 49.12 | 49.35 | 49.06 | 49.12 | 49.11 | 49.12 | 49.15 | 49.37 | 49.12 |
| 8.51 | 7.54 | 8.81 | 8.82 | 8.82 | 8.95 | 8.81 | 8.85 | 8.82 |
| 30.19 | 29.92 | 30.93 | 30.85 | 30.86 | 1.36 | 30.83 | 29.90 | 30.85 |
| 13 | 13 | 13 | 13 | 13 | 13 | 13 | 13 | 13 |
Figure 2The frequency (Freq) and preference of codon usage (RSCU) in L. moupinensis protein-coding region. Axis of abscissae indicate each amino acid and its abbreviation as well as the respective codon, the blue bar in the figure is the frequency of codon usage, and the orange line represents codon preference.
Figure 3Comparison of microsatellites and oligonucleotide repeats in the chloroplast genomes of Litsea species. (A) The number of SSR markers in the LSC/SSC/IR region for nine Litsea species. (B) Number of four long repeat sequences in nine species: complement repeats. F represents forward repeats, P represents palindromic repeats, R represents reverse repeats, C represents complement repeats. (C) number of long repeat sequences with different lengths in nine species. Different colors in the figure represent different long repeat sequence types. Species from left to right are: L. auriculata, L. chunii, L. ichangensis, L. moupinensis, L. populifolia, L. rubescens, L. sericea, L. tsinlingensis, L. veitchiana.
Figure 4Comparison of SSC, LSC, IRa, and IRb boundary regions in the chloroplast genomes of nine species of Litsea.
Figure 5Identification map of chloroplast genome of nine species of Litsea. From top to bottom: L. auriculata, L. chunii, L. ichangensis, L. moupinensis, L. populifolia, L. rubescens, L. sericea, L. tsinlingensis, L. veitchiana. The gray arrows above indicate the extension direction of the gene, and purple indicates the exon, blue indicates the untranslated region, pink indicates the non-coding sequences, and the grayish part indicates mRNA.
The number of SNP types in LSC, IR and SSC regions of nine Litsea chloroplast genomes.
| Species | Region | Transition Substitutions | Transversion Substitutions | ||||
|---|---|---|---|---|---|---|---|
| A/G | C/T | A/T | A/C | C/G | G/T | ||
|
| 109 | 106 | 25 | 46 | 6 | 63 | |
|
| Large | 137 | 129 | 20 | 52 | 11 | 80 |
|
| 129 | 139 | 26 | 58 | 11 | 75 | |
|
| single | 134 | 139 | 21 | 58 | 10 | 78 |
|
| copy | 129 | 138 | 22 | 60 | 10 | 80 |
|
| 134 | 140 | 22 | 58 | 10 | 78 | |
|
| 123 | 129 | 23 | 55 | 10 | 78 | |
|
| 136 | 127 | 19 | 56 | 11 | 73 | |
|
| 127 | 128 | 21 | 59 | 10 | 75 | |
|
| 3 | 5 | 0 | 2 | 2 | 11 | |
|
| Inverted repeat | 4 | 8 | 2 | 12 | 1 | 15 |
|
| 4 | 5 | 1 | 3 | 1 | 3 | |
|
| 3 | 8 | 2 | 12 | 1 | 12 | |
|
| 3 | 6 | 2 | 12 | 1 | 11 | |
|
| 3 | 8 | 2 | 12 | 1 | 11 | |
|
| 3 | 6 | 3 | 12 | 1 | 12 | |
|
| 2 | 9 | 2 | 12 | 1 | 14 | |
|
| 3 | 7 | 2 | 12 | 1 | 12 | |
|
| 43 | 47 | 5 | 21 | 3 | 16 | |
|
| Small | 42 | 37 | 10 | 19 | 4 | 17 |
|
| 42 | 45 | 5 | 24 | 5 | 21 | |
|
| 38 | 41 | 5 | 24 | 6 | 21 | |
|
| single | 37 | 43 | 4 | 19 | 5 | 21 |
|
| copy | 38 | 41 | 5 | 10 | 6 | 21 |
|
| 38 | 39 | 6 | 19 | 5 | 24 | |
|
| 42 | 37 | 5 | 20 | 4 | 18 | |
|
| 37 | 43 | 4 | 18 | 5 | 22 | |
Comparative analyses of the number and average length of InDel sites in LSC, SSC, and IR regions in the complete cp genomes of nine species of Litsea.
| Comparative Analyses of InDel Sites | ||||||
|---|---|---|---|---|---|---|
| Species | Large Single Copy | Inverted Repeat | Small Single Copy | |||
| No′s of InDels | InDels′ Average Length (bp) | No′s of InDels | InDels′ Average Length (bp) | No′s of InDels | InDels′ Average Length (bp) | |
|
| 86 | 4.40 | 16 | 103.3 | 18 | 1.3 |
|
| 89 | 8.4 | 3 | 458.7 | 20 | 1.4 |
|
| 99 | 3.7 | 5 | 276.0 | 19 | 1.6 |
|
| 88 | 3.7 | 4 | 339.0 | 16 | 1.9 |
|
| 86 | 3.8 | 4 | 339.0 | 17 | 1.5 |
|
| 88 | 3.9 | 5 | 272.4 | 16 | 1.9 |
|
| 86 | 3.7 | 4 | 339.0 | 19 | 2.0 |
|
| 87 | 9.1 | 2 | 678.0 | 19 | 1.4 |
|
| 88 | 3.7 | 4 | 339.0 | 18 | 1.8 |
| Average | 88.6 | 4.90 | 5.2 | 349.4 | 18.0 | 1.6 |
Figure 6Nucleotide diversity in chloroplast genomes of nine species of Litsea. The abscissa represents the position, and the red line represents the average of the nucleotide variations of the nine species. (A) Pi values for each gene region. (B). Pi values for each intergenic region. (C) Ka/KS values for each gene.
Figure 7Phylogenetic analysis. (A) Phylogenetic tree based on the complete chloroplast genome. (B) Phylogenetic tree based on 64 sets of protein-coding genes. N. sericea and Actinodaphne obovata served as out groups. The colored branches show the difference between two trees. Numbers at branch nodes are bootstrap values and posterior probability.