| Literature DB >> 34991482 |
Jiaxin Yang1,2,3,4, Guoxiong Hu5, Guangwan Hu6,7.
Abstract
BACKGROUND: Handeliodendron Rehder and Eurycorymbus Hand.-Mazz. are the monotypic genera in the Sapindaceae family. The phylogenetic relationship of these endangered species Handeliodendron bodinieri (Lévl.) Rehd. and Eurycorymbus cavaleriei (Lévl.) Rehd. et Hand.-Mazz. with other members of Sapindaceae s.l. is not well resolved. A previous study concluded that the genus Aesculus might be paraphyletic because Handeliodendron was nested within it based on small DNA fragments. Thus, their chloroplast genomic information and comparative genomic analysis with other Sapindaceae species are necessary and crucial to understand the circumscription and plastome evolution of this family.Entities:
Keywords: Chloroplast genome; Endangered; Endemic; Eurycorymbus; Handeliodendron; Monotypic genus; Phylogenetic analysis; Sapindaceae
Mesh:
Year: 2022 PMID: 34991482 PMCID: PMC8734052 DOI: 10.1186/s12864-021-08259-w
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Circular gene map of chloroplast genomes of Handeliodendron bodinieri and Eurycorymbus cavaleriei. The gray arrowheads indicate the direction of the genes. Genes on the outside and inside of the circle are transcribed in clockwise and counterclockwise directions, respectively. Genes belonging to different functional groups are color coded. The innermost darker gray corresponds to GC content, whereas the lighter gray corresponds to AT content. IR, LSC, SSC indicate inverted repeat, large single copy region, and small single copy region, respectively. The black arrows indicate annotation in H. bodinieri that are inconsistent with those in E. cavaleriei
Comparison of chloroplast genome feature of Handeliodendron bodinieri and Eurycorymbus cavaleriei
| Species | Location | length (bp) | T (U) (%) | C (%) | A (%) | G (%) | G + C (%) |
|---|---|---|---|---|---|---|---|
| LSC | 85,092 | 32.8 | 18.5 | 31.1 | 17.5 | 36.0 | |
| SSC | 18,731 | 34.2 | 16.3 | 34.4 | 15.1 | 31.5 | |
| IR | 25,724 | 28.3 | 20.9 | 28.6 | 22.3 | 43.1 | |
| CDS | 78,970 | 31.4 | 19.5 | 30.5 | 18.6 | 38.1 | |
| Total | 155,271 | 31.5 | 19.3 | 30.6 | 18.6 | 37.8 | |
| LSC | 86,874 | 32.7 | 18.6 | 31.3 | 17.5 | 36.1 | |
| SSC | 17,996 | 33.8 | 16.7 | 33.9 | 15.6 | 32.3 | |
| IR | 26,910 | 28.5 | 20.8 | 28.6 | 22.1 | 42.8 | |
| CDS | 79,251 | 31.3 | 19.7 | 30.3 | 18.7 | 38.4 | |
| Total | 158,690 | 31.4 | 19.3 | 30.7 | 18.6 | 37.9 |
Summary of assembled gene functions of Handeliodendron bodinieri and Eurycorymbus cavaleriei chloroplast genomes
| Gene Family | Gene Names |
|---|---|
| Subunits of ATP synthase | |
| Subunits of NADH dehydrogenase | |
| Subunits of cytochrome | |
| Subunits of photosystem I | |
| Subunits of photosystem II | |
| Subunit of rubisco | |
| Subunit of Acetyl-CoA-carboxylase | |
| c-type cytochrome synthesis gene | |
| Envelop membrane protein | |
| Protease | |
| Translational initiation | |
| Maturase | |
| Large subunit of ribosome | |
| DNA dependent RNA polymerase | |
| Small subunit of ribosome | |
| rRNA Genes | |
| tRNA Genes | |
| Unknown function |
* Genes containing a single intron; ** Genes containing two introns; (× 2) Genes are located within the IR regions and therefore are duplicated; (×E) Genes present as two copies in the IR regions of E. cavaleriei; ψ indicates a pseudogene; H Pseudogene in H. bodinieri only
Genes with introns in the chloroplast genomes of Handeliodendron bodinieri and Eurycorymbus cavaleriei as well as the lengths of the exons and introns
| Gene | Location | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Exon I (bp) | Intron I (bp) | Exon II (bp) | Intron II (bp) | Exon III (bp) | Exon I (bp) | Intron I (bp) | Exon II (bp) | Intron II (bp) | Exon III (bp) | ||
| LSC | 159 | 756 | 408 | 159 | 750 | 408 | |||||
| LSC | 69 | 836 | 291 | 645 | 228 | 69 | 853 | 291 | 658 | 228 | |
| SSC | 552 | 1143 | 540 | 558 | 1089 | 540 | |||||
| IR | 777 | 682 | 756 | 777 | 680 | 756 | |||||
| LSC | 6 | 789 | 657 | 6 | 794 | 657 | |||||
| LSC | 9 | 740 | 525 | 9 | 677 | 525 | |||||
| LSC | 9 | 1049 | 402 | 9 | 923 | 402 | |||||
| IR | 393 | 661 | 435 | 390 | 664 | 435 | |||||
| LSC | 435 | 674 | 1620 | 435 | 711 | 1620 | |||||
| LSC | 114 | – | 232 | 537 | 26 | 114 | – | 232 | 533 | 26 | |
| LSC | 39 | 829 | 225 | 39 | 836 | 225 | |||||
| LSC | 39 | 589 | 37 | 39 | 590 | 37 | |||||
| LSC | 37 | 542 | 49 | 37 | 542 | 49 | |||||
| LSC | 37 | 2514 | 38 | 37 | 2496 | 38 | |||||
| IR | 42 | 953 | 35 | 42 | 946 | 35 | |||||
| LSC | 23 | 724 | 48 | 23 | 721 | 48 | |||||
| IR | 38 | 841 | 35 | 38 | 810 | 35 | |||||
| LSC | 126 | 731 | 228 | 746 | 153 | 126 | 732 | 228 | 767 | 153 | |
Statistics on the basic features of the chloroplast genomes from Sapindaceae species
| Species | size (bp) | LSC | IR | SSC | GC content (%) | No. rRNA | No. tRNA | No. PCGs |
|---|---|---|---|---|---|---|---|---|
| 152,688 | 82,529 | 26,306 | 17,795 | 38.2 | 8 | 40 | 87 | |
| 155,271 | 85,092 | 25,724 | 18,731 | 37.8 | 8 | 37 | 89 | |
| 155,528 | 85,489 | 25,656 | 18,727 | 37.9 | 8 | 37 | 83 | |
| 155,871 | 84,882 | 26,390 | 18,209 | 38.0 | 8 | 40 | 84 | |
| 156,262 | 86,018 | 26,086 | 18,072 | 37.9 | 8 | 40 | 89 | |
| 156,595 | 86,327 | 26,100 | 18,068 | 37.9 | 8 | 40 | 89 | |
| 156,857 | 85,227 | 26,742 | 18,146 | 37.9 | 8 | 40 | 86 | |
| 156,911 | 85,314 | 26,752 | 18,093 | 37.9 | 8 | 40 | 89 | |
| 157,044 | 85,410 | 26,761 | 18,112 | 37.9 | 8 | 40 | 86 | |
| 157,067 | 85,418 | 26,760 | 18,129 | 37.9 | 8 | 39 | 88 | |
| 157,071 | 85,529 | 26,730 | 18,082 | 38.0 | 8 | 40 | 87 | |
| 157,080 | 85,455 | 26,766 | 18,093 | 37.8 | 8 | 40 | 88 | |
| 157,118 | 85,558 | 26,722 | 18,119 | 37.9 | 8 | 39 | 88 | |
| 157,197 | 85,655 | 26,728 | 18,086 | 37.8 | 8 | 40 | 86 | |
| 157,367 | 85,829 | 26,689 | 18,160 | 37.8 | 8 | 40 | 89 | |
| 158,690 | 86,874 | 26,910 | 17,996 | 37.9 | 8 | 37 | 87 | |
| 159,375 | 87,204 | 27,100 | 17,971 | 37.9 | 8 | 37 | 88 | |
| 160,481 | 85,649 | 27,979 | 18,874 | 37.7 | 8 | 39 | 88 | |
| 160,818 | 85,666 | 28,396 | 18,360 | 37.9 | 8 | 37 | 88 | |
| 160,833 | 85,708 | 28,428 | 18,269 | 37.8 | 8 | 37 | 87 | |
| 161,231 | 85,299 | 28,620 | 18,692 | 37.7 | 8 | 38 | 86 | |
| 161,356 | 86,009 | 28,597 | 18,153 | 37.8 | 8 | 37 | 87 | |
| 162,524 | 85,750 | 30,103 | 16,568 | 37.8 | 8 | 37 | 87 | |
| 163,258 | 90,236 | 27,377 | 18,268 | 37.3 | 8 | 37 | 85 |
Fig. 2Analyses of repeated sequences in Handeliodendron bodinieri and Eurycorymbus cavaleriei complete chloroplast genomes. A Total of dispersed repeats, the F, R, P, C indicate forward, reverse, palindromic, and complement repeats, respectively; B Frequency of dispersed repeats by length; C Numbers and types of SSR in the chloroplast genomes of H. bodinieri and E. cavaleriei; D SSR distribution between Gene and intergenic spacer regions (IGS) of H. bodinieri; E SSR distribution between Gene and intergenic spacer regions (IGS) of E. cavaleriei
Fig. 3Codon contents of 20 amino acids and stop codons in all coding sequences of the Handeliodendron bodinieri and Eurycorymbus cavaleriei chloroplast genomes. The histogram on the left side presents each amino acid codon usage within H. bodinieri cp genome, and the right side denotes that of within E. cavaleriei cp genome. The colour of the histogram corresponds to the colour of codons
Fig. 4Genomic rearrangement of the 16 Sapindaceae chloroplast genomes, with Litchi chinensis set as a reference genome
Fig. 5Visualization of genome alignment of the complete genome of seven complete chloroplast genomes from Sapindales. The cp genome of Eurycorymbus cavaleriei is used as the reference. X-axis indicates the sequence coordinates in the whole cp genome. Y-axis represents the similarity of the aligned regions, indicating percent identity to the reference genome (50–100%)
Fig. 6Sliding window analysis based on the 13 cp genome sequences of Sapindaceae. X-axis: position of the Midpoint of a window; Y-axis: Nucleotide diversity of each window (Pi)
Fig. 7Comparison of the Large Single-Copy (LSC), Small Single-Copy (SSC), and inverted repeat (IR) boundary regions across nine Sapindales chloroplast genomes, ψ indicates a pseudogene. The mumbers next to gene indicate the distance to the boundary, or the length of these genes in single copy regions and inverted repeat regions
Fig. 8Phylogenetic tree reconstruction of Sapindales using the maximum likelihood (ML) and Bayesian inference (BI) method based on complete chloroplast genome sequences. Only the ML tree is shown, because its topology is identical to that of the obtained BI tree. ML supports/ BI posterior probabilities values are indicated on the nodes
Fig. 9Phylogenetic tree reconstruction of Sapindales using the maximum likelihood (ML) Bayesian inference (BI) method based on 79 coding sequences. Only the ML tree is shown, because its topology is identical to that of the obtained BI tree. ML supports/BI posterior probabilities values are indicated on the node