| Literature DB >> 32466556 |
Zhou Hong1, Zhiqiang Wu2, Kunkun Zhao1, Zengjiang Yang1, Ningnan Zhang1, Junyu Guo1, Luke R Tembrock3, Daping Xu1.
Abstract
Pterocarpus is a genus of trees mainly distributed in tropical Asia, Africa, and South America. Some species of Pterocarpus are rosewood tree species, having important economic value for timber, and for some species, medicinal value as well. Up to now, information about this genus with regard to the genomic characteristics of the chloroplasts has been limited. Based on a combination of next-generation sequencing (Illumina Hiseq) and long-read sequencing (PacBio), the whole chloroplast genomes (cp genomes) of five species (rosewoods) in Pterocarpus (Pterocarpus macrocarpus, P. santalinus, P. indicus, P. pedatus, P. marsupium) have been assembled. The cp genomes of five species in Pterocarpus have similar structural characteristics, gene content, and sequence to other flowering plants. The cp genomes have a typical four-part structure, containing 110 unique genes (77 protein coding genes, 4 rRNAs, 29 tRNAs). Through comparative genomic analysis, abundant simple sequence repeat (SSR)loci (333-349) were detected in Pterocarpus, among which A /T single nucleotide repeats accounted for the highest proportion (72.8-76.4%). In the five cp genomes of Pterocarpus, eight hypervariable regions, including trnH-GUG_psbA, trnS-UGA_psbC, accD-psaI, ndhI-exon2_ndhI-exon1, ndhG_ndhi-exon2, rpoC2-exon2, ccsA, and trnfM-CAU, are proposed for use as DNA barcode regions. In the comparison of gene selection pressures (P. santalinus as the reference genome), purifying selection was inferred as the primary mode of selection in maintaining important biological functions. Phylogenetic analysis shows that Pterocarpus is a monophyletic group. The species P. tinctorius is resolved as early diverging in the genus. Pterocarpus was resolved as sister to the genus Tipuana.Entities:
Keywords: Pterocarpus; chloroplast genome; hypervariable regions; microsatellite markers; phylogenetic analysis; purifying selection
Mesh:
Substances:
Year: 2020 PMID: 32466556 PMCID: PMC7312355 DOI: 10.3390/ijms21113758
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1The assembly, size and features of M-S-I-P-M2 cp genomes (Pterocarpus). The genes outside the circle are transcribed in the counter clockwise direction, and the genes inside the circle are transcribed in the clockwise direction. Different colors in genes represent different functions. The dark gray area and light gray area of inner circle represent the GC content to AT content of the genome respectively.
List of genes encoded by five species of Pterocarpus chloroplast genome. (×2) indicates that the gene has two copies. * and ** indicate genes containing one/ two introns. The ycf1 gene of Pterocarpus pedatus, Pterocarpus marsupium and Pterocarpus santalinus contains one intron. The cp genome of P. santalinus was used as the reference, + indicates that the gene is inferred to be under positive selection (ω > 1) and − indicates that the gene is inferred to be under purifying selection (ω = 0.001).
| Category | Group | Genes |
|---|---|---|
| Photosynthesis | Subunits_of_photosystem_I | psaA-, psaB, psaI, psaJ, psaC |
| Subunits_of_photosystem_II | psbA, psbZ, psbC-, psbD, psbM, psbI, psbK, psbL, psbF, psbE, psbB-, psbT, psbN, psbH | |
| Subunits_of_NADH_dehydrogenase | ndhC, ndhK, ndhJ, ndhB(×2) * +, ndhF, ndhD+, ndhE, ndhG, ndhI *, ndhA *, ndhH- | |
| Subunits_of_cytochrome_b/f_complex | petN, petA, petL, petG, petB *, petD * | |
| Subunits_of_ATP_synthase | atpB-, atpE, atpI, atpH, atpF *, atpA | |
| Large_subunit_of_Rubisco | rbcL | |
| Self-replication | Large_subunits_of_ribosome | rpl33, rpl20, rpl36-, rpl14, rpl16 *, rpl2(×2) *, rpl32+, rpl23(×2) |
| Small_subunits_of_ribosome | rps4+, rps14, rps2, rps18, rps11, rps8, rps3, rps19, rps12 *- (×2,part), rps15, rps7(×2) | |
| DNA-dependent_RNA_polymerase | rpoB, rpoC1 *, rpoC2 *, rpoA | |
| Ribosomal_RNAs | rrn5, rrn4.5, rrn23, rrn16 | |
| Transfer_RNAs | trnH-GUG, trnK-UUU *, trnM-CAU, trnV-UAC *, trnF-GAA, trnL-UAA *, trnT-UGU, trnS-GGA, trnfM-CAU, trnG-UCC, trnS-UGA, trnT-GGU, trnE-UUC, trnY-GUA, trnD-GUC, trnC-GCA, trnR-UCU, trnS-GCU, trnQ-UUG, trnW-CCA, trnP-UGG, trnI-CAU, trnL-CAA, trnV-GAC, trnI-GAU *, trnA-UGC *, trnR-ACG, trnN-GUU, trnL-UAG | |
| Other genes | Maturase | matK * |
| Protease | clpP **- | |
| Envelope_membrane_protein | cemA | |
| Acetyl-CoA_carboxylase | accD+ | |
| C-type_cytochrome_synthesis_gene | ccsA | |
| Genes of unknown function | Proteins_of_unknown_function | ycf3 **, ycf4, ycf68(×2) *, ycf1, orf42(×2), ycf2(×2) + |
Figure 2The abundance of different SSR types in the five chloroplast genomes of Pterocarpus (M-S-I-P-M2 cp genomes).
Figure 3Comparison of LSC, IRb, SSC, and IRa border regions in five species of Pterocarpus, Dalbergia cultrata, and Arabiopsis thaliana.
Figure 4Global alignment of five (M-S-I-P-M2) chloroplast genomes of Pterocarpus using mVISTA. Y-axis indicates the range of identity (50–100%). Alignment was performed using Pterocarpus marsupium as a reference.
Figure 5Comparison of nucleotide variability in coding region (A) and non-coding region (B) among our new sequenced five species of Pterocarpus.
Figure 6Phylogenetic tree for 32 species in Fabaceae (Arabidopsis thaliana as an outgroup) using maximum likelihood (ML), based on alignments of complete chloroplast genomes. Numbers at the nodes indicate bootstrap values from 1000 replicates. If the bootstrap values are as 100, this number was not shown on the nodes.