| Literature DB >> 32384622 |
Sajjad Asaf1, Abdul Latif Khan1, Arif Khan2, Ahmed Al-Harrasi1.
Abstract
Genus Prosopis (family Fabaceae) are shrubby trees, native to arid and semi-arid regions of Asia, Africa, and America and known for nitrogen fixation. Here, we have sequenced the complete chloroplast (cp) genomes of two Prosopis species (P. juliflora and P. cineraria) and compared them with previously sequenced P. glandulosa, Adenanthera microsperma, and Parkia javanica belonging to the same family. The complete genome sequences of Prosopis species and related species ranged from 159,389 bp (A. microsperma) to 163,677 bp (P. cineraria). The overall GC contents of the genomes were almost the similar (35.9-36.6%). The P. juliflora and P. cineraria genomes encoded 132 and 131 genes, respectively, whereas both the species comprised of 85 protein-coding genes higher than other compared species. About 140, 134, and 129 repeats were identified in P. juliflora, P. cineraria and P. glandulosa cp genomes, respectively. Similarly, the maximum number of simple sequence repeats were determined in P. juliflora (88), P. cineraria (84), and P. glandulosa (78). Moreover, complete cp genome comparison determined a high degree of sequence similarity among P. juliflora, P. cineraria, and P. glandulosa, however some divergence in the intergenic spacers of A. microsperma and Parkia javanica were observed. The phylogenetic analysis showed that P. juliflora is closer to P. cineraria than P. glandulosa.Entities:
Keywords: Fabaceae; comparative analysis; inverted repeats; phylogenetic position; plastid genome
Mesh:
Substances:
Year: 2020 PMID: 32384622 PMCID: PMC7247323 DOI: 10.3390/ijms21093280
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Genome map of the P. juliflora and P. cineraria cp genomes. The extent of the inverted repeat regions (IRs) is represented by thick lines, which divide the cp genome into large (LSC) and small (SSC) single copy regions. Genes drawn inside the circle are transcribed clockwise, while those outside of the circle are transcribed counter clockwise. Genes belonging to different functional groups are color coded. The dark grey in the inner circle corresponds to the GC content, while the light grey corresponds to the AT content.
Summary of genome features of complete chloroplast of P. cineraria, P. juliflora, P. glandulosa, A. microsperma, P. javanica.
|
|
|
|
|
| |
|---|---|---|---|---|---|
| Size (bp) | 163,677 | 163,237 | 163,040 | 159,389 | 161,681 |
| Overall GC contents | 35.9 | 35.9 | 35.9 | 36.5 | 35.9 |
| LSC size (bp) | 92,937 | 92,495 | 92,310 | 88,577 | 91,093 |
| SSC size (bp) | 18,878 | 18,880 | 19,132 | 18,756 | 18,574 |
| IR size (bp) | 25,931 | 25,931 | 25,931 | 26,028 | 26,007 |
| Protein coding regions (bp) | 78,883 | 78,421 | 78,039 | 78,030 | 78,075 |
| tRNA size (bp) | 2868 | 2927 | 2810 | 2793 | 2794 |
| rRNA size (bp) | 9052 | 9052 | 9052 | 9052 | 9052 |
| Number of genes | 131 | 132 | 128 | 128 | 130 |
| Number of protein coding genes | 85 | 85 | 83 | 83 | 83 |
| Number of rRNAs | 8 | 8 | 8 | 8 | 8 |
| Number of tRNA s | 38 | 39 | 37 | 37 | 37 |
| Genes with introns | 21 | 21 | 23 | 22 | 23 |
P. cineraria = Prosopis cineraria, P. juliflora = Prosopis juliflora, P. glandulosa = Prosopis glandulosa, A. microsperma = Adenanthera microsperma, P. javanica = Parkia javanica.
The genes with introns in the three Prosopis species chloroplast genomes and the length of exons and introns.
| Gene | Location | Exon I (bp) | Intron 1 (bp) | Exon II (bp) | Intron II (bp) | Exon III (bp) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| P. j | P. c | P. g | P. j | P. c | P. g | P. j | P. c | P. g | P. j | P. c | P. g | P. j | P. c | P. g | ||
| LSC | 145 | 145 | 145 | 744 | 726 | 727 | 389 | 407 | 407 | |||||||
| LSC | 6 | 6 | 6 | 815 | 815 | 815 | 642 | 642 | 642 | |||||||
| LSC | 8 | 8 | 8 | 720 | 720 | 720 | 475 | 475 | 475 | |||||||
| IR | 391 | 393 | 393 | 665 | 662 | 662 | 434 | 435 | 435 | |||||||
| LSC | 9 | 9 | 9 | 1173 | 1173 | 1173 | 399 | 399 | 399 | |||||||
| LSC | 40 | 40 | 40 | 884 | 883 | 883 | 245 | 245 | 245 | |||||||
| LSC | 432 | 432 | 432 | 842 | 802 | 802 | 1578 | 1617 | 1617 | |||||||
| LSC-IR | 114 | 114 | 114 | 232 | 232 | 232 | 540 | 540 | 540 | 26 | 26 | |||||
| LSC | 69 | 69 | 69 | 789 | 790 | 789 | 291 | 291 | 291 | 642 | 642 | 642 | 228 | 228 | 228 | |
| SSC | 552 | 553 | 552 | 1449 | 1452 | 1457 | 540 | 539 | 540 | |||||||
| IR | 777 | 777 | 777 | 685 | 684 | 685 | 756 | 756 | 756 | |||||||
| LSC | 126 | 126 | 126 | 729 | 729 | 728 | 228 | 228 | 228 | 738 | 738 | 738 | 153 | 153 | 153 | |
| IR | 38 | 38 | 38 | 802 | 794 | 802 | 35 | 35 | 35 | |||||||
| IR | 42 | 42 | 42 | 948 | 948 | 948 | 35 | 35 | 35 | |||||||
| LSC | 37 | 37 | 37 | 536 | 536 | 536 | 50 | 50 | 50 | |||||||
| LSC | 29 | 29 | 29 | 2492 | 2491 | 2491 | 37 | 37 | 37 | |||||||
| LSC | 23 | 702 | 49 | |||||||||||||
| LSC | 37 | 37 | 37 | 619 | 619 | 619 | 39 | 39 | 39 | |||||||
P. j = Prosopis juliflora, P. c = Prosopis cineraria, P. g = Prosopis glandulosa. a Duplicated gene. * The rps12 coding sequence is split between 5′-rps12 and 3′-rps12, which are located in the large single-copy region and inverted repeat region, respectively.
Genes in the sequenced Prosopis species chloroplast genomes.
| Category | Group of Genes | Name of Genes |
|---|---|---|
|
| Large subunit of ribosomal proteins | |
| Small subunit of ribosomal proteins | ||
| DNA dependent RNA polymerase | ||
| rRNA genes | ||
| tRNA genes | ||
|
| Photosystem I | |
| Photosystem II | ||
| Cytochrome b6/f complex | ||
| ATP synthase | ||
| Rubisco | ||
| Maturase | ||
| Protease | ||
| Envelop membrane protein | ||
| Subunit Acetyl- CoA-Carboxylate | ||
| c-type cytochrome synthesis gene | ||
|
| Conserved Open reading frames |
Base composition of the P. cineraria, P. juliflora and P. glandulosa chloroplast genome.
| T/U | C | A | G | Length (bp) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| P. j | P. c | P. g | P. j | P. c | P. g | P. j | P. c | P. g | P. j | P. c | P. g | P. j | P. c | P. g | |
| Genome | 32.5 | 32.5 | 32.5 | 18.3 | 18.3 | 18.3 | 31.6 | 31.6 | 31.7 | 17.6 | 17.6 | 17.6 | 163,237 | 163,677 | 163,040 |
| LSC | 34.2 | 34.2 | 34.2 | 17.0 | 17.0 | 17.0 | 32.6 | 32.6 | 32.6 | 16.1 | 16.2 | 16.1 | 92,495 | 92,937 | 92,310 |
| SSC | 34.9 | 34.9 | 34.6 | 15.9 | 16.0 | 15.7 | 34.7 | 34.6 | 35.2 | 14.5 | 14.5 | 14.4 | 18,880 | 18,878 | 19,132 |
| IR | 28.8 | 28.8 | 28.5 | 20.6 | 20.6 | 22.1 | 28.5 | 28.5 | 28.8 | 22.1 | 22.1 | 20.6 | 25,931 | 25,931 | 25,931 |
| tRNA | 25.1 | 24.9 | 24.8 | 23.4 | 23.5 | 23.8 | 22.1 | 22.0 | 21.9 | 29.4 | 29.6 | 29.5 | 2927 | 2868 | 2810 |
| rRNA | 18.9 | 18.9 | 18.8 | 23.7 | 23.7 | 23.6 | 25.7 | 25.7 | 25.7 | 31.7 | 31.7 | 31.9 | 9052 | 9052 | 9052 |
| Protein coding genes | 31.7 | 31.7 | 31.7 | 17.4 | 17.4 | 17.4 | 30.9 | 30.8 | 30.8 | 20.0 | 20.1 | 20.1 | 78,421 | 78,883 | 78,039 |
| 1st position | 32.6 | 33.0 | 32.4 | 17.9 | 18.14 | 18.2 | 31.4 | 31.5 | 31.7 | 17.9 | 17.5 | 17.5 | 54,412 | 54,559 | 54,347 |
| 2nd position | 32.4 | 32.33 | 32.7 | 18.5 | 18.4 | 17.7 | 31.5 | 31.3 | 32.1 | 17.4 | 17.8 | 17.5 | 54,412 | 54,559 | 54,347 |
| 3rd position | 32.3 | 32.44 | 32.4 | 20.9 | 18.1 | 18.8 | 32.0 | 31.9 | 31 | 17.4 | 17.3 | 17.7 | 54,412 | 54,559 | 54,347 |
P. c = Prosopis cinerea, P. j = Prosopis juliflora, P. g = Prosopis glandulosa.
Figure 2Analysis of simple sequence repeats (SSRs) in the P. juliflora and P. cineraria and related cp genomes. Total numbers of SSRs in whole genome (A), Number of SSRs in coding region (B), Number of SSRs in intergenic region (C) and Frequency of identified SSR motifs in different repeat class types (D).
Figure 3Analysis of repeated sequences in P. juliflora and P. cineraria and related cp genomes. Total numbers of the three repeat types (A), frequencies of palindromic repeats by length (B), frequencies of forward repeats by length (C) and frequencies of tandem repeats by length (D).
Figure 4Visual alignment of plastid genomes from P. juliflora with P. cineraria, P. glandulosa, P. javanica and A. micosperma cp genomes. VISTA-based identity plot showing sequence identities among five species, using P. juliflora as a reference. Genome regions are color-coded as protein coding, rRNA coding, tRNA coding, or conserved noncoding sequences (CNS). The x-axis represents the coordinate in the chloroplast genome. Annotated genes are displayed along the top. The sequences similarity of the aligned regions is shown as horizontal bars indicating the average percent identity between 50% and 100%.
Figure 5Pairwise distance of 75 genes from P. cineraria (as reference genome) with P. juliflora, P. glandulosa, P. javanica and A. microsperma cp genomes.
Figure 6Distances between adjacent genes and junctions of the small single-copy (SSC), large single-copy (LSC), and two inverted repeat (IR) regions among five cp genomes within the subfamily Caesalpinioideae. Boxes above and below the primary line indicate the adjacent border genes. The figure is not to scale with regards to sequence length and only shows relative changes at or near the IR/SC borders.
Figure 7Phylogenetic tree of P. juliflora and P. cineraria within the subfamily Caesalpinioideae (Leguminosae). The entire genome data set was analysed using three different methods: Bayesian inference (BI), maximum parsimony (MP) and maximum likelihood (ML). Numbers above and below the branches represent bootstrap values in the MP and ML trees and posterior probabilities in the BI trees. The green color and red stars represent the positions of P. juliflora and P. cineraria.