| Literature DB >> 31765416 |
Sajjad Asaf1, Arif Khan1,2, Abdul Latif Khan1, Ahmed Al-Harrasi1, Ahmed Al-Rawahi1.
Abstract
Vachellia and Senegalia are the most important genera in the subfamily Mimosoideae (Fabaceae). Recently, species from both genera were separated from the long-characterized Acacia due to their macro-morphological characteristics. However, this morpho-taxonomic differentiation struggles to discriminate some species, for example, Vachellia nilotica and Senegalia senegal. Therefore, sequencing the chloroplast (cp) genomes of these species and determining their phylogenetic placement via conserved genes may help to validate the taxonomy. Hence, we sequenced the cp genomes of V. nilotica and S. senegal, and the results showed that the sizes of the genomes are 165.3 and 162.7 kb, respectively. The cp genomes of both species comprised large single-copy regions (93,849~91,791 bp) and pairs of inverted repeats (IR; 26,093~26,008 bp). The total numbers of genes found in the V. nilotica and S. senegal cp genomes were 135 and 132, respectively. Approximately 123:130 repeats and 290:281 simple sequence repeats were found in the S. senegal and V. nilotica cp genomes, respectively. Genomic characterization was undertaken by comparing these genomes with those of 17 species belonging to related genera in Fabaceae. A phylogenetic analysis of the whole genome dataset and 56 shared genes was undertaken by generating cladograms with the same topologies and placing both species in a new generic system. These results support the likelihood of identifying segregate genera from Acacia with phylogenomic disposition of both V. nilotica and S. senegal in the subfamily Mimosoideae. The current study is the first to obtain complete genomic information on both species and may help to elucidate the genome architecture of these species and evaluate the genetic diversity among species.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31765416 PMCID: PMC6876885 DOI: 10.1371/journal.pone.0225469
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 5Comparison of border distance between adjacent genes and junctions of LSC, SSC and two IR regions among the chloroplast genomes of V. nilotica and S. senegal with related species.
Boxes above or below the main line indicate the adjacent border genes. The figure is not to scale with respect to sequence length and only shows relative changes at or near the IR/SC borders.
Fig 1Genome map of the Vachellia nilotica and Senegalia senegal chloroplast genome.
Thick lines indicate the extent of the inverted repeat regions (IRa and IRb), which separate the genome into small (SSC) and large (LSC) single copy regions. Genes drawn inside the circle are transcribed clockwise, and those outside are transcribed counterclockwise. Genes belonging to different functional groups are color-coded. The dark grey in the inner circle corresponds to the GC content, and the light grey corresponds to the AT content.
Composition of Vachellia nilotica and Senegalia senegal cp genomes with related species.
| Size (bp) | Overall GC contents | LSC size in bp | SSC size in bp | IR size in bp | Protein coding regions size in bp | tRNA size in bp | rRNA size in bp | Number of genes | Number of protein coding genes | Number of rRNA | Number of tRNA | Genes with introns | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 165,343 | 35.4 | 93849 | 19308 | 26093 | 79520 | 2847 | 9052 | 135 | 89 | 8 | 38 | 22 | |
| 165,383 | 35.3 | 93901 | 19347 | 26068 | 78009 | 2828 | 9052 | 127 | 82 | 8 | 37 | 21 | |
| 165,829 | 35.3 | 94220 | 19474 | 26068 | 78033 | 2828 | 9052 | 127 | 82 | 8 | 37 | 21 | |
| 162702 | 35.7 | 91791 | 18895 | 26008 | 75571 | 2794 | 9052 | 132 | 87 | 8 | 37 | 22 | |
| 162754 | 35.8 | 91750 | 18911 | 26047 | 77997 | 2829 | 9052 | 127 | 82 | 8 | 37 | 20 | |
| 176717 | 35.3 | 92036 | 5053 | 39814 | 89202 | 2793 | 9052 | 138 | 92 | 8 | 37 | 21 | |
| 178887 | 35.1 | 91076 | 4805 | 41503 | 89784 | 2793 | 9078 | 139 | 94 | 8 | 37 | 23 | |
| 162,552 | 35.9 | 91517 | 18941 | 26047 | 77949 | 2793 | 9052 | 130 | 83 | 8 | 37 | 23 | |
| 161,681 | 35.9 | 91093 | 18574 | 26007 | 78075 | 2794 | 9052 | 130 | 83 | 8 | 37 | 23 | |
| 176,692 | 35.3 | 92320 | 5036 | 39668 | 89271 | 2793 | 9052 | 138 | 92 | 8 | 37 | 24 | |
| 164692 | 35.6 | 93690 | 18890 | 26056 | 78759 | 2815 | 9049 | 129 | 84 | 8 | 37 | 22 | |
| 175489 | 35.5 | 90987 | 4948 | 39777 | 89244 | 2820 | 9056 | 137 | 92 | 8 | 37 | 24 | |
| 175646 | 35.3 | 91045 | 4761 | 39920 | 88638 | 2793 | 9068 | 138 | 90 | 8 | 37 | 23 | |
| 161240 | 35.9 | 90430 | 18526 | 26142 | 77958 | 2793 | 9068 | 130 | 83 | 8 | 37 | 23 | |
| 176870 | 35.2 | 92319 | 4573 | 39989 | 89133 | 2793 | 9052 | 138 | 92 | 8 | 37 | 24 | |
| 174861 | 35.6 | 90169 | 4928 | 39882 | 89130 | 2793 | 9066 | 138 | 92 | 8 | 37 | 24 | |
| 159389 | 36.5 | 88577 | 18756 | 26028 | 78030 | 2793 | 9052 | 130 | 83 | 8 | 37 | 23 | |
| 174233 | 35.4 | 92770 | 4986 | 38225 | 88107 | 2802 | 9062 | 134 | 89 | 8 | 37 | 22 | |
| 174217 | 35.4 | 92753 | 4956 | 38254 | 88551 | 2793 | 9060 | 137 | 91 | 8 | 37 | 22 |
Genes in the sequenced V. nilotica and S. senegal species chloroplast genomes.
| Category | Group of genes | Name of genes |
|---|---|---|
| Large subunit of ribosomal proteins | ||
| Small subunit of ribosomal proteins | ||
| DNA dependent RNA polymerase | ||
| rRNA genes | ||
| tRNA genes | ||
| Photosystem I | ||
| Photosystem II | ||
| Cytochrome b6/f complex | ||
| ATP synthase | ||
| Rubisco | ||
| Maturase | ||
| Protease | ||
| Envelop membrane protein | ||
| Subunit Acetyl- CoA-Carboxylate | ||
| c-type cytochrome synthesis gene | ||
| Conserved open reading frames |
Base composition of the Vachellia nilotica and Senegalia senegal chloroplast genome.
| T/U | C | A | G | Length (bp) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| V.N | S.S | V.N | S.S | V.N | S.S | V.N | S.S | V.N | S.S | |
| 32.9 | 32.7 | 17.9 | 18 | 31.7 | 31.6 | 17.5 | 17.7 | 165343 | 162702 | |
| 34.6 | 34.3 | 16.7 | 16.9 | 32.9 | 32.7 | 15.8 | 16 | 93849 | 91791 | |
| 35.8 | 35.8 | 14.2 | 14.4 | 34.5 | 34.2 | 15.5 | 15.5 | 193.8 | 18895 | |
| 28.4 | 28.3 | 22.1 | 22.1 | 28.9 | 29.1 | 20.6 | 20.6 | 26093 | 26008 | |
| 25.3 | 24.9 | 23.3 | 23.4 | 22.1 | 22.2 | 29.3 | 29.5 | 2847 | 2794 | |
| 19 | 18.9 | 23.7 | 23.7 | 25.6 | 25.7 | 31.7 | 31.7 | 9052 | 9052 | |
| 31.9 | 32.1 | 17.4 | 17.3 | 30.6 | 30.7 | 20.1 | 19.9 | 79520 | 75571 | |
| 23.36 | 25.2 | 17.43 | 18.20 | 34.47 | 30.8 | 25.57 | 25.16 | 25189 | ||
| 33.59 | 32.2 | 19.09 | 19.78 | 29.26 | 29.62 | 19.44 | 17.49 | 25189 | ||
| 40.12 | 38.7 | 15.24 | 13.8 | 27.82 | 31.6 | 15.15 | 14.4 | 25189 | ||
V.N = Vachellia nilotica, S. S = Senegalia Senegal.
Fig 2Analysis of repeated sequences in V. nilotica and S. senegal.
(A) Totals of three repeat types, (B) Frequency of palindromic repeats by length, (C) Frequency of forward repeats by length and (D) Frequency of tandem repeats by length.
Fig 3Analysis of simple sequence repeat (SSR) in V. nilotica and S. senegal genomes with related species cp genomes.
Number of different SSR types detected in these genomes.
Fig 4Analysis of simple sequence repeat (SSR) in the V. nilotica and S. senegal genomes.
Frequency of identified SSRs in the Small Single-Copy (SSC), Large Simple-Copy (LSC), Inverted Repeat (IR), transfer RNA (tRNA), ribosomal RNA (rRNA), and coding sequence (CDS) regions.
Fig 6Phylogenetic tree constructed on the basis of whole genome dataset using four different methods: Bayesian inference (BI), maximum likelihood (ML), maximum parsimony (MP), and neighbor-joining (NJ).
Numbers above the branches are the posterior probabilities of BI and bootstrap values for ML, MP and NJ. The star represents the position of V. nilotica and S. senegal.