| Literature DB >> 27379132 |
Sajjad Asaf1, Abdul L Khan2, Abdur R Khan1, Muhammad Waqas3, Sang-Mo Kang1, Muhammad A Khan1, Seok-Min Lee1, In-Jung Lee1.
Abstract
Nicotiana otophora is a wild parental species of Nicotiana tabacum, an interspecific hybrid of Nicotiana tomentosiformis and Nicotiana sylvestris. However, N. otophora is least understood as an alternative paternal donor. Here, we compared the fully assembled chloroplast (cp) genome of N. otophora and with those of closely related species. The analysis showed a cp genome size of 156,073 bp and exhibited a typical quadripartite structure, which contains a pair of inverted repeats separated by small and large single copies, containing 163 representative genes, with 165 microsatellites distributed unevenly throughout the genome. Comparative analysis of a gene with known function across Nicotiana species revealed 76 protein-coding sequences, 20 tRNA sequences, and 3 rRNA sequence shared between the cp genomes. The analysis revealed that N. otophora is a sister species to N. tomentosiformis within the Nicotiana genus, and Atropha belladonna and Datura stramonium are their closest relatives. These findings provide a valuable analysis of the complete N. otophora cp genome, which can identify species, elucidate taxonomy, and reconstruct the phylogeny of genus Nicotiana.Entities:
Keywords: Nicotiana; SSRs; cp genome; phylogeny; repeat analysis; sequence divergence
Year: 2016 PMID: 27379132 PMCID: PMC4906380 DOI: 10.3389/fpls.2016.00843
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Summary of complete chloroplast genomes for five .
| Total | 156,073 | 155,941 | 155,943 | 155,745 | 155,863 |
| Large single copy (LSC, bp) | 86,621 | 86,684 | 86,686 | 86,392 | 86,633 |
| Inverted repeat (IR, bp) | 25,888 | 25,342 | 25,343 | 25,429 | 25,331 |
| Small single copy (SSC, bp) | 17,677 | 18,573 | 18,571 | 18,495 | 18,568 |
| GC% | 37.7 | 37.8 | 37.8 | 37.8 | 37.9 |
| Total | 163 | 140 | 144 | 140 | 156 |
| Protein coding genes | 110 | 111 | 98 | 111 | 110 |
| tRNA | 45 | 37 | 37 | 37 | 37 |
| rRNA | 8 | 8 | 8 | 8 | 8 |
Figure 1Gene map of the . Genes drawn inside the circle are transcribed clockwise, and those outside are counterclockwise. Genes belonging to different functional groups are color-coded. The darker gray in the inner circle corresponds to GC content, and the lighter gray corresponds to AT content.
Genes in the sequenced .
| Self-replication | Large subunit of ribosomal proteins | |
| Small subunit of ribosomal proteins | ||
| DNA dependent RNA polymerase | ||
| rRNA genes | ||
| tRNA genes | ||
| Photosynthesis | Photosystem I | |
| Photosystem II | ||
| NadH oxidoreductase | ||
| Cytochrome b6/f complex | ||
| ATP synthase | ||
| Rubisco | ||
| Other genes | Translational initiation factor | |
| Maturase | ||
| Protease | ||
| Envelop membrane protein | ||
| Subunit Acetyl- CoA-Carboxylate | ||
| c-type cytochrome synthesis gene | ||
| Unknown | Conserved Open reading frames |
Base composition in the .
| Genome | 31.5 | 19.2 | 30.8 | 18.5 | 156,073 |
| LSC | 32.8 | 18.3 | 31.5 | 17.5 | 86,621 |
| SSC | 34.3 | 16.8 | 33.7 | 15.2 | 17,677 |
| IR | 28.6 | 22.3 | 28.4 | 20.7 | 25,888 |
| tRNA | 25.8 | 22.7 | 22.7 | 28.9 | 1268 |
| rRNA | 18.8 | 23.6 | 26.1 | 31.5 | 4524 |
| Protein Coding genes | 29.8 | 18.7 | 29.7 | 21.8 | 80,379 |
| 1st position | 23.38 | 20.66 | 26.77 | 26.65 | 26,883 |
| 2nd position | 32.38 | 20.34 | 29.4 | 17.89 | 26,883 |
| 3rd position | 35.25 | 13.9 | 31.58 | 15.8 | 26,883 |
Figure 2Amino acid frequencies of the . The frequencies of amino acids were calculated for all 110 protein-coding genes from the start to the stop codon.
Figure 3Analysis of repeated sequences in five , Total of three repeat types; (B), Frequency of the palindromic repeat by length; (C), frequency of the direct repeat by length; and (D), Frequency of tandem repeat by length.
Figure 4Analysis of simple sequence repeat (SSR) in the five Number different SSRs types detected in five genomes and (B) Frequency of identified SSR motifs in different repeat class types.
Figure 5Venn diagram illustrating the proportion of genes in five Number of protein coding genes shared by five Nicotiana cp genomes. (B) Number of unique genes identified in each cp genome.
Figure 6Visualization alignment of five chloroplast genome sequences. VISTA-based identity plot showing sequence identity among five Nicotiana species using N. otophora as a reference. The thick black line shows the inverted repeats (IRs) in the chloroplast genomes.
Figure 7Sliding window analysis of Analysis of LSC regions, (B) Analysis of SSC regions, and (C) Analysis of IR regions. (window length: 600 bp, step size: 200 bp). X-axis, position of the midpoint of a window; Y-axis, nucleotide diversity of each window.
Figure 8Phylogenetic relationship of . Tree constructed by maximum likelihood (A), maximum parsimony and Bayesian inference (B) with Citrus aurantifolia and Citrus sinensis as outgroups.