| Literature DB >> 27764252 |
Saura R Silva1, Yani C A Diaz2, Helen Alves Penha3, Daniel G Pinheiro3, Camila C Fernandes3, Vitor F O Miranda2, Todd P Michael4, Alessandro M Varani3.
Abstract
Lentibulariaceae is the richest family of carnivorous plants spanning three genera including Pinguicula, Genlisea, and Utricularia. Utricularia is globally distributed, and, unlike Pinguicula and Genlisea, has both aquatic and terrestrial forms. In this study we present the analysis of the chloroplast (cp) genome of the terrestrial Utricularia reniformis. U. reniformis has a standard cp genome of 139,725bp, encoding a gene repertoire similar to essentially all photosynthetic organisms. However, an exclusive combination of losses and pseudogenization of the plastid NAD(P)H-dehydrogenase (ndh) gene complex were observed. Comparisons among aquatic and terrestrial forms of Pinguicula, Genlisea, and Utricularia indicate that, whereas the aquatic forms retained functional copies of the eleven ndh genes, these have been lost or truncated in terrestrial forms, suggesting that the ndh function may be dispensable in terrestrial Lentibulariaceae. Phylogenetic scenarios of the ndh gene loss and recovery among Pinguicula, Genlisea, and Utricularia to the ancestral Lentibulariaceae cladeare proposed. Interestingly, RNAseq analysis evidenced that U. reniformis cp genes are transcribed, including the truncated ndh genes, suggesting that these are not completely inactivated. In addition, potential novel RNA-editing sites were identified in at least six U. reniformis cp genes, while none were identified in the truncated ndh genes. Moreover, phylogenomic analyses support that Lentibulariaceae is monophyletic, belonging to the higher core Lamiales clade, corroborating the hypothesis that the first Utricularia lineage emerged in terrestrial habitats and then evolved to epiphytic and aquatic forms. Furthermore, several truncated cp genes were found interspersed with U. reniformis mitochondrial and nuclear genome scaffolds, indicating that as observed in other smaller plant genomes, such as Arabidopsis thaliana, and the related and carnivorous Genlisea nigrocaulis and G. hispidula, the endosymbiotic gene transfer may also shape the U. reniformis genome in a similar fashion. Overall the comparative analysis of the U. reniformis cp genome provides new insight into the ndh genes and cp genome evolution of carnivorous plants from Lentibulariaceae family.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27764252 PMCID: PMC5072713 DOI: 10.1371/journal.pone.0165176
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Genomic map of the Utricularia reniformis cp genome.
Genes shown on the outside of the map are transcribed clockwise, whereas genes on the inside are transcribed counter-clockwise. Genes are color coded by their function in the legend.
List of genes encoded by the Utricularia reniformis chloroplast (cp) genome.
Ψ pseudogene
◊ trans-splicing
• intron-containing gene
Codon usage table.
Codon usage and codon—anticodon recognition pattern for tRNA in Utricularia reniformis cp genome.
| Amino acid | Codon | No. | RSCU | % | tRNA | Amino acid | Codon | No. | RSCU | % | tRNA |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | GCU | 479 | 1.64 | 40.9 | - | Pro | CCU | 339 | 1.42 | 35.6 | - |
| GCC | 201 | 0.69 | 17.2 | - | CCC | 214 | 0.90 | 22.4 | - | ||
| GCA | 346 | 1.18 | 29.6 | trnA-UGC | CCA | 240 | 1.01 | 25.2 | trnP-UGG | ||
| GCG | 143 | 0.49 | 12.3 | - | CCG | 159 | 0.67 | 16.8 | - | ||
| Cys | UGU | 181 | 1.49 | 74.4 | - | Gln | CAA | 629 | 1.51 | 75.3 | trnQ-UUG |
| UGC | 62 | 0.51 | 25.6 | trnC-GCA | CAG | 206 | 0.49 | 24.7 | - | ||
| Asp | GAU | 702 | 1.60 | 79.9 | - | Arg | CGU | 301 | 1.24 | 20.7 | trnR-ACG |
| GAC | 176 | 0.40 | 20.1 | trnD-GUC | CGC | 115 | 0.47 | 7.9 | - | ||
| Glu | GAA | 878 | 1.50 | 74.9 | trnE-UUC | CGA | 338 | 1.39 | 23.3 | - | |
| GAG | 294 | 0.50 | 25.1 | - | CGG | 128 | 0.53 | 8.7 | - | ||
| Phe | UUU | 806 | 1.31 | 65.4 | - | AGA | 407 | 1.68 | 27.9 | trnR-UCU | |
| UUC | 426 | 0.69 | 35.6 | trnF-GAA | AGG | 168 | 0.69 | 11.5 | - | ||
| Gly | GGU | 483 | 1.27 | 31.7 | - | Ser | UCU | 475 | 1.67 | 27.8 | - |
| GGC | 184 | 0.48 | 12.2 | trnG-GCC | UCC | 275 | 0.97 | 16.1 | trnS-GGA | ||
| GGA | 555 | 1.46 | 36.4 | trnG-UCC | UCA | 320 | 1.12 | 18.7 | trnS-UGA | ||
| GGG | 300 | 0.79 | 19.7 | - | UCG | 198 | 0.70 | 11.6 | - | ||
| His | CAU | 420 | 1.49 | 74.6 | - | AGU | 345 | 1.21 | 20.2 | - | |
| CAC | 142 | 0.51 | 25.4 | trnH-GUG | AGC | 95 | 0.33 | 5.6 | trnS-GCU | ||
| Ile | AUU | 916 | 1.50 | 49.9 | - | Thr | ACU | 464 | 1.54 | 38.3 | - |
| AUC | 369 | 0.60 | 20.1 | trnI-GAU | ACC | 234 | 0.77 | 19.4 | trnT-GGU | ||
| AUA | 548 | 0.90 | 30 | trnI-CAU | ACA | 366 | 1.21 | 30.3 | trnT-UGU | ||
| Lys | AAA | 990 | 1.48 | 73.9 | trnK-UUU | ACG | 144 | 0.48 | 12 | - | |
| AAG | 348 | 0.52 | 26.1 | - | Val | GUU | 457 | 1.46 | 36.5 | - | |
| Leu | UUA | 704 | 1.82 | 30.3 | trnL-UAA | GUC | 163 | 0.52 | 13.1 | trnV-GAC | |
| UUG | 493 | 1.27 | 21.2 | trnL-CAA | GUA | 466 | 1.49 | 37.2 | trnV-UAC | ||
| CUU | 501 | 1.29 | 21.5 | - | GUG | 166 | 0.53 | 13.2 | - | ||
| CUC | 148 | 0.38 | 6.3 | - | Tyr | UAU | 642 | 1.52 | 76 | - | |
| CUA | 312 | 0.80 | 13.4 | trnL-UAG | UAC | 202 | 0.48 | 24 | trnY-GUA | ||
| CUG | 168 | 0.43 | 7.3 | - | Trp | UGG | 384 | 1.00 | 100 | trnW-CCA | |
| Met (START) | AUG | 480 | 1.00 | 100 | trnf(f)M-CAU | Stop | UGA | 16 | 0.63 | 21 | - |
| Asn | AAU | 843 | 1.49 | 74.6 | - | UAA | 39 | 1.54 | 51 | - | |
| AAC | 287 | 0.51 | 25.4 | trnN-GUU | UAG | 21 | 0.83 | 27 | - |
(a) Relative Synonymous Codon Usage
(b) Codon frequency (in %) per amino acid
•Intron-containing tRNA genes
Fig 2Expression profile of the Utricularia reniformis cp genes.
The expression values are normalized in Reads Per Kilobase per Million mapped reads (RPKM) on the Y-axis. The psaJ and psbA genes show the highest expression levels as measured in RPKM (323,750 and 437,421, respectively) and were suppressed from the figure for clarity.
RNA editing pattern in Utricularia reniformis cp genome identified by RNAseq reads alignment.
| Gene | Genome (cp) | Codon | Codon position | Amino acid | Editing level | |||
|---|---|---|---|---|---|---|---|---|
| Position | Strand | from | to | from | to | U/C (%) | ||
| 129,471 | + | CAC | UAC | 1 | H | Y | 15 | |
| 102,612 | - | CCU | UCU | 1 | P | S | 20 | |
| 68,312 | - | CCA | UCA | 1 | P | S | 99 | |
| 65,985 | + | AAC | AAU | 3 | N | N | 11 | |
| 115,658 | + | UCU | UUU | 2 | S | F | 81 | |
| 124,300 | + | ACA | AUA | 2 | T | I | 99 | |
| 103,606 | + | CCA | UCA | 2 | P | S | 85 | |
* synonymous substitution
† Also predicted by PREP-Cp
RNA editing pattern in Utricularia reniformis cp genome predicted by PREP-Cp.
| Gene | Genome (cp) | Codon | Codon position | Amino acid | Editing Score | |||
|---|---|---|---|---|---|---|---|---|
| Position | Strand | from | to | from | to | (PREP-Cp) | ||
| 86,311 | - | ACG | AUG | 2 | T | M | 1.0 | |
| 86,391 | - | CGG | UGG | 1 | R | W | 1.0 | |
| 86,562 | - | CCU | UCU | 1 | P | S | 1.0 | |
| 87,043 | - | UCG | UUG | 2 | S | L | 0.8 | |
| 125,155 | + | UUT | CUU | 1 | P | L | 1.0 | |
| 125,752 | + | UCA | UUA | 2 | S | L | 1.0 | |
| 35,170 | - | GCC | GUV | 2 | A | V | 1.0 | |
| 136,847 | + | UCT | UAU | 3 | H | Y | 1.0 | |
| 137,456 | + | CAU | UAU | 1 | H | Y | 1.0 | |
| 137,579 | + | CCG | UCG | 1 | P | S | 0.86 | |
| 68,692 | - | CGG | UGG | 1 | R | W | 1.0 | |
| 68,885 | - | CCA | CUA | 2 | P | L | 1.0 | |
| 84,202 | - | CCU | UCU | 1 | P | S | 1.0 | |
| 70,971 | - | CGU | UGU | 1 | R | C | 1.0 | |
| 71,044 | - | GCG | GUG | 2 | A | V | 0.86 | |
| 1,364 | - | GCG | GUG | 2 | A | V | 0.86 | |
| 75,977 | + | UCA | UUA | 2 | S | L | 0.86 | |
| 65,590 | + | UCA | UUA | 2 | S | L | 1.0 | |
| 113,701 | + | CUU | UUU | 1 | L | F | 1.0 | |
| 113,996 | + | UCU | UUU | 2 | S | F | 1.0 | |
| 114,131 | + | UCA | UUA | 2 | S | L | 0.86 | |
| 114,209 | + | UCA | UUA | 2 | S | L | 1.0 | |
| 114,575 | + | ACG | AUG | 2 | T | M | 0.86 | |
| 115,219 | + | CUC | UUC | 1 | L | F | 1.0 | |
| 115,657 | + | UCU | UUU | 2 | S | F | 1.0 | |
| 116,084 | + | UCA | UUA | 2 | S | L | 0.86 | |
| 119,318 | + | CGU | UUA | 1 | R | C | 0.86 | |
| 120,709 | + | CUU | UUU | 1 | L | F | 0.86 | |
| 121,870 | + | CUU | UUU | 1 | L | F | 1.0 | |
| 122,038 | + | CUU | UUU | 1 | L | F | 1.0 | |
| 122,170 | + | CGG | UGG | 1 | R | W | 1.0 | |
| 123,590 | + | UCA | UUA | 2 | S | L | 0.86 | |
| 103,537 | + | UCA | UUA | 2 | S | L | 1.0 | |
| 103,606 | + | CCA | CUA | 2 | P | L | 1.0 | |
| 134,633 | + | UCA | UUA | 2 | S | L | 1.0 | |
| 124,189 | + | ACC | AUC | 2 | T | I | 1.0 | |
| 124,414 | + | UCA | UUA | 2 | S | L | 1.0 | |
‡ The rpl2 is duplicated in the IRs
† Also detected by RNAseq read mapping
Fig 3Frequency of SSR motifs found in Utricularia reniformis cp coding and intergenic regions, taking into account sequence complementarities.
Distribution of the cpSSRs present in the U. reniformis cp coding regions.
| cpSSR type | Genes | Pseudogenes |
|---|---|---|
| mono-repeats | ||
| di-repeats | ||
| tri-repeat (TTC) | ||
| compound/polymorphic |
Sequence repeats in the cp genome of Utricularia reniformis.
Type, location and size (in bp) of repeated elements (IGS, Intergenic spacer).
| Type | Location | Size (bp) |
|---|---|---|
| Palindrome | IGS: | 58 |
| Palindrome | IGS: | 44 |
| Palindrome | 37 | |
| Palindrome | IGS: | 36 |
| Palindrome | 30 | |
| Forward | 35 | |
| Forward | IGS: | 31 |
| Palindrome | IGS: | 31 |
| Palindrome | IGS: | 31 |
| Forward | IGS: | 31 |
| Palindrome | IGS: | 33 |
| Palindrome | IGS: | 30 |
| Forward | 32 | |
| Forward | 31 | |
| Palindrome | 31 | |
| Palindrome | 31 | |
| Forward | 31 | |
| Forward | 30 | |
| Palindrome | 30 | |
| Palindrome | 30 | |
| Palindrome | IGS: | 30 |
| Forward | 30 | |
| Palindrome | 30 | |
| Forward | 30 |
Fig 4Comparison of the boundaries of the LSC, SSC, and IR regions in the currently available cp genomes of Lentibulariaceae.
Fig 5Venn diagram showing the full complement of genes present in the sequenced Lentibulariaceae cp genomes (pan genome).
The tRNAs and rRNAs are not included. The numbers below each species represent the unique genes used in the comparison.
Distribution and comparisons of the eleven genes in the ndh complex, encoded in aquatic and terrestrial cp genomes from carnivorous plants and the ancestral lineages Sesamum indicum, Tanaecium tetragonolobum and Andrographis paniculata.
Black box represents that the given ndh gene is present and intact. Gray box indicates truncated ndh gene according to the legend. White box indicate that the given ndh gene is absent in the cp genome.
| 2,171bp | 2,162bp | 2,069bp | 1,811bp | 197bp | 1,640bp | 2,190bp | 2,153bp | |||||||||||||||||
| 2,211bp | 2,211bp | 2,21bp | 2.211bp | 2,121bp | 1,080bp | 2,211bp | 2,211bp | |||||||||||||||||
| 362bp | 362bp | 362bp | 362bp | 362bp | ||||||||||||||||||||
| 1,502bp | 1,148bp | 1,454bp | 622bp | 737bp | 1,526bp | 1,526bp | ||||||||||||||||||
| 305bp | 305bp | 305bp | 309bp | 188bp | 233bp | 305bp | 305bp | |||||||||||||||||
| 2,255bp | 2,231bp | 2,23bp | 2,261bp | 2,261bp ( | ||||||||||||||||||||
| 530bp | 530bp | 530bp | 520bp | 509bp | 530bp | 530bp | ||||||||||||||||||
| 1,18bp | 1,181bp | 1,18bp | 1,131bp | 1,085bp | 1,187bp | 1,181bp | ||||||||||||||||||
| 506bp | 506bp | 506bp | 514bp | 469bp | 520bp | 530bp | 524bp | |||||||||||||||||
| 476bp | 476bp | 476bp | 434bp | 476bp | 476bp | |||||||||||||||||||
| 701bp | 677bp | 677bp | 410bp | 644bp | 677bp | |||||||||||||||||||
| 14,411bp (9.4%) | 14,000bp (9.1%) | 14,213bp (9.4%) | 10,173bp (6.9%) | 5,096bp (3.6%) | 6,884bp (4.9%) | 14,433bp (9.4%) | 16,678bp (10.3%) | |||||||||||||||||
• Intron-containing gene
▲Missing fragment of coding region (incomplete gene)
■ Stop codon in-frame
* two gene copies
□ Frameshift
Fig 6Phylogenetic history of ndh genes.
(A) represents the only scenario for the ndhA, ndhC, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, and ndhK genes evolution. (B) represents the phylogenetic history and the two possible scenarios for the ndhB gene and (C) for the ndhD gene (blue bars indicate the arising of the functional genes and red bars their loss).
Fig 7Phylogenomic analysis based on 23 complete Lamiales cp genomes.
The phylogenetic position of Utricularia reniformis was inferred by maximum likelihood and Bayesian analyses. Numbers above and below the lines indicate the maximum likelihood bootstrap values and Bayesian posterior probabilities values > 50% for each clade, respectively. The table on the right indicates the genome features in base pairs (chloroplast genome length, LSC, SSC and IRs regions). The histograms located to the left of each feature (CP size, LSC, SSC and IRs), graphically illustrate the size distribution for each feature. Sub-titles: I Lentibulariaceae; II Acanthaceae; III Bignoniaceae; IV Pedaliaceae; V Orobanchaceae; VI Lamiaceae; VII Scrophulariaceae; VIII Gesneriaceae, and IX Oleaceae.
Fig 8Cloudgram (Bayesian inference) of Lentibulariaceae from 18,000 Bayesian trees based on matK cp gene.
The red circle 1 indicates the terrestrial ancestral lineage for the Lentibulariaceae family. The blue circles, 2 and 2’, represent the independent radiations to the aquatic habitat of Utricularia lineages. The green circle 3 indicates the possible ancestor of the epiphytic species from the plesiomorphic terrestrial life form, and the black circles, 4 and 4’, represent the independent origins for the rare reophytic life form for Utricularia lineages. Numbers below clades represent the support (maximum parsimony bootstrap/ maximum likelihood bootstrap/ posterior probabilities based on Bayesian inference). Higher color densities represent higher levels of certainty represented by congruent trees (from the 18,000 trees).