| Literature DB >> 28723946 |
Saura R Silva1, Danillo O Alvarenga2, Yani Aranguren3, Helen A Penha2, Camila C Fernandes2, Daniel G Pinheiro2, Marcos T Oliveira2, Todd P Michael4, Vitor F O Miranda3, Alessandro M Varani2.
Abstract
The carnivorous plants of the family Lentibulariaceae have attained recent attention not only because of their interesting lifestyle, but also because of their dynamic nuclear genome size. Lentibulariaceae genomes span an order of magnitude and include species with the smallest genomes in angiosperms, making them a powerful system to study the mechanisms of genome expansion and contraction. However, little is known about mitochondrial DNA (mtDNA) sequences of this family, and the evolutionary forces that shape this organellar genome. Here we report the sequencing and assembly of the complete mtDNA from the endemic terrestrial Brazilian species Utricularia reniformis. The 857,234bp master circle mitochondrial genome encodes 70 transcriptionaly active genes (42 protein-coding, 25 tRNAs and 3 rRNAs), covering up to 7% of the mtDNA. A ltrA-like protein related to splicing and mobility and a LAGLIDADG homing endonuclease have been identified in intronic regions, suggesting particular mechanisms of genome maintenance. RNA-seq analysis identified properties with putative diverse and important roles in genome regulation and evolution: 1) 672kbp (78%) of the mtDNA is covered by full-length reads; 2) most of the 243kbp intergenic regions exhibit transcripts; and 3) at least 69 novel RNA editing sites in the protein-coding genes. Additional genomic features are hypothetical ORFs (48%), chloroplast insertions, including truncated plastid genes that have been lost from the chloroplast DNA (5%), repeats (5%), relics of transposable elements mostly related to LTR retrotransposons (5%), and truncated mitovirus sequences (0.4%). Phylogenetic analysis based on 32 different Lamiales mitochondrial genomes corroborate that Lentibulariaceae is a monophyletic group. In summary, the U. reniformis mtDNA represents the eighth largest plant mtDNA described to date, shedding light on the genomic trends and evolutionary characteristics and phylogenetic history of the family Lentibulariaceae.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28723946 PMCID: PMC5516982 DOI: 10.1371/journal.pone.0180484
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Assembly summary statistics and validation of Utricularia reniformis mtDNA master circle (MC) genome.
| Number of mtDNA-related paired-end reads (2x300bp ~600bp) | 1,787,363 |
| Number of mtDNA-related mate-paired reads (2x100bp ~3kbp) | 178,224 |
| Total assembled size (excluding repeats > 500bp)(bp) | 831,638bp |
| PE assembly statistics | |
| - Nodes | 13 |
| - Edges | 8 |
| - Dead ends | 14 (53.85%) |
| - Connected components | 5 |
| - Contigs | 7 |
| - Longest contig size | 335,336bp |
| - Shortest contig size | 40,798bp |
| - Average contig size | 118,634bp |
| - N50 | 230,405 |
| - L50 | 2 |
| Master circle size (bp) | 857,234 |
| Master circle size GC% | 43,98 |
| - Average coverage | |
| - Paired-end | 956x (st.dev. 265) |
| - Mate-pairs | 40.98x (st.dev. 20.9) |
| - Long Repeat (LR) coverage (25kb) | 1,342 (st.dev. 200) |
| - Small Repeat (SR) coverage (3.2kb) | 1,201 (st. dev. 240) |
| - Mapped reads in pairs | |
| -paired-end reads | 1,762,585 (98.61%) |
| -mate-paired reads | 177,316 (99.49%) |
| - uncalled MC bases (Ns) | 9 (0.0010%) |
| RNAseq mapping | |
| - Total Number of Reads (single-end ~200bp) | 1,213,898 |
| - Mitochondrial genome covered (bp) | 672,561 (78.45%) |
| - Average coverage | 83.9x (st. dev. 566.4) |
Fig 1Genomic map of the Utricularia reniformis mtDNA genome.
The inner red circle illustrates the transcript depth of coverage for U. reniformis mtDNA, whereas the peaks represent the most covered regions by RNAseq reads. The second level circle in gray scale represents the GC% distribution across the mtDNA. The Large inverted repeats (LIR) and Small direct repeats (SDR) are shown as black boxes. The mitochondrial genes are shown in the outer circle, whereas genes shown on the outside of the map are transcribed clockwise, and the genes on the inside are transcribed counter-clockwise. Genes are color coded by their function in the legend, whereas partial mitovirus derived sequences are shown in gray boxes. The ndhJ-ndhK-ndhC loci which is deleted from the U. reniformis plastid genome is shown in the gray box.
Main characteristics and features of Utricularia reniformis mtDNA genome.
| Number | Length (bp) | % of MT Genome | |
|---|---|---|---|
| A—common mitochondrial genes | 70 | 61,207 | 7,14 |
| - mtDNA coding regions | 42 | 54,079 | 6.31 |
| - group I intron | 1 | 892 | 0,1 |
| - group II intron (cis/trans-spliced) | 19 | 17,543 | 2,04 |
| - pseudogenes | 3 | 711 | 0,08 |
| - mtDNA tRNAs | 25 | 1,929 | 0,23 |
| -group I intron | 2 | 671 | 0,07 |
| - mtDNA rRNAs | 3 | 5,219 | 0,61 |
| B–ORFs (from 89bp to 665bp; average 192bp) | 2,149 | 414,236 | 48.32 |
| - no transcription evidence supported by RNAseq | 944 | 156,725 | 18.29 |
| - with more than 150pb (average 206bp) | 484 | 101,237 | 11.81 |
| - transcription evidence supported by RNAseq (≥ 5 reads) | 1,205 | 257,471 | 30.03 |
| - with more than 150bp (average 237bp) | 935 | 223,352 | 26.05 |
| C—mitovirus derived (RNA-dependent RNA polymerase) | 7 | 3,800 | 0,44 |
| D—chloroplast-derived regions (from | 262 | 44,431 | 5,18 |
| - identified cpDNA genes and truncated pseudogenes | 71 | 28,586 | 3.33 |
| - intact cpDNA genes | 2 | 531 | 0.62 |
| - pseudogenes/truncated/partial | 69 | 28,055 | 3.28 |
| E—repeated regions (excluding A, B, C and D) | 883 | 49,000 | 5,72 |
| F—Transposable Elements (TEs) related regions | 125 | 41,165 | 4.8 |
| - Retroelements (Class I) | |||
| -LINEs | 6 | 1,550 | 0.18 |
| -LTR elements | 119 | 39,615 | 4.62 |
| -Ty1/Copia | 60 | 20,958 | 2.44 |
| -Gypsy/DIRS1 | 36 | 12,954 | 1.51 |
| - Annotated and characterized regions, and potential hypothetical/chimeric orfs | 3,496 | 613,839 | 71.61 |
| - excluding potential hypothetical/chimeric orfs | 1,351 | 201,147 | 23.46 |
| - including potential hypothetical/chimeric orfs with transcription evidence | 2,552 | 457,074 | 53.32 |
| - Rest of the genome (intergenic spacer regions and non-characterized regions) | - | 243,395 | 28.39 |
| - Mapped by RNAseq (unique mapping– 99% identity and 100% coverage) | - | 178,814 | 21 |
§ Only fragments of TEs were identified
List of the traditional mitochondrial, chloroplast derived, and additional genes encoded by the Utricularia reniformis mtDNA genome.
| Complex I (NADH dehydrogenase) | |
| Complex II (succinate dehydrogenase) | |
| Complex III (ubichinol cytochrome c reductase) | |
| Complex IV (cytochrome c oxydase) | |
| Complex V (ATP synthase) | |
| Cytochrome c biogenesis | |
| Ribosomal proteins (SSU) | |
| Ribosomal proteins (LSU) | |
| Maturases | |
| Transport membrane proteins | |
| Transfer RNAs | |
| Ribosomal RNAs | |
| Others | LAGLIDADG endonuclease (intron region of |
| Photosystem I | |
| Protosystem II | |
| Cytochrome b/f complex | |
| ATP synthase | |
| NADH dehydrogenase | |
| RubisCO large subunit | |
| RNA polymerase | |
| Ribosomal proteins (SSU) | |
| Ribosomal proteins (LSU) | |
| Other genes | |
| hypothetical chloroplast reading frames | |
| Ribosomal RNAs | |
| Transfer RNAs | |
| Mitovirus RNA-dependent RNA polymerase | |
| DNA-directed RNA polymerase subunit beta |
Ψ pseudogene
◊ trans-splicing
• intron-containing gene
Fig 2Repeats and alternative master circle structures.
(A) Repeats across U. reniformis mtDNA; (B) Putative alternative conformations of U. reniformis mtDNA based on repeat-mediated intramolecular recombination mechanism; (C) Shared regions among U. reniformis mtDNA and U. gibba mtDNA; (D) repeats between U. reniformis cpDNA against the mtDNA.
Main characteristics of the unknown (no hits) and hypothetical open reading frames identified in U. reniformis mtDNA.
| ORFs | Number | Signal Peptide + Transmembrane Domain | Only Signal Peptide | Only Transmembrane Domain |
|---|---|---|---|---|
| 1,898 | 21 | 187 | 254 | |
| - transcribed | 1,018 | 17 | 98 | 135 |
| - non transcribed | 880 | 4 | 89 | 119 |
| 152 | 2 | 8 | 15 | |
| - transcribed | 111 | 1 | 8 | 14 |
| - non transcribed | 41 | 1 | 0 | 1 |
| 99 | 0 | 4 | 9 | |
| - transcribed | 76 | 0 | 3 | 6 |
| - non transcribed | 23 | 0 | 1 | 3 |
Fig 3U. reniformis mtDNA ndhJ-ndhK-ndhC loci alignment against the respective loci in the cpDNA of U. gibba and U. macrorhiza.
Fig 4Transcription patterns of the traditional mitochondrial genes by Reads Per Kilobase Million (RPKM).
The 69 novel RNA editing sites identified in the protein-coding genes of U. reniformis mtDNA.
For a complete list of the 147 editing sites identified by the RNAseq approach please refer to S4 Table.
| Gene | mtDNA | Codon | Codon position | Amino acid | Editing level (%) | |||
|---|---|---|---|---|---|---|---|---|
| Position | Strand | From | To | From | To | |||
| 99048 | + (exon 1) | CUA | UUA | 1 | L | L | 18 | |
| 99201 | + (exon 1) | CCG | UCG | 1 | P | S | 88 | |
| 344481 | + (exon 3) | CCC | CCU | 3 | P | P | 33 | |
| 726995 | - (exon 5) | AAC | AAU | 3 | F | N | 63 | |
| 297295 | - (exon 2) | CCA | CUA | 2 | P | L | 92 | |
| 463531 | + (exon 4) | UUC | UUU | 3 | F | F | 18 | |
| 463653 | + (exon 4) | CCG | CUG | 2 | P | L | 81 | |
| 465022 | + (exon 5) | UCC | UUC | 2 | S | F | 70 | |
| 465030 | + (exon 5) | CUC | UUC | 1 | L | F | 64 | |
| 20810 | - | CCA | CUA | 2 | P | L | 92 | |
| 578975 | + (exon 3) | CUA | UUA | 1 | L | L | 36 | |
| 579324 | + (exon 3) | CCA | CUA | 2 | P | L | 94 | |
| 581798 | + (exon 4) | ACC | ACU | 3 | T | T | 93 | |
| 260779 | - (exon 2) | GCC | GCU | 3 | A | A | 29 | |
| 260950 | - (exon 2) | AUC | AUU | 3 | I | I | 20 | |
| 261311 | - (exon 2) | GCC | GUC | 2 | A | V | 75 | |
| 261673 | - (exon 2) | ACC | ACU | 3 | T | T | 25 | |
| 261677 | - (exon 2) | CCG | CUG | 2 | P | L | 92 | |
| 300266 | - | CCA | CUA | 2 | P | L | 93 | |
| 300273 | - | CCC | UCC | 1 | P | S | 90 | |
| 711488 | + (exon 1) | AAC | AAU | 3 | N | N | 13 | |
| 712855 | + (exon 2) | UCU | UUU | 2 | S | F | 70 | |
| 713691 | + (exon 3) | CGU | UGU | 1 | R | C | 100 | |
| 713819 | + (exon 3) | GUC | GUU | 3 | V | V | 24 | |
| 713906 | + (exon 3) | UCC | UCU | 3 | S | S | 12 | |
| 714114 | + (exon 3) | CUU | UUU | 1 | L | F | 91 | |
| 715347 | + (exon 4) | CCU | CUU | 2 | P | L | 72 | |
| 715366 | + (exon 4) | UCC | UCU | 3 | S | S | 61 | |
| 715453 | + (exon 4) | CCC | CCU | 3 | P | P | 88 | |
| 715506 | + (exon 4) | UCU | UUU | 2 | S | F | 85 | |
| 715522 | + (exon 4) | CAC | CAU | 3 | H | H | 72 | |
| 715527 | + (exon 4) | CCA | CUA | 2 | P | L | 98 | |
| 715569 | + (exon 4) | UCU | UUU | 2 | S | F | 100 | |
| 497201 | + | UCU | UUU | 2 | S | F | 73 | |
| 497282 | + | CCA | CUA | 2 | P | L | 47 | |
| 497708 | + | UCU | UUU | 2 | S | F | 78 | |
| 779528 | + | UCU | UUU | 2 | S | F | 92 | |
| 779794 | + | CUA | UUA | 1 | L | L | 86 | |
| 321763 | + (exon 1) | ACC | ACU | 3 | T | T | 89 | |
| 323181 | + (exon 2) | CUG | UUG | 1 | L | L | 96 | |
| 514300 | + | CUA | UUA | 1 | L | L | 84 | |
| 558125 | + | UUC | UUU | 3 | F | F | 67 | |
| 285091 | + | CCC | CUC | 2 | P | L | 87 | |
| 285114 | + | CCG | UCG | 1 | P | S | 95 | |
| 810361 | - | UCC | UCU | 3 | S | S | 48 | |
| 810993 | - | CGU | UGU | 1 | R | C | 64 | |
| 556640 | + | UGC | UGU | 3 | C | C | 84 | |
| 834747 | + (exon 1) | CAU | UAU | 1 | H | Y | 63 | |
| 834901 | + (exon 1) | CCA | CUA | 2 | P | L | 77 | |
| 836320 | + (exon 2) | CUA | UUA | 1 | L | L | 32 | |
| 488236 | - | ACC | AUC | 2 | T | I | 61 | |
| 22103 | - | CCG | UCG | 1 | P | S | 16 | |
| 22105 | - | CCU | CUU | 2 | P | L | 77 | |
| 22173 | - | AUC | AUU | 3 | I | I | 13 | |
| 22389 | - | UCC | UCU | 3 | S | S | 31 | |
| 611859 | + | ACC | ACU | 3 | T | T | 35 | |
| 611934 | + | CGG | UGG | 1 | R | W | 41 | |
| 612084 | + | UAC | UAU | 3 | Y | Y | 40 | |
| 102277 | + | UCC | UCU | 3 | S | S | 96 | |
| 103757 | + | CGC | UCG | 1 | R | S | 16 | |
| 103771 | + | UAC | UAU | 3 | Y | Y | 75 | |
| 263919 | - (exon 2) | CCG | CUG | 2 | P | L | 97 | |
| 263864 | - (exon 2) | UCC | UCU | 3 | S | S | 48 | |
| 264252 | - (exon 2) | CCA | CUA | 2 | P | L | 15 | |
| 264811 | - (exon 2) | CGA | UGA | 1 | R | Stop | 85 | |
| 267470 | - | UCC | UCU | 3 | S | S | 56 | |
| 20376 | - | CCC | CCU | 3 | P | P | 11 | |
| 21929 | - | UCC | UCU | 3 | S | S | 21 | |
| 21974 | - | CCC | CCU | 3 | P | P | 17 | |
Fig 5Phylogenetic analysis based on the mitochondrial genes atp1, cox1, matR, nad5, rps3 from 32 different species from the Lamiales order.