| Literature DB >> 31616005 |
Giobbe Forni1, Guglielmo Puccio1, Thomas Bourguignon2,3, Theodore Evans4, Barbara Mantovani1, Omar Rota-Stabelli5, Andrea Luchetti6.
Abstract
Thousands of eukaryotes transcriptomes have been generated, mainly to investigate nuclear genes expression, and the amount of available data is constantly increasing. A neglected but promising use of this large amount of data is to assemble organelle genomes. To assess the reliability of this approach, we attempted to reconstruct complete mitochondrial genomes from RNA-Seq experiments of Reticulitermes termite species, for which transcriptomes and conspecific mitogenomes are available. We successfully assembled complete molecules, although a few gaps corresponding to tRNAs had to be filled manually. We also reconstructed, for the first time, the mitogenome of Reticulitermes banyulensis. The accuracy and completeness of mitogenomes reconstruction appeared independent from transcriptome size, read length and sequencing design (single/paired end), and using reference genomes from congeneric or intra-familial taxa did not significantly affect the assembly. Transcriptome-derived mitogenomes were found highly similar to the conspecific ones obtained from genome sequencing (nucleotide divergence ranging from 0% to 3.5%) and yielded a congruent phylogenetic tree. Reads from contaminants and nuclear transcripts, although slowing down the process, did not result in chimeric sequence reconstruction. We suggest that the described approach has the potential to increase the number of available mitogenomes by exploiting the rapidly increasing number of transcriptomes.Entities:
Mesh:
Year: 2019 PMID: 31616005 PMCID: PMC6794255 DOI: 10.1038/s41598-019-51313-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Short reads recruitment across iterations. Relationships between the number of iterations of the process (x axis) and mapped reads (% of the total reads; y axis) using congeneric (blue triangles) or intra-familial (orange circles) references.
Figure 2Contigs analysis across iterations. Relationship between the number of iteration process (x axis) and contigs length (bp, primary y axis; solid lines) and contigs number (secondary y axis; dashed lines) using congeneric (blue triangles) or intra-familial (orange circles) references.
Non-mitochondrial reads leakage analysis in Reticulitermes labralis and R. speratus RNA-Seq experiments.
| Contigs at the 10th iteration |
|
| ||
|---|---|---|---|---|
| Congeneric references | Intra-familial references | Congeneric references | Intra-familial references | |
| Total | 34,439 | 33,906 | 262 | 157 |
| Target mitochondrial contigs | 138 (0.40%) | 180 (0.53%) | 32 (12.2%) | 22 (14.0%) |
| Contaminant leakage | 23,819 (69.2%) | 23,572 (69.5%) | 113 (43.1%) | 66 (42.0%) |
| Nuclear leakage | 10,482 (30.4%) | 10,154 (29.9%) | 117 (44.7%) | 69 (43.9%) |
Numbers refer to contigs number (percentage) that were found of contaminant or nuclear DNA origin.
Figure 3Reads coverage along mitogenomes reconstructed using congeneric references. Coverage has been capped at 5,000 × for graphical purposes; shaded areas indicate tRNAs and the control region.
Figure 4Mitogenomes phylogenetic analysis. Schematic drawing of Maximum Likelihood (−lnL = 78672.19) and Bayesian inference (−lnL = 78489.80). Maximum bootstrap (=100%) and posterior probability (=1.0) values are omitted, while they are reported on nodes showing supports lower than the maximum (indicated with red dots). Mitogenomes assembled from RNA-Seq are highlighted in red.
Data information for Illumina RNA-Seq experiments and conspecific mitogenomes obtained from NCBI Genbank database.
| Species | SRA acc. no. | Gbp | Reads length | Conspecific mitogenomes |
|---|---|---|---|---|
|
| SRR1325101 | 2.4 | 51 (single end) | KY484910 |
| EF206314 | ||||
| EF206315a | ||||
| EF206316 | ||||
| EF206317 | ||||
|
| SRR1325103 | 1.5 | 51 (single end) | KU925237 |
|
| SRR5253660 | 1.3 | 51 (single end) | — |
|
| SRR1325112 | 2.3 | 51 (single end) | MK088051 |
|
| SRR5808263 | 9.1 | 150 (pair end) | KT224427 |
| KU877221 | ||||
|
| DRR030843 | 3.4 | 93 (pair end) | KY484910 |
aThis was attributed to R. santonensis, which is synonym species of R. flavipes.
Figure 5Conceptual map of the pipeline implemented for mitogenome reconstruction from RNA-Seq raw reads.