| Literature DB >> 26137446 |
Basdeo Kushwaha1, Ravindra Kumar1, Suyash Agarwal1, Manmohan Pandey1, N S Nagpure1, Mahender Singh1, Shreya Srivastava1, C G Joshi2, P Das3, L Sahoo3, P Jayasankar3, P K Meher3, T M Shah2, A B Patel2, Namrata Patel2, P Koringa2, Sofia Priyadarsani Das3, Siddhi Patnaik3, Amrita Bit3, M A Iquebal4, Dinesh Kumar4, J K Jena1.
Abstract
Whole genome sequencing (WGS) using next generation sequencing technologies paves the way to sequence the mitochondrial genomes with greater ease and lesser time. Here, we used the WGS data of Clarias batrachus, generated from Roche 454 and Ion Torrent sequencing platforms, to assemble the complete mitogenome using both de novo and reference based approaches. Both the methods yielded almost similar results and the best assembled mitogenome was of 16,510 bp size (GenBank Acc. No. KM259918). The mitogenome annotation resulted in 13 coding genes, 22 tRNA genes, 2 rRNA genes and one control region, and the gene order was found to be identical with other catfishes. Variation analyses between assembled and the reference (GenBank Acc. No. NC_023923) mitogenome revealed 51 variations. The phylogenetic analysis of coding DNA sequences and tRNA supports the monophyly of catfishes. Two SSRs were identified in C. batrachus mitogenome, out of which one was unique to this species. Based on the relative rate of gene evolution, protein coding mitochondrial genes were found to evolve at a much faster pace than the d-loop, which in turn are followed by the rRNAs; the tRNAs showed wide variability in the rate of sequence evolution, and on average evolve the slowest. Among the coding genes, ND2 evolves most rapidly. The variations present in the coding regions of the mitogenome and their comparative analyses with other catfish species may be useful in species conservation and management programs.Entities:
Keywords: Clarias batrachus; Indian catfish; Mitogenome; Phylogenetics; WGS data
Year: 2015 PMID: 26137446 PMCID: PMC4484717 DOI: 10.1016/j.mgene.2015.06.004
Source DB: PubMed Journal: Meta Gene ISSN: 2214-5400
Raw data QC statistics of all the 4 runs.
| Platform/Run | Number of reads | Avg. length | Number of reads after trimming | Percentage of high quality reads |
|---|---|---|---|---|
| Run1_454 | 14,88,526 | 350.2 | 14,45,418 | 97.1 |
| Run2_454 | 15,39,353 | 372.7 | 15,03,731 | 97.69 |
| Run1_Ion_Torrent | 27,56,593 | 333.2 | 27,26,188 | 98.9 |
| Run2_Ion_Torrent | 33,94,980 | 300.6 | 32,96,863 | 97.11 |
Statistics of De novo and Reference guided mitogenome assembly.
| Assembler | Data set | Reference guided assembled mitogenome | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mitogenome length | No. of reads | Query coverage (%) | Percent similarity (%) | Average coverage | Mitogenome length | No. of reads | Query coverage (%) | Percent similarity (%) | ||
| CLC | Pooled 454 | 16,509 | 7655 | 100 | 99.4 | 198.77 | 16,474 | 7712 | 100 | 99.4 |
| CLC | Pooled Ion Torrent | 16,512 | 4492 | 100 | 99.4 | 87.93 | 16,493 | 4506 | 100 | 99.4 |
| CLC | Pooled all | 16,509 | 12,160 | 100 | 99.4 | 284.67 | 16,476 | 12,218 | 99 | 99.4 |
| Newbler | Pooled 454 | 16,509 | 7636 | 100 | 99.4 | 198.3 | 16,509 | 7511 | 100 | 99.4 |
| Newbler | Pooled Ion Torrent | 16,517 | 4469 | 100 | 99.4 | 87.2 | 16,331 | 4464 | 98 | 99.4 |
| Newbler | Pooled all | 16,510 | 12,094 | 100 | 99.5 | 284.5 | 16,509 | 11,975 | 100 | 99.4 |
Fig. 1Complete mitogenome of C. batrachas. The position of 13 protein coding genes, 22 tRNA genes, 2 rRNA genes and d-loop are shown using abbreviations given in the text. All protein coding genes, except ND6, are encoded on the heavy strand and depicted in the outer circle with clockwise transcriptional polarity.
Fig. 2Spanning of genes across mitochondrial genome. The maximum spanning was observed for PCG's followed by tRNA genes.
Fig. 3The stem loop structure of OL region from C. batrachus.
Fig. 4CSB region of d-loop in C. batrachus. The figure depicts the conserved d-loop motifs among the catfishes. Motif 1 is conserved throughout the human.
Fig. 5Gene-wise average coverage. This depicts the average coverage of 13 PCG's, 2 rRNAs and d-Loop region. Average coverage of all pooled and 454 data was quite high as compared to that of Ion Torrent.
Fig. 6Venn diagram of common variations among 4 runs.
Fig. 7Gene-wise variation histogram. This figure depicts the number of variations for mitochondrial genes along with intergenic spacers.
Fig. 8A — CDS regions based phylogenetic relationships among the catfishes. B — CDS + tRNA regions based phylogenetic relationships among the catfishes.
Fig. 9Heatmap of codon usage.
Fig. 10Relative rate of molecular evolution. Bars represent the coefficients of linear least squares regression, where regressions of the pairwise distances of all genes were calculated against the distances of the 12S rRNAs.