| Literature DB >> 29721275 |
Denis Jacob Machado1, Daniel Janies2, Cory Brouwer2,3, Taran Grant4.
Abstract
We applied a novel strategy to infer sequence circularity and complete assembly of four mitochondrial genomes (mitogenomes) of the frog families Bufonidae (Melanophryniscus moreirae), Dendrobatidae (Hyloxalus subpunctatus and Phyllobates terribilis), and Scaphiopodidae (Scaphiopus holbrookii). These are the first complete mitogenomes of these four genera and Scaphiopodidae. We assembled mitogenomes from short genomic sequence reads using a baiting and iterative mapping strategy followed by a new ad hoc mapping strategy developed to test for assembly circularization. To assess the quality of the inferred circularization, we used Bowtie2 alignment scores and a new per-position sequence coverage value (which we named "connectivity"). Permutation tests with 400 iterations per specimen and 1% or 5% chance of mutation at the ends of the putative circular sequences showed that the proposed method is highly sensitive, with a single nucleotide insertion or deletion being sufficient for circularity to be rejected. False positives comprised only 2% of all observations and possessed significantly lower alignment scores. The size, gene content, and gene arrangement of each mitogenome differed among the species but matched the expectations for their clades. We argue that basic studies on circular sequences can benefit from the results and bioinformatics procedures introduced here, especially when closely related references are lacking.Entities:
Keywords: amphibians; bioinformatics; circularity; genomics; mitochondrial genome
Year: 2018 PMID: 29721275 PMCID: PMC5916287 DOI: 10.1002/ece3.3918
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Figure 1Main steps of our strategy to infer circularity. (a) We search for k‐mers of a specified length, from the end to the middle of the scaffold, with the condition that they are at a minimum distance from each other. (b) The longest putative circular sequence found for each k‐mer size is flipped, so the 5′ and 3′ ends will be adjacent to each other in the middle of the fragment. (c) Original sequence reads are remapped against the flipped putative circular sequence. All the mapped reads (represented by reads 1–3) contribute to the average alignment score. For each nucleotide, only the reads that support its position in relation to the two adjacent nucleotides (represented by read 1) are counted to determine the contiguity coverage
Number of base pairs, average sequence coverage, and nucleotide composition of the new mitogenomes
| Species | Base pairs | Avg. coverage | Overall base composition (%) | ||||
|---|---|---|---|---|---|---|---|
| A | C | G | T | GC | |||
|
| 16,881 | 1,326.69 | 32.40 | 24.90 | 20.10 | 22.60 | 44.90 |
|
| 18,005 | 196.58 | 30.30 | 24.20 | 14.20 | 31.30 | 38.40 |
|
| 17,702 | 2,277.35 | 28.30 | 26.00 | 14.60 | 31.00 | 40.60 |
|
| 16,751 | 871.75 | 26.90 | 27.60 | 14.90 | 30.60 | 42.40 |
Figure 2Genome arrangement in the mitochondrial genome of (a) Scaphiopus holbrookii and (b) Hyloxalus subpunctatus, Melanoprhyniscus moreirae, and Phyllobates terribilis