| Literature DB >> 27006764 |
Alex Crampton-Platt1, Douglas W Yu2, Xin Zhou3, Alfried P Vogler4.
Abstract
'Mitochondrial metagenomics' (MMG) is a methodology for shotgun sequencing of total DNA from specimen mixtures and subsequent bioinformatic extraction of mitochondrial sequences. The approach can be applied to phylogenetic analysis of taxonomically selected taxa, as an economical alternative to mitogenome sequencing from individual species, or to environmental samples of mixed specimens, such as from mass trapping of invertebrates. The routine generation of mitochondrial genome sequences has great potential both for systematics and community phylogenetics. Mapping of reads from low-coverage shotgun sequencing of environmental samples also makes it possible to obtain data on spatial and temporal turnover in whole-community phylogenetic and species composition, even in complex ecosystems where species-level taxonomy and biodiversity patterns are poorly known. In addition, read mapping can produce information on species biomass, and potentially allows quantification of within-species genetic variation. The success of MMG relies on the formation of numerous mitochondrial genome contigs, achievable with standard genome assemblers, but various challenges for the efficiency of assembly remain, particularly in the face of variable relative species abundance and intra-specific genetic variation. Nevertheless, several studies have demonstrated the power of mitogenomes from MMG for accurate phylogenetic placement, evolutionary analysis of species traits, biodiversity discovery and the establishment of species distribution patterns; it offers a promising avenue for unifying the ecological and evolutionary understanding of species diversity.Entities:
Keywords: Biodiversity; Community ecology; Genome assembly; Illumina; Phylogenetics; Shotgun sequencing
Mesh:
Substances:
Year: 2016 PMID: 27006764 PMCID: PMC4802855 DOI: 10.1186/s13742-016-0120-y
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Fig. 1A schematic diagram of mitochondrial metagenomics. The central panel (red) represents the ‘contig-based’ analyses, using a database of complete (circles) or partial (lines) mitochondrial contigs. These are derived from one or more sources: sequencing of taxonomically chosen specimens and/or representative specimens from an ecological study (Voucher MMG; left panel); direct assembly of ecological bulk samples (Bulk MMG; right panel); external databases containing identified mitogenome sequences (superbarcodes), such as NCBI. Specimens for voucher and bulk MMG are shotgun- sequenced in mixtures, assembled with standard assembly pipelines, annotated for each gene, and assigned to known species through matches with cox1 barcodes or other mitochondrial sequences from well-identified specimens where applicable. The ‘contig-based’ analysis concludes with a phylogenetic analysis, with the tree updated as new data become available in an iterative process. This set of mitogenomes can then be used as a reference for a ‘read-based’ analysis (green panel). Here the presence and possible abundance of a given species in the local assemblage is determined by mapping reads from ecological bulk samples against the mitogenome database (dashed arrows). The knowledge of the phylogenetic tree provides an evolutionary perspective to all species in the study
Fig. 2Coverage and mitochondrial contig length. a Coverage is approximately proportional to input species biomass; therefore, sequence contiguity (contig length) should increase with coverage, up to the minimum level of coverage required to obtain a full-length (~15–20 kb) mitogenome assembly. Increasing sequencing depth beyond this point is not cost-effective. b An example from [28], showing the mitochondrial contigs obtained in a reference set (one specimen per morphologically identified species, normalized for roughly equal DNA concentration based on body size), with read coverage calculated for each contig based on number of reads mapped. c Assembly from the same study [28] but made from mixed bulk specimens. d The use of different assemblers (left: IDBA-UD; middle: Newbler; right: Celera) on a mixed sample of rainforest beetles [20], showing fairly incomplete assembly even for mitochondrial contigs with high coverage. e Combining these three assemblies in Geneious to increase sequence contiguity resolves a large number of these cases but is not completely effective