Literature DB >> 29668970

MitoFish and MiFish Pipeline: A Mitochondrial Genome Database of Fish with an Analysis Pipeline for Environmental DNA Metabarcoding.

Yukuto Sato1,2, Masaki Miya3, Tsukasa Fukunaga4, Tetsuya Sado3, Wataru Iwasaki4,5,6.   

Abstract

Fish mitochondrial genome (mitogenome) data form a fundamental basis for revealing vertebrate evolution and hydrosphere ecology. Here, we report recent functional updates of MitoFish, which is a database of fish mitogenomes with a precise annotation pipeline MitoAnnotator. Most importantly, we describe implementation of MiFish pipeline for metabarcoding analysis of fish mitochondrial environmental DNA, which is a fast-emerging and powerful technology in fish studies. MitoFish, MitoAnnotator, and MiFish pipeline constitute a key platform for studies of fish evolution, ecology, and conservation, and are freely available at http://mitofish.aori.u-tokyo.ac.jp/ (last accessed April 7th, 2018).

Entities:  

Mesh:

Year:  2018        PMID: 29668970      PMCID: PMC5967551          DOI: 10.1093/molbev/msy074

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


Introduction

Fish occupy an important position in the vertebrate evolution and hydrosphere ecology, and genetic information from their mitochondrial genomes (mitogenomes) plays a key role in the investigation of their evolutionary histories and the protection and management of biological diversity. Mitofish is a database of fish mitogenomes with precise de novo annotations, and freely available at http://mitofish.aori.u-tokyo.ac.jp/. Since its major update in 2013 (Iwasaki et al. 2013), MitoFish has been actively and widely used by those in evolutionary science, ecology, ichthyology, fisheries, and conservation science from academia, government, and industry. MitoFish and its fish mitogenome annotation pipeline MitoAnnotator now receive >40,000 page views per year from around the world. In addition to its regular update of the data content, MitoFish has acquired two new functions since 2013. One is the multiple sequence annotation function, through which users can easily annotate many mitogenomic sequences for phylogeographic studies, for example. The other, more important recent functional development in MitoFish is the implementation of MiFish pipeline (http://mitofish.aori.u-tokyo.ac.jp/mifish/). The recent advance in the high-throughput sequencing technology has enabled a new powerful approach in fish studies, that is, the metabarcoding analysis of environmental DNA (eDNA) (Deiner et al. 2017). It has been proved that fish (and tetrapod) mitochondrial DNA can be efficiently amplified by PCR from various environmental samples that include seawater, freshwater, sediment, and gut content (Miya et al. 2015; Ushio et al. 2017). eDNA analysis is a cost-effective and high-throughput approach to investigate species diversity in a noninvasive way, although several factors such as potential contaminations need to be taken cared of. MiFish is a set of universal PCR primers for effective metabarcoding of fish eDNA (Miya et al. 2015). As a powerful metabarcoding tool for biodiversity monitoring, MiFish primers were developed to target a hypervariable region within the fish mitochondrial 12S rRNA gene that is flanked by two highly conservative regions based on the MitoFish data. MiFish pipeline on the MitoFish server is a user-friendly pipeline for analyzing fish metabarcoding data to estimate the species composition and ecological characteristics of natural environment. Whereas a number of computational tools are available for microbial metabarcoding analysis, there are few for the eDNA metabarcoding analysis of larger organisms. MiFish pipeline serves as a useful tool for those who are interested in diversity and ecological studies of fishes.

MiFish Pipeline

As a new function on the MitoFish server, MiFish pipeline accepts and analyzes fish mitogenomic metabarcoding data, which include those produced using the MiFish primers (fig. 1). The overall sequence quality is assessed by FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) and low-quality (Phred score < 10 by default) 3′-tails are trimmed by DynamicTirm.pl (Cox et al. 2010). The paired-end reads are merged by FLASH (Magoc and Salzberg 2011) and erroneous merged reads that contain N-nucleotides or do not have typical lengths are removed (229 ± 25 bp by default). The primer sequences are removed by TagCleaner (Schmieder et al. 2010) by allowing three-base mismatches at the maximum. Species-level taxonomic assignment is performed using Uclust (Edgar 2010) and NCBI Blast+ (Camacho et al. 2009; fig. 1). Redundant sequences are merged into one sequence by keeping the count information. Then, low read-number sequences (<10 by default) are remapped onto high read-number sequences (≥10) at a given sequence-similarity threshold (99% by default), and the unmapped sequences are discarded. Blastn searches are conducted against MitoFish as a reference database with cutoff values of identity 97% and e-value 10−5, and species names of the top-hit sequences are retrieved. If the second to fifth top-hit sequences of each Blast search contain those of different species, confidence scores of the species assignment are calculated using the following formula:
. 1.

Screenshots of MiFish pipeline. (A) An overview of the pipeline. (B) A view of a progress status shown during pipeline execution. (C) An HTML report that contains results of the species assignment, sequence counts, and web-page links. For each species, a detailed report with confidence scores is provided.

Screenshots of MiFish pipeline. (A) An overview of the pipeline. (B) A view of a progress status shown during pipeline execution. (C) An HTML report that contains results of the species assignment, sequence counts, and web-page links. For each species, a detailed report with confidence scores is provided. All-species and within-species molecular phylogenetic trees are estimated for each environmental sample. Multiple sequence alignments are generated by MAFFT (Katoh and Standley 2013) and neighbor-joining phylogenetic trees are estimated by Morphy (Adachi and Hasegawa 1992). An HTML report is finally presented, which can also be used for calculating ecological indices such as alpha diversity, beta diversity, and correlation coefficients (fig. 1). This report also contains links to major databases such as FishBase (Froese and Pauly 2017), Barcode of Life (Ratnasingham and Hebert 2007), Global Biodiversity Information Facility (Edwards 2004), and MitoFish. Figure 2 shows an example of fish eDNA analysis results produced by MiFish pipeline. The water sample was taken at Uchidomari river in Okinawa Island, Japan (fig. 2) and filtrated up to 960 ml using 0.45 µm Sterivex filter (Millipore) or 0.70 µm glass-fiber filter (Whatman GF/F). DNA was extracted using DNeasy PowerWater Sterivex and DNeasy Blood & Tissue kits (Qiagen) from the Sterivex and glass-fiber filters, respectively. MiFish primers (Miya et al. 2015) were used to amplify eDNA with the annealing temperature of 60°C. MiSeq with V2 chemistry (Illumina) was used for 150-bp paired-end sequencing.
. 2.

Application of MiFish pipeline to a fish eDNA data set. (A) Photos of the sampling site in Okinawa Island, Japan. (B) A summary of the species assignment results produced by MiFish pipeline. The left and right bars indicate the data of eDNA extracted using the Sterivex and glass-fiber filters, respectively. Read numbers are expressed as percentages to the total read numbers. Nonnative species in Okinawa Island are highlighted (Yellow: Invasive species, Pink: Salmons). Fish pictures are from FishBase.

Application of MiFish pipeline to a fish eDNA data set. (A) Photos of the sampling site in Okinawa Island, Japan. (B) A summary of the species assignment results produced by MiFish pipeline. The left and right bars indicate the data of eDNA extracted using the Sterivex and glass-fiber filters, respectively. Read numbers are expressed as percentages to the total read numbers. Nonnative species in Okinawa Island are highlighted (Yellow: Invasive species, Pink: Salmons). Fish pictures are from FishBase. The estimated species composition well represented the fish community in the rivers in Okinawa Island (fig. 2). Dominant species included those typically observed in Okinawa Island rivers such as gobies (e.g., Tridentiger kuroiwae, Rhinogobius giurinus, and Redigobius bikolanus), mullets (e.g., Chelon macrolepis and Chelon affinis), and giant mottled eel (Anguilla marmorata), as well as invasive alien species such as nonnative tilapias (genus Oreochromis) and plecos (Hypostomus plecostomus). It may be noted that salmons (genus Onchorhynchus) also appeared in the list presumably because they are widely consumed as food in Okinawa and drainage likely contains their eDNA. These results would exemplify that MiFish pipeline is a useful tool for eDNA analysis of the endemic fish species composition, invasive species, and also impact of human activities, whereas the detection of salmons also suggests that eDNA analysis can be affected by unexpected environmental influences and/or contamination and need to be interpreted with caution. Taken together, MitoFish, MitoAnnotator, and MiFish pipeline constitute a key platform for studies of evolution, ecology, and conservation of fishes.
  11 in total

1.  Search and clustering orders of magnitude faster than BLAST.

Authors:  Robert C Edgar
Journal:  Bioinformatics       Date:  2010-08-12       Impact factor: 6.937

2.  FLASH: fast length adjustment of short reads to improve genome assemblies.

Authors:  Tanja Magoč; Steven L Salzberg
Journal:  Bioinformatics       Date:  2011-09-07       Impact factor: 6.937

Review 3.  Environmental DNA metabarcoding: Transforming how we survey animal and plant communities.

Authors:  Kristy Deiner; Holly M Bik; Elvira Mächler; Mathew Seymour; Anaïs Lacoursière-Roussel; Florian Altermatt; Simon Creer; Iliana Bista; David M Lodge; Natasha de Vere; Michael E Pfrender; Louis Bernatchez
Journal:  Mol Ecol       Date:  2017-10-26       Impact factor: 6.185

4.  Environmental DNA enables detection of terrestrial mammals from forest pond water.

Authors:  Masayuki Ushio; Hisato Fukuda; Toshiki Inoue; Kobayashi Makoto; Osamu Kishida; Keiichi Sato; Koichi Murata; Masato Nikaido; Tetsuya Sado; Yukuto Sato; Masamichi Takeshita; Wataru Iwasaki; Hiroki Yamanaka; Michio Kondoh; Masaki Miya
Journal:  Mol Ecol Resour       Date:  2017-06-11       Impact factor: 7.090

5.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

6.  BLAST+: architecture and applications.

Authors:  Christiam Camacho; George Coulouris; Vahram Avagyan; Ning Ma; Jason Papadopoulos; Kevin Bealer; Thomas L Madden
Journal:  BMC Bioinformatics       Date:  2009-12-15       Impact factor: 3.169

7.  TagCleaner: Identification and removal of tag sequences from genomic and metagenomic datasets.

Authors:  Robert Schmieder; Yan Wei Lim; Forest Rohwer; Robert Edwards
Journal:  BMC Bioinformatics       Date:  2010-06-23       Impact factor: 3.169

8.  MitoFish and MitoAnnotator: a mitochondrial genome database of fish with an accurate and automatic annotation pipeline.

Authors:  Wataru Iwasaki; Tsukasa Fukunaga; Ryota Isagozawa; Koichiro Yamada; Yasunobu Maeda; Takashi P Satoh; Tetsuya Sado; Kohji Mabuchi; Hirohiko Takeshima; Masaki Miya; Mutsumi Nishida
Journal:  Mol Biol Evol       Date:  2013-08-16       Impact factor: 16.240

9.  bold: The Barcode of Life Data System (http://www.barcodinglife.org).

Authors:  Sujeevan Ratnasingham; Paul D N Hebert
Journal:  Mol Ecol Notes       Date:  2007-05-01

10.  MiFish, a set of universal PCR primers for metabarcoding environmental DNA from fishes: detection of more than 230 subtropical marine species.

Authors:  M Miya; Y Sato; T Fukunaga; T Sado; J Y Poulsen; K Sato; T Minamoto; S Yamamoto; H Yamanaka; H Araki; M Kondoh; W Iwasaki
Journal:  R Soc Open Sci       Date:  2015-07-22       Impact factor: 2.963

View more
  27 in total

1.  Mitochondrial genome architecture and phylogenetic relationships of Odontesthes argentinensis within Atherinomorpha.

Authors:  Javier Calvelo; Alejandro D'Anatro
Journal:  Genetica       Date:  2021-04-04       Impact factor: 1.082

2.  Comparison of species-specific qPCR and metabarcoding methods to detect small pelagic fish distribution from open ocean environmental DNA.

Authors:  Zeshu Yu; Shin-Ichi Ito; Marty Kwok-Shing Wong; Susumu Yoshizawa; Jun Inoue; Sachihiko Itoh; Ryuji Yukami; Kazuo Ishikawa; Chenying Guo; Minoru Ijichi; Susumu Hyodo
Journal:  PLoS One       Date:  2022-09-07       Impact factor: 3.752

3.  Applying convolutional neural networks to speed up environmental DNA annotation in a highly diverse ecosystem.

Authors:  Benjamin Flück; Laëtitia Mathon; Stéphanie Manel; Alice Valentini; Tony Dejean; Camille Albouy; David Mouillot; Wilfried Thuiller; Jérôme Murienne; Sébastien Brosse; Loïc Pellissier
Journal:  Sci Rep       Date:  2022-06-17       Impact factor: 4.996

4.  Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes.

Authors:  Alexander Donath; Frank Jühling; Marwa Al-Arab; Stephan H Bernhart; Franziska Reinhardt; Peter F Stadler; Martin Middendorf; Matthias Bernt
Journal:  Nucleic Acids Res       Date:  2019-11-18       Impact factor: 16.971

Review 5.  Mesozoic origin and 'out-of-India' radiation of ricefishes (Adrianichthyidae).

Authors:  Kazunori Yamahira; Satoshi Ansai; Ryo Kakioka; Hajime Yaguchi; Takeshi Kon; Javier Montenegro; Hirozumi Kobayashi; Shingo Fujimoto; Ryosuke Kimura; Yusuke Takehana; Davin H E Setiamarga; Yasuoki Takami; Rieko Tanaka; Ken Maeda; Hau D Tran; Noriyuki Koizumi; Shinsuke Morioka; Vongvichith Bounsong; Katsutoshi Watanabe; Prachya Musikasinthorn; Sein Tun; L K C Yun; Kawilarang W A Masengi; V K Anoop; Rajeev Raghavan; Jun Kitano
Journal:  Biol Lett       Date:  2021-08-04       Impact factor: 3.812

6.  Environmental DNA metabarcoding to detect pathogenic Leptospira and associated organisms in leptospirosis-endemic areas of Japan.

Authors:  Yukuto Sato; Masaru Mizuyama; Megumi Sato; Toshifumi Minamoto; Ryosuke Kimura; Claudia Toma
Journal:  Sci Rep       Date:  2019-04-25       Impact factor: 4.379

7.  Identification of key genes and pathways downstream of the β-catenin-TCF7L1 complex in pancreatic cancer cells using bioinformatics analysis.

Authors:  Yi-Hang Yuan; Jian Zhou; Yan Zhang; Meng-Dan Xu; Jing Wu; Wei Li; Meng-Yao Wu; Dao-Ming Li
Journal:  Oncol Lett       Date:  2019-06-06       Impact factor: 2.967

8.  Complete mitochondrial genomes of two Pleuronectid species: Clidoderma asperrimum and Verasper variegatus (Teleostei: Pleuronectiformes: Pleuronectidae).

Authors:  Han Kyu Lim; Hyo Sun Jung; Moongeun Yoon; Sang-Hwa Lee; Dong Soo Kim
Journal:  Mitochondrial DNA B Resour       Date:  2019-11-12       Impact factor: 0.658

9.  FishDB: an integrated functional genomics database for fishes.

Authors:  Liandong Yang; Zetan Xu; Honghui Zeng; Ning Sun; Baosheng Wu; Cheng Wang; Jing Bo; Lin Li; Yang Dong; Shunping He
Journal:  BMC Genomics       Date:  2020-11-17       Impact factor: 3.969

10.  Simultaneous absolute quantification and sequencing of fish environmental DNA in a mesocosm by quantitative sequencing technique.

Authors:  Tatsuhiko Hoshino; Ryohei Nakao; Hideyuki Doi; Toshifumi Minamoto
Journal:  Sci Rep       Date:  2021-02-23       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.