Literature DB >> 34791062

MinoTour, real-time monitoring and analysis for nanopore sequencers.

Rory Munro1, Roberto Santos1,2, Alexander Payne1, Teri Forey1, Solomon Osei3, Nadine Holmes4, Matthew Loose1,4.   

Abstract

SUMMARY: MinoTour offers a LIMS system for Oxford Nanopore Technology sequencers, with real-time metrics and analysis available permanently for review. Integration of unique real-time automated analysis can reduce the time required to answer biological questions, including mapping and classification of sequence whilst a run is in progress. Real-time sequence data requires new methods of analysis which do not wait for the completion of a run and MinoTour provides a framework to allow users to exploit these features.
AVAILABILITY AND IMPLEMENTATION: Source code and documentation are available at https://github.com/LooseLab/minotourcli and https://github.com/LooseLab/minotourapp. Docker images are available from https://hub.docker.com/r/adoni5/, and can be installed using a preconfigured docker-compose script at https://github.com/LooseLab/minotour-docker. An example server is available at http://137.44.59.170. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) (2021). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Year:  2021        PMID: 34791062      PMCID: PMC8796371          DOI: 10.1093/bioinformatics/btab780

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

High throughput real-time portable sequencing has transformed next generation sequencing from specialized centres to individual laboratories and previously unimaginable locations (Castro-Wallace ; Quick ). Uniquely, Oxford Nanopore Technologies (ONT) sequencing enables true real-time analysis as sequence data are made available during, as well as after, a run (Loose ; Payne ). Rapid analysis of data provides advantages where time to answer is important, such as pathogen genomics and clinical diagnosis of disease (Goes ; Martignano ; Quick ; Sone ). During a sequencing run, sequencing is controlled and tracked via ONT’s MinKNOW software. MinKNOW can be used to remotely observe and monitor the progress of sequencing runs, view reports on metrics and recent historical run data. Numerous tools provide analysis of nanopore FASTQ data either during or after a run (Bruno ; De Coster ). ONT also provides a cloud based service, epi2me (ONT, 2021), which enables various automated downstream analyses. However, as we show below, minoTour uniquely can capture both real-time sequence data and run metrics, can provide analysis and also acts as a comprehensive run archive. minoTour is open source and extensible, written using the Django framework, providing real-time insights into sequencer performance and sequence analysis. Visualized metrics allow users to see flow cell performance in real time. Built-in pipelines for alignment and metagenomics allow users to see experimental results in real time. Additional pipelines can easily be incorporated. For example, we can include a customized version of the ARTIC pipeline for SARS-CoV2 analysis described in more detail elsewhere (Munro ). minoTour also provides built-in adaptive sequencing (ReadFish only) (Payne ) support in visualizations enabling users to monitor targeted sequencing in real time.

2 Materials and methods

Figure  1 illustrates the path data takes from the sequencer to the user. A single python tool, minFQ (https://github.com/LooseLab/minotourcli), collects data from two sources. Firstly, run metrics are collected from MinKNOW via an ONT provided application programming interface (API). Secondly, base-called data are read either from user specified locations or found via the ONT API. These data are collected independently of each other, with only one needed to create the flowcell entry. They are sent to the minoTour server, which saves the sequencer metrics into the database, and stores the sequence data in Redis, a cache database. Celery performs asynchronous analysis of read data and can apply custom pipelines to perform analyses, ranging from base-called data summarizing to sequence alignment, metagenomics or custom workflows such as ARTIC (Munro ; Tyson ). Read data are split into individual flow cells, which can be selected from a table to show all data held for individual runs using that flow cell (Supplementary  Fig.  S1). These data can optionally be shared with other minoTour users if desired (Supplementary  Fig.  S2). minoTour can also be configured to use twitter/email APIs and notify users about flow cell events such as disk shortages or reaching prespecified coverage thresholds for a reference genome (Supplementary  Figs S3  and  S4). Reverse communication with the MinKNOW API allows users to send messages back into the MinKNOW logs for a permanent record such as notes on library reloading or comments on flow cell performance.
Fig. 1.

Implementation of minoTour showing the flow of base-called FASTQ data and sequencer metrics from the sequencer to the server, through various optional pipelines and finally to visualization for the user

Implementation of minoTour showing the flow of base-called FASTQ data and sequencer metrics from the sequencer to the server, through various optional pipelines and finally to visualization for the user

3 Results and discussion

One of minoTour’s strengths is its ability to automatically provide a historic record of runs (Supplementary  Fig.  S1), and provide real-time key insights into ongoing runs (Supplementary  Figs  S5  and  S6). minoTour also provides a simple overview of all ongoing connected sequencers, quickly showing a user if any are under-performing, such as falling short on yield or speed, who can then respond accordingly (Supplementary  Fig.  S7). minoTour can also remotely stop a run if required via the website. Using minoTour’s minimap2 alignment pipeline for monitoring targeted or adaptive sequencing with ReadFish one can rapidly tell if coverage is accruing over specific targeted regions with sequenced and rejected reads able to be visualized separately (Supplementary  Fig.  S8). minoTour provides a simple metagenomics pipeline, using centrifuge (Kim ) to visualize sample make-up, a breakdown of broad composition as well as more detailed investigation of select prechosen species by aligning reads classified in that taxa against one or more references (Supplementary  Fig.  S9). More complex pipelines such as the ARTIC pipeline can be incorporated (Supplementary  Fig.  S10). We routinely use minoTour in the laboratory to monitor sequencing and use the available analyses to provide insights into sequencing experiments. The use of the Django framework enables other users to extend and develop minoTour at will. minFQ, once installed and activated, remains in the background uploading data to the minoTour server, automatically detecting new runs and sequence data (Supplementary  Fig.  S11). Our implementation of minoTour with docker alongside detailed server installations for development leave the user with a variety of installation options. minoTour has been run on large centralized servers, or locally on laptops, enabling monitoring of sequencing runs for collaborative groups, sequencing facilities and individual users. At this time minoTour supports all platforms running flongle or minION flowcells. minoTour currently supports promethION for live data monitoring. Base-called data can be uploaded provided a suitably powerful server is running the app. Analysis pipelines will run, but will most likely fall behind the volumes of data that can be generated by a promethION. Click here for additional data file.
  10 in total

Review 1.  Acute Vector-Borne Viral Infection: Zika and MinION Surveillance.

Authors:  Jaqueline Goes de Jesus; Marta Giovanetti; Nuno Rodrigues Faria; Luiz Carlos Junior Alcantara
Journal:  Microbiol Spectr       Date:  2019-07

2.  Long-read sequencing identifies GGC repeat expansions in NOTCH2NLC associated with neuronal intranuclear inclusion disease.

Authors:  Jun Sone; Satomi Mitsuhashi; Atsushi Fujita; Takeshi Mizuguchi; Kohei Hamanaka; Keiko Mori; Haruki Koike; Akihiro Hashiguchi; Hiroshi Takashima; Hiroshi Sugiyama; Yutaka Kohno; Yoshihisa Takiyama; Kengo Maeda; Hiroshi Doi; Shigeru Koyano; Hideyuki Takeuchi; Michi Kawamoto; Nobuo Kohara; Tetsuo Ando; Toshiaki Ieda; Yasushi Kita; Norito Kokubun; Yoshio Tsuboi; Kazutaka Katoh; Yoshihiro Kino; Masahisa Katsuno; Yasushi Iwasaki; Mari Yoshida; Fumiaki Tanaka; Ikuo K Suzuki; Martin C Frith; Naomichi Matsumoto; Gen Sobue
Journal:  Nat Genet       Date:  2019-07-22       Impact factor: 38.330

3.  BoardION: real-time monitoring of Oxford Nanopore sequencing instruments.

Authors:  Aimeric Bruno; Jean-Marc Aury; Stefan Engelen
Journal:  BMC Bioinformatics       Date:  2021-05-13       Impact factor: 3.169

4.  Real-time selective sequencing using nanopore technology.

Authors:  Matthew Loose; Sunir Malla; Michael Stout
Journal:  Nat Methods       Date:  2016-07-25       Impact factor: 28.547

5.  Centrifuge: rapid and sensitive classification of metagenomic sequences.

Authors:  Daehwan Kim; Li Song; Florian P Breitwieser; Steven L Salzberg
Journal:  Genome Res       Date:  2016-10-17       Impact factor: 9.043

6.  Nanopore DNA Sequencing and Genome Assembly on the International Space Station.

Authors:  Sarah L Castro-Wallace; Charles Y Chiu; Kristen K John; Sarah E Stahl; Kathleen H Rubins; Alexa B R McIntyre; Jason P Dworkin; Mark L Lupisella; David J Smith; Douglas J Botkin; Timothy A Stephenson; Sissel Juul; Daniel J Turner; Fernando Izquierdo; Scot Federman; Doug Stryke; Sneha Somasekar; Noah Alexander; Guixia Yu; Christopher E Mason; Aaron S Burton
Journal:  Sci Rep       Date:  2017-12-21       Impact factor: 4.379

7.  Readfish enables targeted nanopore sequencing of gigabase-sized genomes.

Authors:  Alexander Payne; Nadine Holmes; Thomas Clarke; Rory Munro; Bisrat J Debebe; Matthew Loose
Journal:  Nat Biotechnol       Date:  2020-11-30       Impact factor: 54.908

8.  Nanopore sequencing from liquid biopsy: analysis of copy number variations from cell-free DNA of lung cancer patients.

Authors:  Filippo Martignano; Uday Munagala; Stefania Crucitta; Alessandra Mingrino; Roberto Semeraro; Marzia Del Re; Iacopo Petrini; Alberto Magi; Silvestro G Conticello
Journal:  Mol Cancer       Date:  2021-02-12       Impact factor: 27.401

9.  NanoPack: visualizing and processing long-read sequencing data.

Authors:  Wouter De Coster; Svenn D'Hert; Darrin T Schultz; Marc Cruts; Christine Van Broeckhoven
Journal:  Bioinformatics       Date:  2018-08-01       Impact factor: 6.937

10.  Real-time, portable genome sequencing for Ebola surveillance.

Authors:  Joshua Quick; Nicholas J Loman; Sophie Duraffour; Jared T Simpson; Ettore Severi; Lauren Cowley; Joseph Akoi Bore; Raymond Koundouno; Gytis Dudas; Amy Mikhail; Nobila Ouédraogo; Babak Afrough; Amadou Bah; Jonathan Hj Baum; Beate Becker-Ziaja; Jan-Peter Boettcher; Mar Cabeza-Cabrerizo; Alvaro Camino-Sanchez; Lisa L Carter; Juiliane Doerrbecker; Theresa Enkirch; Isabel Graciela García Dorival; Nicole Hetzelt; Julia Hinzmann; Tobias Holm; Liana Eleni Kafetzopoulou; Michel Koropogui; Abigail Kosgey; Eeva Kuisma; Christopher H Logue; Antonio Mazzarelli; Sarah Meisel; Marc Mertens; Janine Michel; Didier Ngabo; Katja Nitzsche; Elisa Pallash; Livia Victoria Patrono; Jasmine Portmann; Johanna Gabriella Repits; Natasha Yasmin Rickett; Andrea Sachse; Katrin Singethan; Inês Vitoriano; Rahel L Yemanaberhan; Elsa G Zekeng; Racine Trina; Alexander Bello; Amadou Alpha Sall; Ousmane Faye; Oumar Faye; N'Faly Magassouba; Cecelia V Williams; Victoria Amburgey; Linda Winona; Emily Davis; Jon Gerlach; Franck Washington; Vanessa Monteil; Marine Jourdain; Marion Bererd; Alimou Camara; Hermann Somlare; Abdoulaye Camara; Marianne Gerard; Guillaume Bado; Bernard Baillet; Déborah Delaune; Koumpingnin Yacouba Nebie; Abdoulaye Diarra; Yacouba Savane; Raymond Bernard Pallawo; Giovanna Jaramillo Gutierrez; Natacha Milhano; Isabelle Roger; Christopher J Williams; Facinet Yattara; Kuiama Lewandowski; Jamie Taylor; Philip Rachwal; Daniel Turner; Georgios Pollakis; Julian A Hiscox; David A Matthews; Matthew K O'Shea; Andrew McD Johnston; Duncan Wilson; Emma Hutley; Erasmus Smit; Antonino Di Caro; Roman Woelfel; Kilian Stoecker; Erna Fleischmann; Martin Gabriel; Simon A Weller; Lamine Koivogui; Boubacar Diallo; Sakoba Keita; Andrew Rambaut; Pierre Formenty; Stephan Gunther; Miles W Carroll
Journal:  Nature       Date:  2016-02-03       Impact factor: 69.504

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.