Literature DB >> 23587306

Crowdsourcing genomic analyses of ash and ash dieback - power to the people.

Dan Maclean¹, Kentaro Yoshida, Anne Edwards, Lisa Crossman, Bernardo Clavijo, Matt Clark, David Swarbreck, Matthew Bashton, Patrick Chapman, Mark Gijzen, Mario Caccamo, Allan Downie, Sophien Kamoun, Diane Go Saunders.

Abstract

Ash dieback is a devastating fungal disease of ash trees that has swept across Europe and recently reached the UK. This emergent pathogen has received little study in the past and its effect threatens to overwhelm the ash population. In response to this we have produced some initial genomics datasets and taken the unusual step of releasing them to the scientific community for analysis without first performing our own. In this manner we hope to 'crowdsource' analyses and bring the expertise of the community to bear on this problem as quickly as possible. Our data has been released through our website at oadb.tsl.ac.uk and a public GitHub repository.

Entities: Chemical

Year: 2013 PMID： 23587306 PMCID： PMC3626535 DOI： 10.1186/2047-217X-2-2

Source DB: PubMed Journal: Gigascience ISSN： 2047-217X Impact factor: 6.524

Main text

oadb.tsl.ac.uk: A new resource for the crowdsourcing of genomic analyses on ash and ash dieback

Ash dieback is a devastating disease of ash trees caused by the aggressive fungal pathogen Chalara fraxinea. This fungus emerged in the early 1990s in Poland and has since spread west across Europe reaching native forests in the UK late last year. The emergence of Chalara in the UK caused public outcry where up to 90% of the more than 80 million ash trees are thought to be under threat. The disease, which is a newcomer to Britain, was first reported in the natural environment in October 2012 and has since been recorded in native woodland throughout the UK. There is no known treatment for ash dieback, current control measures include burning infected trees to try and prevent spread [1] and the implications for the UK environment and the economy remain stark. To kick start genomic analyses of the pathogen and host, we took the unconventional step of rapidly generating and releasing genomic sequence data. We released the data through our new ash and ash dieback website, oadb.tsl.ac.uk, which we launched in December 2012. Speed is essential in responses to rapidly appearing and threatening diseases and with this initiative we aim to make it possible for experts from around the world to access the data and analyse it immediately, speeding up the process of discovery. We hope that by providing data as soon as possible we will stimulate crowdsourcing and open community engagement to tackle this devastating pathogen.

The transcriptomics and genomics data we have released so far

We have generated and released Illumina sequence data of both the transcriptome and genome of Chalara and the transcriptome of infected and uninfected ash trees. We took the unusual first step of directly sequencing the “interaction transcriptome” [2] of a lesion dissected from an infected ash twig collected in the field. This enabled us to respond quickly, generating useful information without time-consuming standard laboratory culturing; the shortest route from the wood to the sequencer to the computer. The Chalara transcriptome data, generated at The Sainsbury Laboratory (TSL, Norwich, UK) was derived from two infected ash samples collected at Ashwellthorpe Lower Wood, near Norwich; the location of the first confirmed case of ash dieback in the wild in the UK. Here we extracted RNA from branches of two infected ash trees, prepared cDNA libraries from each and sequenced these to create 76 nt paired-end reads on our Illumina GAII. In parallel to the transcriptome data, genome sequence data were produced in a coordinated effort between The John Innes Centre (JIC), TSL and The Genome Analysis Centre (TGAC) in Norwich. A single C. fraxinea isolate was cultured from infected tissue found in Kenninghall Wood. Genomic DNA libraries were constructed and sequenced on an Illumina MiSeq sequencer as 150 nt and 250 nt paired-end libraries. As soon as these datasets were generated we released them through oadb.tsl.ac.uk. We took the unusual step to release the data before preliminary analysis had been undertaken so that we might take advantage of the huge range of expertise and knowledge available outside our groups, and thereby make the best of the data as quickly as possible via a crowdsourcing approach.

Crowdsourcing: bringing the power of many, marshalling expertise and democratising genomics

Crowdsourcing is a form of massively parallel collaboration, the main distinguishing feature of which is the low overhead to entry of participation and low level of investment from a participant. The power is in the sheer number of people interested in seeing the goals of the project achieved. Scientists have not been slow to adopt these models to carry out work that could not be automated successfully and require human intelligence and expertise. Recently genomic scientists have made inroads to leveraging the power of crowds to annotate and assemble the genome sequence of a novel strain of Escherichia coli O104:H4 bacteria that caused a serious outbreak of foodborne illness in northern Germany in spring 2011. These scientists were able to quickly link up with others across the world with similar skills to rapidly analyse the novel pathogenic strain [3]. Most importantly, crowdsourcing allows for a new form of potentially effective live peer-review, many sets of eyes interrogating and reviewing data and analyses mean that unusual results are quickly highlighted and can be assessed and dealt with appropriately. Whether they are eventually found to be inconsistencies in analysis or more exciting genuine new discoveries, the end product is brought to the scientific community many times faster than the usual peer-review by a small number of reviewers and crucially it all happens out in the open with maximum transparency. The cornerstone of our crowdsourcing is our repository on GitHub [4], a versioning system designed for collaboration in software development that automatically maintains attribution of contribution, meaning that whoever contributes will get full credit for the difference that they made. We are certain that the data will prove useful to anyone who wishes to be involved in the fightback against ash dieback and that concerted, early data-sharing and open analysis is a crucial step in a productive and timely response to emergent pathogen threats.

The future of our data and our initiative

To date, genome analysis of emerging plant pathogens is not rapidly implemented as is routinely done with human pathogens [5,6]. Worse, the data (when available) is not immediately released into the public domain. We hope our openness will encourage the scientific community to engage in this proactive and collaborative model of working when faced with pressing challenges. Already we are seeing a significant amount of work being provided by external groups. Contributions of transcriptome assemblies, protein domain annotations, phylogenetic trees and BLASTs for specific gene family members have been provided from groups across the world.

Credit where credit is due

We absolutely understand the need for scientists to be credited for what they do and we intend to make sure that everyone who contributes receives full attribution. The GitHub repository ensures this, and we are committed to the principle for all other potential results from this initiative. The altmetrics movement is making it possible and acceptable for scientists to cite the varied products of science [7], rather than simply the papers they write and we intend to make it as easy as possible for contributors to be able to cite what they did via commit number and potentially DOIs.

Towards a rapid response for food and ecosystem security

A pathogenic threat to our forests and ecosystems is a threat to our ability to live on the planet sustainably, just as a threat to our crops is a threat to our ability to feed ourselves. In these situations it is vital to respond as quickly as possible so we must embrace the evolution of a new digital immune system [8]. Our initiative is an early step towards developing the crucial function of the digital immune system for response to plant pathogens; the thing we cannot upload to a repository is the people with the expertise and the will to contribute, and that is why we need the scientific community to download our data and provide analyses. Our website and repository can be found at: http://oadb.tsl.ac.uk https://github.com/ash-dieback-crowdsource/data

Abbreviations

DOI: Digital Object Identifier; JIC: The John Innes Centre; Nt: Nucleotide; TGAC: The Genome Analysis Centre; TSL: The Sainsbury Laboratory.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

All authors contributed to the drafting of the manuscript. DM created the oadb website, designed and instantiated the GitHub repository and wrote the commentary, AE and AD sourced biological materials for sequencing, DGOS, KY and SK prepared biological materials, managed sequencing and performed analyses and contributed data to the repository. LC, BC, DS, M Clarke, PC and MG and MC provided analyses of data in the repository. All authors read and approved the final manuscript.

3 in total

1. Altmetrics: Value all research products.

Authors: Heather Piwowar
Journal: Nature Date: 2013-01-10 Impact factor: 49.962

2. Comparative genome analysis provides insights into the evolution and adaptation of Pseudomonas syringae pv. aesculi on Aesculus hippocastanum.

Authors: Sarah Green; David J Studholme; Bridget E Laue; Federico Dorati; Helen Lovell; Dawn Arnold; Joan E Cottrell; Stephen Bridgett; Mark Blaxter; Edgar Huitema; Richard Thwaites; Paul M Sharp; Robert W Jackson; Sophien Kamoun
Journal: PLoS One Date: 2010-04-19 Impact factor: 3.240

3. The rise of a digital immune system.

Authors: Michael C Schatz; Adam M Phillippy
Journal: Gigascience Date: 2012-07-12 Impact factor: 6.524

3 in total

10 in total

1. Foundational and Translational Research Opportunities to Improve Plant Health.

Authors: Richard Michelmore; Gitta Coaker; Rebecca Bart; Gwyn Beattie; Andrew Bent; Toby Bruce; Duncan Cameron; Jeffery Dangl; Savithramma Dinesh-Kumar; Rob Edwards; Sebastian Eves-van den Akker; Walter Gassmann; Jean T Greenberg; Linda Hanley-Bowdoin; Richard J Harrison; Jagger Harvey; Ping He; Alisa Huffaker; Scot Hulbert; Roger Innes; Jonathan D G Jones; Isgouhi Kaloshian; Sophien Kamoun; Fumiaki Katagiri; Jan Leach; Wenbo Ma; John McDowell; June Medford; Blake Meyers; Rebecca Nelson; Richard Oliver; Yiping Qi; Diane Saunders; Michael Shaw; Christine Smart; Prasanta Subudhi; Lesley Torrance; Bret Tyler; Barbara Valent; John Walsh
Journal: Mol Plant Microbe Interact Date: 2017-06-12 Impact factor: 4.171

2. Out of the woods. Ash dieback and the future of emergent pathogenomics.

Authors: Dan MacLean
Journal: Mol Plant Pathol Date: 2014-01 Impact factor: 5.663

Review 3. Hymenoscyphus pseudoalbidus, the causal agent of European ash dieback.

Authors: Andrin Gross; Ottmar Holdenrieder; Marco Pautasso; Valentin Queloz; Thomas Niklaus Sieber
Journal: Mol Plant Pathol Date: 2013-10-07 Impact factor: 5.663

4. Crowdsourcing the corpasome.

Authors: Manuel Corpas
Journal: Source Code Biol Med Date: 2013-06-21

5. Draft genome sequence of Marssonina coronaria, causal agent of apple blotch, and comparisons with the Marssonina brunnea and Marssonina rosae genomes.

Authors: Qiang Cheng; Junxiang Chen; Lijuan Zhao
Journal: PLoS One Date: 2021-02-05 Impact factor: 3.240

6. Bacterial Microbiota of Field-Collected Helicoverpa zea (Lepidoptera: Noctuidae) from Transgenic Bt and Non-Bt Cotton.

Authors: Jean M Deguenon; Anirudh Dhammi; Loganathan Ponnusamy; Nicholas V Travanty; Grayson Cave; Roger Lawrie; Dan Mott; Dominic Reisig; Ryan Kurtz; R Michael Roe
Journal: Microorganisms Date: 2021-04-20

7. Lessons from Fraxinus, a crowd-sourced citizen science game in genomics.

Authors: Ghanasyam Rallapalli; Diane Go Saunders; Kentaro Yoshida; Anne Edwards; Carlos A Lugo; Steve Collin; Bernardo Clavijo; Manuel Corpas; David Swarbreck; Matthew Clark; J Allan Downie; Sophien Kamoun; Dan MacLean
Journal: Elife Date: 2015-07-29 Impact factor: 8.140

8. In search of solutions to grapevine trunk diseases through "crowd-sourced" science.

Authors: Karen L Block; Philippe E Rolshausen; Dario Cantu
Journal: Front Plant Sci Date: 2013-10-02 Impact factor: 5.753

9. Identifying the science and technology dimensions of emerging public policy issues through horizon scanning.

Authors: Miles Parker; Andrew Acland; Harry J Armstrong; Jim R Bellingham; Jessica Bland; Helen C Bodmer; Simon Burall; Sarah Castell; Jason Chilvers; David D Cleevely; David Cope; Lucia Costanzo; James A Dolan; Robert Doubleday; Wai Yi Feng; H Charles J Godfray; David A Good; Jonathan Grant; Nick Green; Arnoud J Groen; Tim T Guilliams; Sunjai Gupta; Amanda C Hall; Adam Heathfield; Ulrike Hotopp; Gary Kass; Tim Leeder; Fiona A Lickorish; Leila M Lueshi; Chris Magee; Tiago Mata; Tony McBride; Natasha McCarthy; Alan Mercer; Ross Neilson; Jackie Ouchikh; Edward J Oughton; David Oxenham; Helen Pallett; James Palmer; Jeff Patmore; Judith Petts; Jan Pinkerton; Richard Ploszek; Alan Pratt; Sophie A Rocks; Neil Stansfield; Elizabeth Surkovic; Christopher P Tyler; Andrew R Watkinson; Jonny Wentworth; Rebecca Willis; Patrick K A Wollner; Kim Worts; William J Sutherland
Journal: PLoS One Date: 2014-05-30 Impact factor: 3.240

10. Community engagement.

Authors:
Journal: Nat Rev Microbiol Date: 2013-04 Impact factor: 60.633

10 in total