Literature DB >> 23055966

Open is Not Enough. Let's Take the Next Step: An Integrated, Community-Driven Computing Platform for Neuroscience.

Yaroslav O Halchenko1, Michael Hanke.   

Abstract

Entities:  

Year:  2012        PMID: 23055966      PMCID: PMC3458431          DOI: 10.3389/fninf.2012.00022

Source DB:  PubMed          Journal:  Front Neuroinform        ISSN: 1662-5196            Impact factor:   4.081


× No keyword cloud information.
The last 5 years have seen dramatic improvements in the collaborative research infrastructure. A need for open research tools has been identified (Ince et al., 2012), and one solution has been clearing houses, such as the INCF Software Center1, and the NITRC portal, which facilitate efforts of peer-to-peer software and data sharing that were previously limited to only well-funded formal consortia (see, Poline et al., 2012, for a recent summary of the status quo). However, collecting these resources into a centralized clearing-house addresses only one necessary aspect on the way to a sustainable software ecosystem for neuroscience – availability. Unfortunately it does not ensure ease of deployment, nor does it offer a sustainable model for long-term maintenance. At the same time, the development model of many neuroscience research software projects is broken. Inefficient and opaque procedures combined with a scarce developer workforce result in tools of insufficient quality and robustness that we rely on to conduct our research. Moreover, as the scientists, students, and research groups responsible for these tools move on to new tasks, their software is often left in a state of limbo, with no continued support for bug fixes or sufficiently coordinated maintenance. Over time, changes in underlying computing environments break the tools completely, and they commonly become abandoned – with costly consequences for the scientists depending upon them. To address this problem, we need to bring our tools further into the open, and consolidate development efforts on an open and community-driven platform – one that is capable of providing easy access, installation, and maintenance for any research software. Such effort will not only help to improve aspects of software engineering, but also meet many unfulfilled requirements toward the goal of practical open science. To the best of our knowledge, NeuroDebian is the most comprehensive attempt at improving the neuroscience software ecosystem. It applies proven principles and procedures from Free and Open-Source Software (FOSS) development to the maintenance and deployment of neuroscience software. NeuroDebian's strategy is to work with scientists and developers, helping them to directly integrate their software with the Debian operating system (OS). Using the Debian OS as a foundation for these efforts offers five unique advantages for end-users and developers alike. Installing and upgrading software takes only minutes – as a result of using the included software package management tools – regardless of application complexity. Knowledge of custom installation procedures is simply not needed. A recent survey confirms that researchers who use software package management systems spend less time on system administration and more time on scientifically relevant tasks (Hanke and Halchenko, 2011). Strict open standards – for included software, as defined in Debian's policy (Jackson et al., 2012), help yield the maximum interoperability of all components without sacrificing robustness (Fortuna et al., 2011) or creating unnecessary burdens. This has enabled developers to make routine releases of the Debian OS for the past 20 years, which is accomplished by a group of loosely connected individuals working together over the internet – not unlike collaborating scientists. Applying these established open standards to neuroscience software improves the longevity and robustness of open science. Debian offers more than 29,000 readily usable pieces of software – making it the largest archive of its kind – especially for scientific purposes. This wide selection is a priceless resource not only for using, but also for building atop these libraries and applications from a variety of fields of endeavor. The joint maintenance of shared components reduces the overall maintenance cost, (Ghosh, 2006) and makes the integration of special-interest neuroscience software both more time- and effort-effective. Debian is open to anyone – Anyone is free to contribute anything to Debian within the limits set by its Social Contract (Perens et al., 1997). Debian is a well established self-governing project (O'Mahony and Ferraro, 2007) and is not controlled by any external entity that might impose a marketing strategy or decide what software ought to be eligible for inclusion into the system. Although the project is primarily focused on the development of an operating system built entirely from FOSS, the Social Contract explicitly acknowledges the necessity to also support restrictively licensed and proprietary closed-source solutions. Consequently, Debian provides all required tools and infrastructure to build and distribute non-free software as well. Debian's “do-ocracy” principle puts the person(s) contributing completely in charge of the details of their contribution. This aspect allows for integration of any software directly into the Debian system, while also being able to rely on its existing infrastructure (bug tracking system, software repositories with more than a hundred mirrors worldwide, etc.). As a result, this largely abolishes the need for a research-specific software distribution infrastructure. Standardization of binary and source distributions promotes reproducibility – both for troubleshooting and results. Additionally, standardized source packages encourage contributions by other developers – even ones who may be unfamiliar with the specific purpose of a piece of software. This helps to mitigate the shortage of man-power for research software maintenance. We believe that these advantages make the Debian software ecosystem an ideal environment for consolidating neuroscience research software. The Debian OS has been ported to numerous hardware architectures and runs natively on any hardware relevant for neuroscience research. Additionally, through means of virtualization an entire system can be deployed in cloud-computing scenarios or run alongside commercial OS's (e.g., OS X and MS Windows), thus making neuroscience software universally available. For the past 6 years, NeuroDebian has consistently proven the validity of these points, especially regarding the efficiency of the Debian platform. NeuroDebian is largely the result of two researchers working in their spare time, with the help of a handful of additional volunteers. Despite never having received direct institutional funding and having only limited man-power, today NeuroDebian software packages are used by thousands of researchers in hundreds of labs world-wide – with a current growth of about 20 new subscriptions each day (Figure 1). What had originally started as a small project to deploy neuroimaging-related software for personal convenience, now provides software for many related disciplines – electrophysiology, neural modeling, psychophysics, and distributed computing – thus enhancing its utility for multi-modal or multi-disciplinary projects. Over 50 packages have successfully migrated from NeuroDebian into Debian, which are now officially part of the Debian OS and its more than 100 derived GNU/Linux distributions, such as Ubuntu (developed by Canonical Ltd.). By virtue of collaboration with other related efforts inside the Debian project (Debian Med and Debian Science), Debian offers an impressive set of several hundred special-purpose tools of relevance for neuroscience research computing environments. As a result, the 2011 release of Debian 6.0 became the first OS to offer comprehensive support for neuroimaging research. Moreover, the upcoming release of Debian 7.0 will nearly double the number of neuroscience-related packages already included in Debian.
Figure 1

NeuroDebian repository access statistics from March 2009 to April 2012. (A) Geographic distribution of NeuroDebian repository accesses, aggregated by city or region as determined by GeoIP. Where no region-specific GeoIP information is available, statistics are placed into the center of the respective country. The color of each circle depicts the total number of package downloads and its size corresponds to the number of IP addresses which accessed the repository from that region. Push-pin icons indicate the location of NeuroDebian repository mirrors. (B) Daily statistic. The blue curve (# of update requests) shows the number of IP-address/OS-release combinations accessing the repository on any given day. This number should well approximate the lower bound of current “subscribers” of the repository. The red curve corresponds to the number of daily binary package downloads.

NeuroDebian repository access statistics from March 2009 to April 2012. (A) Geographic distribution of NeuroDebian repository accesses, aggregated by city or region as determined by GeoIP. Where no region-specific GeoIP information is available, statistics are placed into the center of the respective country. The color of each circle depicts the total number of package downloads and its size corresponds to the number of IP addresses which accessed the repository from that region. Push-pin icons indicate the location of NeuroDebian repository mirrors. (B) Daily statistic. The blue curve (# of update requests) shows the number of IP-address/OS-release combinations accessing the repository on any given day. This number should well approximate the lower bound of current “subscribers” of the repository. The red curve corresponds to the number of daily binary package downloads. To complement these “official” packages, NeuroDebian provides a large number of additional software packages in a supplementary repository. This includes software which is still in the integration process, as well as “backports” of recent research software releases for older versions of Debian and Ubuntu – addressing researchers’ desire to combine the latest software technology with a stable computing platform. The NeuroDebian repository network (eight community-provided mirrors in six countries; see Figure 1) also distributes various data packages that are frequently used in research or teaching activities, such as stereotaxic atlases and example datasets. Apart from individual packages, NeuroDebian offers a complete Virtual Machine (VM) that can be used on any major operating system. VMs are a popular approach for streamlining the deployment of complex research software on less science-savvy platforms (Hanke and Halchenko, 2011), and to facilitate collaborative work through the availability of a common computing environment (see the strategy of the 2nd winner of “The Executable Paper Grand Challenge” by Elsevier, Gorp and Mazanek, 2011). Instead of being tuned for a single tool, such as VMs offered by individual research groups for their own particular software, the NeuroDebian VM is designed to offer a multitude of integrated research software. The NeuroDebian VM has been downloaded more than 2900 times, and is frequently used for educational workshops. Students are provided with a fully functional computing environment, which they can explore and take home alongside other course materials after the workshop or seminar has ended. With such versatility, NeuroDebian offers a reliable foundation for day-to-day research activities. One such indicator for the robustness of the (Neuro)Debian system can be seen from the growing number of derivative works that are being built upon it. To our knowledge, NeuroDebian serves as the basis for several virtual appliances (e.g., XNAT; Marcus et al., 2007), the Lin4Neuro distribution (Nemoto et al., 2011), as well as the NITRC Computational Environment. NeuroDebian has shown that by adopting established tools and workflows it is possible to create a powerful integrated environment with minimal investment – offering a practical solution that is compliant with all proposed requirements for transparency of research software (Morin et al., 2012). Our vision for the future of neuroscience research software is one where the whole scientific community fully embraces the principles of the FOSS movement. This will accelerate the dissemination of new software technologies, foster efforts to improve software quality, and facilitate reproducible research – eventually leading to a universally available, truly comprehensive, and sustainable integrated computing platform. To achieve this goal, it is necessary to distribute the work in a way that does not represent an additional and unnecessary burden for the individual scientist: (1) Developers of research software need to encourage and facilitate external contributions. This requires public access to all of the standard tools of a typical FOSS development workflow, such as a version control system, bug tracker, and mailing list. Software needs to be released under a standard FOSS license in order to avoid wasteful reimplementation efforts caused by license incompatibilities. Efforts to integrate individual tools into larger analysis suites or software distributions must be promoted, as increased exposure by frequent re-use (instead of duplication) helps to improve software quality and minimize the “cost.” (2) Users of research software need to offer as much feedback as possible. Software authors need to know how, and how often, their tools are being used in order to be able to obtain sufficient funding for future development and maintenance. Knowledge of typical usage patterns is also critical to decide which functionality is most important and where development resources should be focused. There are tools to report anonymous usage statistics automatically (e.g., Debian popularity contest), abolishing the need for time-consuming manual actions. When software defects are discovered, they must be reported immediately. Reports need to be directed to a public forum to allow other researchers to check for the status of a particular defect and coordinate the efforts to fix it. Engagement with end-users will help guarantee the robust and correct operation of their computing research environments and thus improve the quality and productivity of research. While these actions only require very little time from an individual researcher, their combined effect when adopted by a community can be massive. The breath-taking development pace of the NiPy and NeuralEnsemble communities are flagship examples of the power and potential of what can be achieved when the proposed principles are put into effect. As a global community of scientists, it is now our chance to take the next step toward an accessible open science. Together we can create an integrated computing platform that we all freely share, to exchange data and ideas, implemented as software, that we all maintain collaboratively. It is unlikely that any single project with a classical funding scheme will ever accomplish what is needed. The tools are there – we just need to do it.
  7 in total

1.  Evolution of a modular software network.

Authors:  Miguel A Fortuna; Juan A Bonachela; Simon A Levin
Journal:  Proc Natl Acad Sci U S A       Date:  2011-11-21       Impact factor: 11.205

2.  Research priorities. Shining light into black boxes.

Authors:  A Morin; J Urban; P D Adams; I Foster; A Sali; D Baker; P Sliz
Journal:  Science       Date:  2012-04-13       Impact factor: 47.728

3.  The case for open computer programs.

Authors:  Darrel C Ince; Leslie Hatton; John Graham-Cumming
Journal:  Nature       Date:  2012-02-22       Impact factor: 49.962

4.  The Extensible Neuroimaging Archive Toolkit: an informatics platform for managing, exploring, and sharing neuroimaging data.

Authors:  Daniel S Marcus; Timothy R Olsen; Mohana Ramaratnam; Randy L Buckner
Journal:  Neuroinformatics       Date:  2007

5.  Data sharing in neuroimaging research.

Authors:  Jean-Baptiste Poline; Janis L Breeze; Satrajit Ghosh; Krzysztof Gorgolewski; Yaroslav O Halchenko; Michael Hanke; Christian Haselgrove; Karl G Helmer; David B Keator; Daniel S Marcus; Russell A Poldrack; Yannick Schwartz; John Ashburner; David N Kennedy
Journal:  Front Neuroinform       Date:  2012-04-05       Impact factor: 4.081

6.  Lin4Neuro: a customized Linux distribution ready for neuroimaging analysis.

Authors:  Kiyotaka Nemoto; Ippeita Dan; Christopher Rorden; Takashi Ohnishi; Daisuke Tsuzuki; Masako Okamoto; Fumio Yamashita; Takashi Asada
Journal:  BMC Med Imaging       Date:  2011-01-25       Impact factor: 1.930

7.  Neuroscience Runs on GNU/Linux.

Authors:  Michael Hanke; Yaroslav O Halchenko
Journal:  Front Neuroinform       Date:  2011-07-07       Impact factor: 4.081

  7 in total
  45 in total

1.  scikit-image: image processing in Python.

Authors:  Stéfan van der Walt; Johannes L Schönberger; Juan Nunez-Iglesias; François Boulogne; Joshua D Warner; Neil Yager; Emmanuelle Gouillart; Tony Yu
Journal:  PeerJ       Date:  2014-06-19       Impact factor: 2.984

2.  The NITRC image repository.

Authors:  David N Kennedy; Christian Haselgrove; Jon Riehl; Nina Preuss; Robert Buccigrossi
Journal:  Neuroimage       Date:  2015-06-02       Impact factor: 6.556

3.  Feedback contribution to surface motion perception in the human early visual cortex.

Authors:  Ingo Marquardt; Peter De Weerd; Marian Schneider; Omer Faruk Gulban; Dimo Ivanov; Yawen Wang; Kâmil Uludağ
Journal:  Elife       Date:  2020-06-04       Impact factor: 8.140

4.  A high-resolution 7-Tesla fMRI dataset from complex natural stimulation with an audio movie.

Authors:  Michael Hanke; Florian J Baumgartner; Pierre Ibe; Falko R Kaule; Stefan Pollmann; Oliver Speck; Wolf Zinke; Jörg Stadler
Journal:  Sci Data       Date:  2014-05-27       Impact factor: 6.444

5.  Neurophysiological analytics for all! Free open-source software tools for documenting, analyzing, visualizing, and sharing using electronic notebooks.

Authors:  David M Rosenberg; Charles C Horn
Journal:  J Neurophysiol       Date:  2016-04-20       Impact factor: 2.714

6.  The open diffusion data derivatives, brain data upcycling via integrated publishing of derivatives and reproducible open cloud services.

Authors:  Paolo Avesani; Brent McPherson; Soichi Hayashi; Cesar F Caiafa; Robert Henschel; Eleftherios Garyfallidis; Lindsey Kitchell; Daniel Bullock; Andrew Patterson; Emanuele Olivetti; Olaf Sporns; Andrew J Saykin; Lei Wang; Ivo Dinov; David Hancock; Bradley Caron; Yiming Qian; Franco Pestilli
Journal:  Sci Data       Date:  2019-05-23       Impact factor: 6.444

7.  BIDS apps: Improving ease of use, accessibility, and reproducibility of neuroimaging data analysis methods.

Authors:  Krzysztof J Gorgolewski; Fidel Alfaro-Almagro; Tibor Auer; Pierre Bellec; Mihai Capotă; M Mallar Chakravarty; Nathan W Churchill; Alexander Li Cohen; R Cameron Craddock; Gabriel A Devenyi; Anders Eklund; Oscar Esteban; Guillaume Flandin; Satrajit S Ghosh; J Swaroop Guntupalli; Mark Jenkinson; Anisha Keshavan; Gregory Kiar; Franziskus Liem; Pradeep Reddy Raamana; David Raffelt; Christopher J Steele; Pierre-Olivier Quirion; Robert E Smith; Stephen C Strother; Gaël Varoquaux; Yida Wang; Tal Yarkoni; Russell A Poldrack
Journal:  PLoS Comput Biol       Date:  2017-03-09       Impact factor: 4.475

8.  NAPR: a Cloud-Based Framework for Neuroanatomical Age Prediction.

Authors:  Heath R Pardoe; Ruben Kuzniecky
Journal:  Neuroinformatics       Date:  2018-01

9.  How the Human Brain Represents Perceived Dangerousness or "Predacity" of Animals.

Authors:  Andrew C Connolly; Long Sha; J Swaroop Guntupalli; Nikolaas Oosterhof; Yaroslav O Halchenko; Samuel A Nastase; Matteo Visconti di Oleggio Castello; Hervé Abdi; Barbara C Jobst; M Ida Gobbini; James V Haxby
Journal:  J Neurosci       Date:  2016-05-11       Impact factor: 6.167

10.  A Modality-Independent Network Underlies the Retrieval of Large-Scale Spatial Environments in the Human Brain.

Authors:  Derek J Huffman; Arne D Ekstrom
Journal:  Neuron       Date:  2019-09-17       Impact factor: 17.173

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.