Literature DB >> 21304642

Extending Standards for Genomics and Metagenomics Data: A Research Coordination Network for the Genomic Standards Consortium (RCN4GSC).

John C Wooley, Dawn Field, Frank-Oliver Glöckner.   

Abstract

Through a newly established Research Coordination Network for the Genomic Standards Consortium (RCN4GSC), the GSC will continue its leadership in establishing and integrating genomic standards through community-based efforts. These efforts, undertaken in the context of genomic and metagenomic research aim to ensure the electronic capture of all genomic data and to facilitate the achievement of a community consensus around collecting and managing relevant contextual information connected to the sequence data. The GSC operates as an open, inclusive organization, welcoming inspired biologists with a commitment to community service. Within the collaborative framework of the ongoing, international activities of the GSC, the RCN will expand the range of research domains engaged in these standardization efforts and sustain scientific networking to encourage active participation by the broader community. The RCN4GSC, funded for five years by the US National Science Foundation, will primarily support outcome-focused working meetings and the exchange of early-career scientists between GSC research groups in order to advance key standards contributions such as GCDML. Focusing on the timely delivery of the extant GSC core projects, the RCN will also extend the pioneering efforts of the GSC to engage researchers active in developing ecological, environmental and biodiversity data standards. As the initial goals of the GSC are increasingly achieved, promoting the comprehensive use of effective standards will be essential to ensure the effective use of sequence and associated data, to provide access for all biologists to all of the information, and to create interdisciplinary opportunities for discovery. The RCN will facilitate these implementation activities through participation in major scientific conferences and presentations on scientific advances enabled by community usage of genomic standards.

Entities:  

Year:  2009        PMID: 21304642      PMCID: PMC3035207          DOI: 10.4056/sigs.26218

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

The sustained, rapid completion of whole genome sequences has transformed biological research for the entire breadth of scales from molecular to population and ecosystem level studies. Ensuring effective use of genomic and metagenomic data requires a community-driven process that initially establishes standardized mechanisms supporting acquisition, electronic capture and communication of all genomic and metagenomic data and subsequently, that enables the willingness and commitment for full participation in their usage. Implementation of such goals is also essential to ensure effective use of these data and equitable access by all biologists to all of this information; in turn, the availability of comprehensive, standards-driven information resources will create interdisciplinary opportunities for discovery. The Genomic Standards Consortium (GSC) pioneered and has promoted the establishment of such community based, consensus building efforts. To engage the entire community, deliver high quality products and enable experimentalists and information resource providers to appreciate the value of consistent usage of the standards, GSC operates as an open organization. To build upon the initial successes, expand its international participation, and include additional research communities, the Genomic Standards Consortium (GSC) has established a Research Coordination Network (RCN) under US National Science Foundation funding. The RCN, termed the Research Coordination Network for the Genomic Standards Consortium (RCN4GSC), will further the work of the GSC in promoting and integrating standards describing genomic and metagenomic data sets within the international community. Research Coordination Network awards are a unique mechanism of the National Science Foundation to build community efforts that are of central importance for 21st Century biology. The creation of this RCN for the GSC will ensure a worldwide, community driven process for establishing standardized mechanisms for the electronic capture of genomic data and for obtaining willingness to participate. Achieving this goal is essential to ensure effective use of the sequence and associated data, to provide access for all biologists to all of the information, and to create interdisciplinary opportunities for discovery. In the process of sustaining and extending the pioneering efforts of the GSC, the RCN will primarily provide five years of funding (2009-2013) to support small collaborative workshops aimed at advancing core objectives and processes for standards and the exchange of early-career scientists between GSC research groups. The exchanges will allow rapid progress on technical deliverables of the GSC such as GCDML.

Organization

The RCN will be led through its Steering Committee by John C. Wooley, UCSD (PI); Dawn Field, CEH Oxford; Frank Oliver Glöckner, MPI-Bremen; George Garrity, MSU; Nikos Kyrpides, JGI; Karen Nelson, JCVI; Owen White, University of Maryland, as drawn from the GSC Board. Other RCN members responsible for leading core projects and activities include James Cole, MSU; Peter Dawyndt, University of Ghent; Renzo Kottmann, MPI-Bremen; Lynette Hirschman, MITRE; Victor Markowitz, LBNL; Inigo San Gil, UNM; and Lynn Schriml, University of Maryland. Other steering committee members and core leaders will be selected to cover the expanded efforts in ecological, environmental and biodiversity data.

Goals of the RCN4GSC

Expanding and fully incorporating early consensus building activities and the institutional commitments inspired and implemented by the GSC, the creation of the RCN4GSC will ensure continued worldwide, community-driven capacity to establish standardized mechanisms for the electronic capture of genomic data and for obtaining willingness to participate. The RCN4GSC, using open access solutions and engaged with other international working bodies, will largely focus on a set of already defined GSC ‘core projects’ as well as provide the community with a forum in which to propose and develop future consensus-drive activities when appropriate. Specifically, as an arm of the GSC, this RCN will focus extending of the “Minimum Information about a (Meta) Genome Sequence” (MIGS/MIMS) specification and the GSC Genome Catalogue [1], the Genomic Contextual Data Markup Language (GCDML) [2] the Genomic Rosetta Stone [3], Habitat-Lite (based on the Environment Ontology) [4] and the Standards in Genomic Sciences eJournal [5] which will contain, among other types of articles, genome and metagenome notes as well as Standard Operating Procedures (SOPs) [6]. The RCN will also further extend the genomic standards effort to engage the developers of data standards for ecological, environmental and biodiversity data. Such efforts will work to ensure that extensive analyses can readily be done on all genome and metagenome data sets. Implementing this RCN will allow the standardization effort to partner closely with an inclusive set of major stakeholders and above all, with the community as a whole, to lead to the best product and ensure community support. This RCN, thus, addresses an important goal of 21st century biology; namely, establishing an integrative biology across the vast scales of time, space and complexity of life.

Promotion of Inclusive Participation

The GSC has always been an open organization and through this RCN, hopes to widen international participation in its core activities. The GSC and its RCN reflect the democratization of access, provided in principle by the internet, that can be enabled by the establishment of and compliance to standards that ensure all genomic data is available to the vast diversity of biologists. Thus, biologists with enthusiasm and a commitment to community service are always welcome to join and encouraged to participate actively. Coordination with professional societies and communication to federal and foundation stake-holders will be among strategies for expanding participation, as will increasing participation from Asia. Similarly, the RCN will allow the GSC members to expand community networking to enlist new participants and inform federal and foundation stakeholders, as well as to reach more experimentalists through numerous contributions at major meetings. The contributions will focus on attributes like standards for metagenomics, metadata and meta-analysis, including brief updates and presentations of specific research advances enabled by the use of the standards. These venues for expanding participation will be managed by the Communications Committee, who will focus on enlisting further diversity in participation and in reaching out to beginning biologists. We will also ramp up efforts to coordinate with large sequencing centers and knowledge resources, both of which are represented on the RCN to share their experiences and sustain a dialogue around ongoing advances.

Major Networking Activities

The RCN will sustain international GSC meetings and small group focused working meetings. Intense collaborations around the maturation of GCDML, the Genome Rosetta Stone and the Genome Catalogue will be an early focus. Productivity will be sustained year round by routine audio-video and web-enabled interactions, including small working group sessions and collaborations via the RCN's extant web hub at http://gensc.org. Experts in both genomics and computing will collaborate with domain specialists to extend GCDML and be sure its development is consistent with biological science expectations in general. These activities will also expand ongoing coordination required for implementation of all the GSC’s core projects, of all the GSC’s core projects, in particular in the context of the international standards community [7,8]. At the same time, the RCN will help the GSC to sustain and build new interactions outside the domain of molecular research. An important early activity will be further development of a two-way exchange between the NSF’s Long Term Ecological Research (LTER) Network and the GSC [9]. Other exchanges, including those with environmental and biodiversity data efforts as well as specific microbial and metagenomic efforts, will follow. It is critical that the GSC’s efforts evolve in close association with efforts to develop ecological data standards, such as Ecological Metadata Language (EML), biodiversity standards, such as Darwin Core, and environmental research programs, such as the Global Lake Ecological Observatory Network. For effective use of funds, smaller working group sessions will also occur at major professional society meetings when possible. The “e science” nature of this RCN also means that considerable progress can be made throughout the year via document sharing on wiki and blog environments for community input and continuous dialogue.
  9 in total

1.  Defining linkages between the GSC and NSF's LTER program: how the Ecological Metadata Language (EML) relates to GCDML and other outcomes.

Authors:  Inigo San Gil; Wade Sheldon; Tom Schmidt; Mark Servilla; Raul Aguilar; Corinna Gries; Tanya Gray; Dawn Field; James Cole; Jerry Yun Pan; Giri Palanisamy; Donald Henshaw; Margaret O'Brien; Linda Kinkel; Katherine McMahon; Renzo Kottmann; Linda Amaral-Zettler; John Hobbie; Philip Goldstein; Robert P Guralnick; James Brunt; William K Michener
Journal:  OMICS       Date:  2008-06

2.  Toward a standards-compliant genomic and metagenomic publication record.

Authors:  George M Garrity; Dawn Field; Nikos Kyrpides; Lynette Hirschman; Susanna-Assunta Sansone; Samuel Angiuoli; James R Cole; Frank Oliver Glöckner; Eugene Kolker; George Kowalchuk; Mary Ann Moran; Dave Ussery; Owen White
Journal:  OMICS       Date:  2008-06

3.  Meeting report: the fifth Genomic Standards Consortium (GSC) workshop.

Authors:  Dawn Field; George M Garrity; Susanna-Assunta Sansone; Peter Sterk; Tanya Gray; Nikos Kyrpides; Lynette Hirschman; Frank Oliver Glöckner; Renzo Kottmann; Sam Angiuoli; Owen White; Peter Dawyndt; Nick Thomson; Inigo San Gil; Norman Morrison; Tatiana Tatusova; Ilene Mizrachi; Robert Vaughan; Guy Cochrane; Leonid Kagan; Sean Murphy; Lynn Schriml
Journal:  OMICS       Date:  2008-06

4.  The first RSBI (ISA-TAB) workshop: "can a simple format work for complex studies?".

Authors:  Susanna-Assunta Sansone; Philippe Rocca-Serra; Marco Brandizi; Alvis Brazma; Dawn Field; Jennifer Fostel; Andrew G Garrow; Jack Gilbert; Federico Goodsaid; Nigel Hardy; Phil Jones; Allyson Lister; Michael Miller; Norman Morrison; Tim Rayner; Nataliya Sklyar; Chris Taylor; Weida Tong; Guy Warner; Stefan Wiemann
Journal:  OMICS       Date:  2008-06

5.  Toward an online repository of Standard Operating Procedures (SOPs) for (meta)genomic annotation.

Authors:  Samuel V Angiuoli; Aaron Gussman; William Klimke; Guy Cochrane; Dawn Field; George Garrity; Chinnappa D Kodira; Nikos Kyrpides; Ramana Madupu; Victor Markowitz; Tatiana Tatusova; Nick Thomson; Owen White
Journal:  OMICS       Date:  2008-06

6.  Habitat-Lite: a GSC case study based on free text terms for environmental metadata.

Authors:  Lynette Hirschman; Cheryl Clark; K Bretonnel Cohen; Scott Mardis; Joanne Luciano; Renzo Kottmann; James Cole; Victor Markowitz; Nikos Kyrpides; Norman Morrison; Lynn M Schriml; Dawn Field
Journal:  OMICS       Date:  2008-06

7.  Laying the foundation for a Genomic Rosetta Stone: creating information hubs through the use of consensus identifiers.

Authors:  Bart Van Brabant; Tanya Gray; Bert Verslyppe; Nikos Kyrpides; Karin Dietrich; Frank Oliver Glöckner; James Cole; Ryan Farris; Lynn M Schriml; Paul De Vos; Bernard De Baets; Dawn Field; Peter Dawyndt
Journal:  OMICS       Date:  2008-06

8.  A standard MIGS/MIMS compliant XML Schema: toward the development of the Genomic Contextual Data Markup Language (GCDML).

Authors:  Renzo Kottmann; Tanya Gray; Sean Murphy; Leonid Kagan; Saul Kravitz; Thierry Lombardot; Dawn Field; Frank Oliver Glöckner
Journal:  OMICS       Date:  2008-06

9.  Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project.

Authors:  Chris F Taylor; Dawn Field; Susanna-Assunta Sansone; Jan Aerts; Rolf Apweiler; Michael Ashburner; Catherine A Ball; Pierre-Alain Binz; Molly Bogue; Tim Booth; Alvis Brazma; Ryan R Brinkman; Adam Michael Clark; Eric W Deutsch; Oliver Fiehn; Jennifer Fostel; Peter Ghazal; Frank Gibson; Tanya Gray; Graeme Grimes; John M Hancock; Nigel W Hardy; Henning Hermjakob; Randall K Julian; Matthew Kane; Carsten Kettner; Christopher Kinsinger; Eugene Kolker; Martin Kuiper; Nicolas Le Novère; Jim Leebens-Mack; Suzanna E Lewis; Phillip Lord; Ann-Marie Mallon; Nishanth Marthandan; Hiroshi Masuya; Ruth McNally; Alexander Mehrle; Norman Morrison; Sandra Orchard; John Quackenbush; James M Reecy; Donald G Robertson; Philippe Rocca-Serra; Henry Rodriguez; Heiko Rosenfelder; Javier Santoyo-Lopez; Richard H Scheuermann; Daniel Schober; Barry Smith; Jason Snape; Christian J Stoeckert; Keith Tipton; Peter Sterk; Andreas Untergasser; Jo Vandesompele; Stefan Wiemann
Journal:  Nat Biotechnol       Date:  2008-08       Impact factor: 54.908

  9 in total
  6 in total

1.  The Genomic Observatories Metadatabase (GeOMe): A new repository for field and sampling event metadata associated with genetic samples.

Authors:  John Deck; Michelle R Gaither; Rodney Ewing; Christopher E Bird; Neil Davies; Christopher Meyer; Cynthia Riginos; Robert J Toonen; Eric D Crandall
Journal:  PLoS Biol       Date:  2017-08-03       Impact factor: 8.029

2.  MetaHCR: a web-enabled metagenome data management system for hydrocarbon resources.

Authors:  Peter C Marks; Marc Bigler; Eric B Alsop; Adrien Vigneron; Bart P Lomans; Renato De Paula; Brett Geissler; Nicolas Tsesmetzis
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

Review 3.  Global genetic diversity status and trends: towards a suite of Essential Biodiversity Variables (EBVs) for genetic composition.

Authors:  Sean Hoban; Frederick I Archer; Laura D Bertola; Jason G Bragg; Martin F Breed; Michael W Bruford; Melinda A Coleman; Robert Ekblom; W Chris Funk; Catherine E Grueber; Brian K Hand; Rodolfo Jaffé; Evelyn Jensen; Jeremy S Johnson; Francine Kershaw; Libby Liggins; Anna J MacDonald; Joachim Mergeay; Joshua M Miller; Frank Muller-Karger; David O'Brien; Ivan Paz-Vinas; Kevin M Potter; Orly Razgour; Cristiano Vernesi; Margaret E Hunter
Journal:  Biol Rev Camb Philos Soc       Date:  2022-04-12

4.  RCN4GSC Workshop Report: Managing Data at the Interface of Biodiversity and (Meta)Genomics, March 2011.

Authors:  Robert J Robbins; Linda Amaral-Zettler; Holly Bik; Stan Blum; James Edwards; Dawn Field; George Garrity; Jack A Gilbert; Renzo Kottmann; Leonard Krishtalka; Hilmar Lapp; Carolyn Lawrence; Norman Morrison; Eamonn Ó Tuama; Cynthia Parr; Inigo San Gil; David Schindel; Lynn Schriml; David Vieglas; John Wooley
Journal:  Stand Genomic Sci       Date:  2012-07-28

5.  Semantics in support of biodiversity knowledge discovery: an introduction to the biological collections ontology and related ontologies.

Authors:  Ramona L Walls; John Deck; Robert Guralnick; Steve Baskauf; Reed Beaman; Stanley Blum; Shawn Bowers; Pier Luigi Buttigieg; Neil Davies; Dag Endresen; Maria Alejandra Gandolfo; Robert Hanner; Alyssa Janning; Leonard Krishtalka; Andréa Matsunaga; Peter Midford; Norman Morrison; Éamonn Ó Tuama; Mark Schildhauer; Barry Smith; Brian J Stucky; Andrea Thomer; John Wieczorek; Jamie Whitacre; John Wooley
Journal:  PLoS One       Date:  2014-03-03       Impact factor: 3.240

6.  The BiSciCol Triplifier: bringing biodiversity data to the Semantic Web.

Authors:  Brian J Stucky; John Deck; Tom Conlin; Lukasz Ziemba; Nico Cellinese; Robert Guralnick
Journal:  BMC Bioinformatics       Date:  2014-07-29       Impact factor: 3.169

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.