Literature DB >> 28025343

Reefgenomics.Org - a repository for marine genomics data.

Yi Jin Liew1, Manuel Aranda2, Christian R Voolstra2.   

Abstract

Over the last decade, technological advancements have substantially decreased the cost and time of obtaining large amounts of sequencing data. Paired with the exponentially increased computing power, individual labs are now able to sequence genomes or transcriptomes to investigate biological questions of interest. This has led to a significant increase in available sequence data. Although the bulk of data published in articles are stored in public sequence databases, very often, only raw sequencing data are available; miscellaneous data such as assembled transcriptomes, genome annotations etc. are not easily obtainable through the same means. Here, we introduce our website (http://reefgenomics.org) that aims to centralize genomic and transcriptomic data from marine organisms. Besides providing convenient means to download sequences, we provide (where applicable) a genome browser to explore available genomic features, and a BLAST interface to search through the hosted sequences. Through the interface, multiple datasets can be queried simultaneously, allowing for the retrieval of matching sequences from organisms of interest. The minimalistic, no-frills interface reduces visual clutter, making it convenient for end-users to search and explore processed sequence data. DATABASE URL: http://reefgenomics.org.
© The Author(s) 2016. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2016        PMID: 28025343      PMCID: PMC5199144          DOI: 10.1093/database/baw152

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   3.451


Introduction

Driven primarily by continuous reduction in sequencing costs and increasing availability of computing resources over the last decade, the genomes of several marine organisms e.g. Amphimedon queenslandica (1), Acropora digitifera (2), Aiptasia pallida (3), and Hydra vulgaris (4), and the transcriptomes of many others (5–8) have now been sequenced. However, a disconnect exists between what is submitted in the form of primary sequence data and what is available in the form of assembled and annotated data. While the majority of studies provide primary sequence data to public repositories, e.g. NCBI (National Center for Biotechnology Information), EBI (European Bioinformatics Institute), and DDBJ (DNA Data Bank of Japan), many studies elect not to upload assembled and annotated genomes or transcriptomes (all mRNAs expressed from the genes of an organism) to public sequence databases. Also, transcriptomic data tend to be more disparate, as illustrated by a recent 20-coral comparative metastudy (9) that used published primary data to infer the evolutionary success of reef-building corals. Although the sources of all data are cited, a web database to peruse, search via BLAST (Basic Local Alignment and Search Tool) (10), and download relevant sequence data were also provided for the convenience of the readers (http://comparative.reefgenomics.org). To facilitate dissemination of similar data, we designed and host a website with simplicity in mind. We aim to provide an online platform for sharing assembled sequence data, and at the same time, facilitate access and retrieval of sequence files, simplify searches for related sequences among hosted data, and to enable the visual exploration of genomic features. We intend that the ease of access will facilitate further analyses and pave the way for other comparative studies using these and additional data, fostering collaborations and discovery within the marine biology community.

Results

Main landing page

The main page (http://reefgenomics.org) has a text box - similar in design to a popular search engine - that allows new and experienced users alike to quickly jump to their organism(s) of interest. For new users that are unfamiliar with the datasets hosted on the website, we ease data discovery by featuring a collection of datasets, and we also provide a link to a complete list of all datasets on our website. Also on the main page is a prominent ‘Contribute’ button, which provides a point of contact for groups to contribute their data to our data portal (Figure 1A).
Figure 1.

Layout of http://reefgenomics.org. (A) Landing page shows the search bar and tiles corresponding to featured projects and a tile to explore all datasets. (B) Project page shows the typical layout of a project-specific page. Hovering the cursor over tiles darkens them, as shown in the ‘Browse’ tile.

Layout of http://reefgenomics.org. (A) Landing page shows the search bar and tiles corresponding to featured projects and a tile to explore all datasets. (B) Project page shows the typical layout of a project-specific page. Hovering the cursor over tiles darkens them, as shown in the ‘Browse’ tile.

Project-specific data sharing

From the main page, users are able to visit subdomains containing data produced by a project. At the time of writing, the hosted data range from individual genome projects, e.g. the genome of Aiptasia (3), to multi-institute collaborative efforts, e.g. a comparative study of 20 coral transcriptomes and genomes (9). Whenever possible, we opted to use short, memorable subdomain names instead of nested subdirectories. For instance, the Aiptasia genome project is located at http://aiptasia.reefgenomics.org; while the comparative study is at http://comparative.reefgenomics.org. Repeat users can quickly navigate to their organisms/datasets of interest by typing the memorable URLs in their browsers. Subdomains typically contain three buttons: ‘BLAST’, ‘Download’, and ‘Browse’ (Figure 1B). Some subdomains have fewer buttons depending on the hosted contents: for instance, transcriptome data cannot be viewed on a genome browser. The first button links users to a custom BLAST server based on SequenceServer (11), which produces aesthetically pleasing BLAST results that takes advantage of modern web standards. This BLAST server also has the advantage being able to search multiple databases simultaneously with a single query, which simplifies tasks, e.g. retrieving homolog for a particular gene of interest across many organisms. The ‘Download’ page is a typical HTML page that houses links to retrieve datasets, with a MD5 hash for users to verify that their datasets of interest were downloaded correctly. Last, the ‘Browse’ button allows for visual exploration of the hosted data via JBrowse (12). Data tracks can be added or removed via the sidebar on the left; relevant genes or scaffolds can be searched using the text box on the top-right, which supports autocompletion of features such as scaffold or gene names (Figure 2). Whenever applicable, we also provide links to the manuscript that produced/feature the data, as well as links to raw sequence data hosted on NCBI, effectively closing the loop from manuscript to raw data to processed data.
Figure 2.

Visual exploration of genomic data accessible via the ‘Browse’ button. In this example, the repeats identified in the Aiptasia genome are overlaid on top of the gene models, allowing for a quick appraisal of genes with many repeat regions, and the relative position of these regions within the gene model.

Visual exploration of genomic data accessible via the ‘Browse’ button. In this example, the repeats identified in the Aiptasia genome are overlaid on top of the gene models, allowing for a quick appraisal of genes with many repeat regions, and the relative position of these regions within the gene model.

Discussion

Currently, there are several other online portals with data from marine organisms: Metazome (https://metazome.jgi.doe.gov) (13), Compagen (http://compagen.org) (14), and OIST Genomic Projects (http://marinegenomics.oist.jp) (among others). Although there are similarities between reefgenomics.org and the other three websites, there are several features that distinguish our website from other data portals that are summarized in Table 1.
Table 1.

Overview of features of other data portals with similarity to reefgenomics.org

Portalreefgenomics.orgMetazomeCompagenOIST genomic projects
General features
ScopeMarine organismsMetazoansEarly branching metazoansMarine organisms
Site organizationBy projectBy organismBy organismBy organism
Requires registrationNoYesNoNo
Data availability
AnnotationsPresentPresentAbsentPresent
SourceOpen for contributionsJGI-funded/JGI-linked projectsOpen for contributionsOIST-funded projects
Data exploration
BLAST backendSequenceServerSelf-developedwwwblastSelf-developed
Search multiple databases?YesYesNoNo
Genome browserJBrowseJBrowseN/AGBrowse
Overview of features of other data portals with similarity to reefgenomics.org The reefgenomics.org data portal provides a clean and easy-to-use interface for interested researchers to access and explore data from marine organisms. Our intention is not to duplicate available sequence data, but to centralize data (on a project-specific basis) generated by multiple labs that are otherwise hard to access. Arguably, the value and success of reefgenomics.org as a resource grows with the people that use it, the data deployed, and ensuring that the accessible content is up-to-date. Our hope is to promote growing community adaptation by providing an easily accessible and queriable database, facilitate hosting data for interested researchers, and by hosting data from larger projects within the framework of consortia, such as GIGA (the Global Invertebrate Genome Alliance) (15) and ReFuGe 2020 (Reef Future Genomics 2020) (16). Collaborative efforts that use reefgenomics.org as a central repository facilitate comparative analyses, while ensuring that this website will continue to be relevant and a central hub of ‘-omics’ data pertaining to marine organisms for the years to come.

Methods

Website hosting

This website is hosted on a Linode Cloud virtual server (https://www.linode.com) running Debian Stable and Apache 2. Linode also has the advantage of offering tiered plans for different usage requirements, making it convenient to scale the website up if it is under heavy use.

Website design considerations

When designing the website, we coded it in a way to reduce dependencies and maximize compatibility with modern desktop browsers (Firefox, Chrome, and Edge). As such, whenever possible, the use of HTML5 and CSS3 was preferred over the use of large JavaScript libraries.

Use of standard file formats

All genomic, transcriptomic, and proteomic data hosted on the website uses the standard FASTA format; genome feature annotations are in GFF3. Tabular data, e.g. annotations of individual genes, are provided as plain-text, tab-separated values to ease downstream parsing on the command-line and visual inspection via text editors or Microsoft Excel.
  14 in total

1.  The Global Invertebrate Genomics Alliance (GIGA): developing community resources to study diverse invertebrate genomes.

Authors:  Heather Bracken-Grissom; Allen G Collins; Timothy Collins; Keith Crandall; Daniel Distel; Casey Dunn; Gonzalo Giribet; Steven Haddock; Nancy Knowlton; Mark Martindale; Mónica Medina; Charles Messing; Stephen J O'Brien; Gustav Paulay; Nicolas Putnam; Timothy Ravasi; Greg W Rouse; Joseph F Ryan; Anja Schulze; Gert Wörheide; Maja Adamska; Xavier Bailly; Jesse Breinholt; William E Browne; M Christina Diaz; Nathaniel Evans; Jean-François Flot; Nicole Fogarty; Matthew Johnston; Bishoy Kamel; Akito Y Kawahara; Tammy Laberge; Dennis Lavrov; François Michonneau; Leonid L Moroz; Todd Oakley; Karen Osborne; Shirley A Pomponi; Adelaide Rhodes; Scott R Santos; Nori Satoh; Robert W Thacker; Yves Van de Peer; Christian R Voolstra; David Mark Welch; Judith Winston; Xin Zhou
Journal:  J Hered       Date:  2014 Jan-Feb       Impact factor: 2.645

2.  Compagen, a comparative genomics platform for early branching metazoan animals, reveals early origins of genes regulating stem-cell differentiation.

Authors:  Georg Hemmrich; Thomas C G Bosch
Journal:  Bioessays       Date:  2008-10       Impact factor: 4.345

3.  The Amphimedon queenslandica genome and the evolution of animal complexity.

Authors:  Mansi Srivastava; Oleg Simakov; Jarrod Chapman; Bryony Fahey; Marie E A Gauthier; Therese Mitros; Gemma S Richards; Cecilia Conaco; Michael Dacre; Uffe Hellsten; Claire Larroux; Nicholas H Putnam; Mario Stanke; Maja Adamska; Aaron Darling; Sandie M Degnan; Todd H Oakley; David C Plachetzki; Yufeng Zhai; Marcin Adamski; Andrew Calcino; Scott F Cummins; David M Goodstein; Christina Harris; Daniel J Jackson; Sally P Leys; Shengqiang Shu; Ben J Woodcroft; Michel Vervoort; Kenneth S Kosik; Gerard Manning; Bernard M Degnan; Daniel S Rokhsar
Journal:  Nature       Date:  2010-08-05       Impact factor: 49.962

4.  The dynamic genome of Hydra.

Authors:  Jarrod A Chapman; Ewen F Kirkness; Oleg Simakov; Steven E Hampson; Therese Mitros; Thomas Weinmaier; Thomas Rattei; Prakash G Balasubramanian; Jon Borman; Dana Busam; Kathryn Disbennett; Cynthia Pfannkoch; Nadezhda Sumin; Granger G Sutton; Lakshmi Devi Viswanathan; Brian Walenz; David M Goodstein; Uffe Hellsten; Takeshi Kawashima; Simon E Prochnik; Nicholas H Putnam; Shengquiang Shu; Bruce Blumberg; Catherine E Dana; Lydia Gee; Dennis F Kibler; Lee Law; Dirk Lindgens; Daniel E Martinez; Jisong Peng; Philip A Wigge; Bianca Bertulat; Corina Guder; Yukio Nakamura; Suat Ozbek; Hiroshi Watanabe; Konstantin Khalturin; Georg Hemmrich; André Franke; René Augustin; Sebastian Fraune; Eisuke Hayakawa; Shiho Hayakawa; Mamiko Hirose; Jung Shan Hwang; Kazuho Ikeo; Chiemi Nishimiya-Fujisawa; Atshushi Ogura; Toshio Takahashi; Patrick R H Steinmetz; Xiaoming Zhang; Roland Aufschnaiter; Marie-Kristin Eder; Anne-Kathrin Gorny; Willi Salvenmoser; Alysha M Heimberg; Benjamin M Wheeler; Kevin J Peterson; Angelika Böttger; Patrick Tischler; Alexander Wolf; Takashi Gojobori; Karin A Remington; Robert L Strausberg; J Craig Venter; Ulrich Technau; Bert Hobmayer; Thomas C G Bosch; Thomas W Holstein; Toshitaka Fujisawa; Hans R Bode; Charles N David; Daniel S Rokhsar; Robert E Steele
Journal:  Nature       Date:  2010-03-14       Impact factor: 49.962

5.  Using the Acropora digitifera genome to understand coral responses to environmental change.

Authors:  Chuya Shinzato; Eiichi Shoguchi; Takeshi Kawashima; Mayuko Hamada; Kanako Hisata; Makiko Tanaka; Manabu Fujie; Mayuki Fujiwara; Ryo Koyanagi; Tetsuro Ikuta; Asao Fujiyama; David J Miller; Nori Satoh
Journal:  Nature       Date:  2011-07-24       Impact factor: 49.962

6.  Symbiodinium transcriptomes: genome insights into the dinoflagellate symbionts of reef-building corals.

Authors:  Till Bayer; Manuel Aranda; Shinichi Sunagawa; Lauren K Yum; Michael K Desalvo; Erika Lindquist; Mary Alice Coffroth; Christian R Voolstra; Mónica Medina
Journal:  PLoS One       Date:  2012-04-18       Impact factor: 3.240

7.  Phytozome: a comparative platform for green plant genomics.

Authors:  David M Goodstein; Shengqiang Shu; Russell Howson; Rochak Neupane; Richard D Hayes; Joni Fazo; Therese Mitros; William Dirks; Uffe Hellsten; Nicholas Putnam; Daniel S Rokhsar
Journal:  Nucleic Acids Res       Date:  2011-11-22       Impact factor: 16.971

8.  De Novo Assembly and Characterization of Four Anthozoan (Phylum Cnidaria) Transcriptomes.

Authors:  Sheila A Kitchen; Camerron M Crowder; Angela Z Poole; Virginia M Weis; Eli Meyer
Journal:  G3 (Bethesda)       Date:  2015-09-17       Impact factor: 3.154

9.  Transcriptome deep-sequencing and clustering of expressed isoforms from Favia corals.

Authors:  Shaadi F Pooyaei Mehr; Rob DeSalle; Hung-Teh Kao; Apurva Narechania; Zhou Han; Dan Tchernov; Vincent Pieribone; David F Gruber
Journal:  BMC Genomics       Date:  2013-08-12       Impact factor: 3.969

10.  Comparative genomics explains the evolutionary success of reef-forming corals.

Authors:  Debashish Bhattacharya; Shobhit Agrawal; Manuel Aranda; Sebastian Baumgarten; Mahdi Belcaid; Jeana L Drake; Douglas Erwin; Sylvian Foret; Ruth D Gates; David F Gruber; Bishoy Kamel; Michael P Lesser; Oren Levy; Yi Jin Liew; Matthew MacManes; Tali Mass; Monica Medina; Shaadi Mehr; Eli Meyer; Dana C Price; Hollie M Putnam; Huan Qiu; Chuya Shinzato; Eiichi Shoguchi; Alexander J Stokes; Sylvie Tambutté; Dan Tchernov; Christian R Voolstra; Nicole Wagner; Charles W Walker; Andreas Pm Weber; Virginia Weis; Ehud Zelzion; Didier Zoccola; Paul G Falkowski
Journal:  Elife       Date:  2016-05-24       Impact factor: 8.140

View more
  33 in total

1.  Reduced thermal tolerance in a coral carrying CRISPR-induced mutations in the gene for a heat-shock transcription factor.

Authors:  Phillip A Cleves; Amanda I Tinoco; Jacob Bradford; Dimitri Perrin; Line K Bay; John R Pringle
Journal:  Proc Natl Acad Sci U S A       Date:  2020-11-09       Impact factor: 11.205

2.  Advancing Genomics through the Global Invertebrate Genomics Alliance (GIGA).

Authors:  Christian R Voolstra; Gert Wörheide; Jose V Lopez
Journal:  Invertebr Syst       Date:  2017-03-16       Impact factor: 2.583

3.  MEDFORD: A human- and machine-readable metadata markup language.

Authors:  Polina Shpilker; John Freeman; Hailey McKelvie; Jill Ashey; Jay-Miguel Fonticella; Hollie Putnam; Jane Greenberg; Lenore Cowen; Alva Couch; Noah M Daniels
Journal:  Database (Oxford)       Date:  2022-08-17       Impact factor: 4.462

4.  CRISPR/Cas9-mediated genome editing in a reef-building coral.

Authors:  Phillip A Cleves; Marie E Strader; Line K Bay; John R Pringle; Mikhail V Matz
Journal:  Proc Natl Acad Sci U S A       Date:  2018-04-25       Impact factor: 11.205

5.  Fast and pervasive transcriptomic resilience and acclimation of extremely heat-tolerant coral holobionts from the northern Red Sea.

Authors:  Romain Savary; Daniel J Barshis; Christian R Voolstra; Anny Cárdenas; Nicolas R Evensen; Guilhem Banc-Prandi; Maoz Fine; Anders Meibom
Journal:  Proc Natl Acad Sci U S A       Date:  2021-05-11       Impact factor: 11.205

6.  A stony coral cell atlas illuminates the molecular and cellular basis of coral symbiosis, calcification, and immunity.

Authors:  Shani Levy; Anamaria Elek; Xavier Grau-Bové; Simón Menéndez-Bravo; Marta Iglesias; Amos Tanay; Tali Mass; Arnau Sebé-Pedrós
Journal:  Cell       Date:  2021-05-03       Impact factor: 41.582

7.  Genome-Based Analyses of Six Hexacorallian Species Reject the "Naked Coral" Hypothesis.

Authors:  Xin Wang; Guénola Drillon; Taewoo Ryu; Christian R Voolstra; Manuel Aranda
Journal:  Genome Biol Evol       Date:  2017-10-01       Impact factor: 3.416

8.  Association of coral algal symbionts with a diverse viral community responsive to heat shock.

Authors:  Jan D Brüwer; Shobhit Agrawal; Yi Jin Liew; Manuel Aranda; Christian R Voolstra
Journal:  BMC Microbiol       Date:  2017-08-17       Impact factor: 3.605

9.  Transcriptomes and expression profiling of deep-sea corals from the Red Sea provide insight into the biology of azooxanthellate corals.

Authors:  Lauren K Yum; Sebastian Baumgarten; Till Röthig; Cornelia Roder; Anna Roik; Craig Michell; Christian R Voolstra
Journal:  Sci Rep       Date:  2017-07-25       Impact factor: 4.379

10.  Comparative analysis of the genomes of Stylophora pistillata and Acropora digitifera provides evidence for extensive differences between species of corals.

Authors:  Christian R Voolstra; Yong Li; Yi Jin Liew; Sebastian Baumgarten; Didier Zoccola; Jean-François Flot; Sylvie Tambutté; Denis Allemand; Manuel Aranda
Journal:  Sci Rep       Date:  2017-12-14       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.