Literature DB >> 22700703

A decade of Web Server updates at the Bioinformatics Links Directory: 2003-2012.

Michelle D Brazas1, David Yim, Winston Yeung, B F Francis Ouellette.   

Abstract

The 2012 Bioinformatics Links Directory update marks the 10th special Web Server issue from Nucleic Acids Research. Beginning with content from their 2003 publication, the Bioinformatics Links Directory in collaboration with Nucleic Acids Research has compiled and published a comprehensive list of freely accessible, online tools, databases and resource materials for the bioinformatics and life science research communities. The past decade has exhibited significant growth and change in the types of tools, databases and resources being put forth, reflecting both technology changes and the nature of research over that time. With the addition of 90 web server tools and 12 updates from the July 2012 Web Server issue of Nucleic Acids Research, the Bioinformatics Links Directory at http://bioinformatics.ca/links_directory/ now contains an impressive 134 resources, 455 databases and 1205 web server tools, mirroring the continued activity and efforts of our field.

Entities:  

Mesh:

Year:  2012        PMID: 22700703      PMCID: PMC3394264          DOI: 10.1093/nar/gks632

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


COMMENTARY

‘Because the number of artifacts has increased greatly, it is impossible for many bioinformatics researchers to track tools, databases, and methods in the field—or even perhaps within their own specialty area. More critically, however, biologist users and scientists approaching the field do not have a comprehensive index of bioinformatics algorithms, databases, and literature annotated with information about their context and appropriate use.’ (1) The need to not only share bioinformatics databases, tools and resources but also compile a bioinformatics ‘resourceome’ has been evident for some time (1). In dedicating a special issue each year since 1993 to new databases or updates and a second issue each year since 2003 to new web server tools or updates, Nucleic Acids Research has successfully created a viable and recognizable dissemination mechanism. Each year since 2005 and extending back to 2003, the Bioinformatics Links Directory has coordinated with Nucleic Acids Research (NAR) to update their community driven ‘resourceome’ with all of NAR’s web server tools (2–8). This coordinated effort has resulted in a comprehensive public repository of bioinformatics tools and databases, annotated with contextual information and organized by functional relevance (http://bioinformatics.ca/links_directory). Users requiring a bioinformatics tool or database for their research may search the Bioinformatics Links Directory using any number of its features, from browsing the biological categories and their drop-down subcategories on the main page, to computing a keyword search from the search box or selecting related links. Content within the Directory is organized by biological subject, with subcategories of common tasks relevant to each subject listed under the corresponding category (Figure 1). A keyword search from the search box (Figure 2) returns links containing the entered keywords found within either the link’s title, description or tag terms—a search for ‘gene set enrichment’ for example returns links containing a Boolean combination of gene AND set AND enrichment, with these keywords highlighted in yellow where they appear in the returned list (Figure 3). Other specific searches may also be computed such as exact word matches using quotation marks. Returned links from a given search may be refined to one or more of the link types within the Links Directory (tools, resources or databases), using the ‘Search Options’ feature (Figure 4). From a link of interest, related or similar links within the Links Directory are retrievable by selecting the ‘Related Links’ tab, where related links are chosen based on the PubMed-related citations feature for the primary link. To aid with determining the usefulness and currency of a link, all links within the Links Directory with one or more citations have a per PubMed Identifier (PMID) and cumulative Google Scholar (9) citation count, as well as a Google+ and Twitter sharing count (10) (Figure 5). A keyword search will return links in descending order of cumulative citation count. Detailed explanations and examples of searching within the Links Directory are provided on the tutorials page at http://www.bioinformatics.ca/links_directory/tutorial. These annotations and search features provide the ‘biologist user with information about a link’s context’ and usefulness so to narrow down the algorithm or database within the Bioinformatics Links Directory ‘resourceome’ best suited to their research need (1).
Figure 1.

The homepage of the Bioinformatics Links Directory displays the functional categories and subcategories within which more than 1700 tools, resources and databases are contained. The DNA category is shown expanded into its subcategories.

Figure 2.

Searching for links within the Bioinformatics Links Directory may be done through use of keywords in the search box. Suggested search options are provided in the search guide box.

Figure 3.

Keyword searches such as for gene set enrichment return matching categories and links, which contain the requested keywords highlighted in yellow.

Figure 4.

A returned keyword search may be refined using the Search Options, where returned links may be refined to include or exclude all tools, resources and databases.

Figure 5.

All links within the Bioinformatics Links Directory now contain a per PMID and cumulative Google Scholar citation count for the citations listed for each link, as well as a Google+ and Twitter sharing count.

The homepage of the Bioinformatics Links Directory displays the functional categories and subcategories within which more than 1700 tools, resources and databases are contained. The DNA category is shown expanded into its subcategories. Searching for links within the Bioinformatics Links Directory may be done through use of keywords in the search box. Suggested search options are provided in the search guide box. Keyword searches such as for gene set enrichment return matching categories and links, which contain the requested keywords highlighted in yellow. A returned keyword search may be refined using the Search Options, where returned links may be refined to include or exclude all tools, resources and databases. All links within the Bioinformatics Links Directory now contain a per PMID and cumulative Google Scholar citation count for the citations listed for each link, as well as a Google+ and Twitter sharing count.

SHIFTING TRENDS

The past decade of NAR Web Server releases has seen significant growth and change in the types of tools, databases and resources being put forth. Although each Web Server issue has seen a steady number of new web servers and updates, the functional utility classification of these web servers has differed greatly over the decade. The Bioinformatics Links Directory in 2006 compiled a graph highlighting the trends in five subcategories of links published by NAR between 2003 and 2006 (3). An update of that graph (Figure 6) shows the exceptional changes in new bioinformatics web servers within these same selected subcategories; Table 1 lists how substantial these changes have been across the entire field of bioinformatics as well (Table 1). Subcategories with active contributions of new web servers in 2006 differ from the subcategories with active contributions of new web servers in 2012 (Figure 6 and Table 1). For example subcategories such as ‘DNA: Tools for the Bench’ subcategory, which contains new web servers for conducting generic DNA analyses such as primer design, exhibited strong growth over 2003–2006 but shows very little growth in 2012 as tools in this category remain relevant over time (Figure 6). In contrast, other subcategories such as Protein: 3D Structural Features have remained active in web server development over time, as the introduction of better technologies and algorithms has permitted a deeper appreciation of proteins and their structural features in 3D (Figure 6).
Figure 6.

Number of NAR Web Servers from selected subcategories of the Bioinformatics Links Directory (http://bioinformatics.ca/links_directory/). The number of NAR Web Servers published from 2003–2012 for five selected subcategories of the Bioinformatics Links Directory: Protein: 3D Structural Features; Expression: Gene Regulation; DNA: Tools for the Bench; Sequence Comparison: Analysis of Aligned Sequences and RNA: Functional RNAs is shown.

Table 1.

Historical summary of the number of resources, databases and web server tools listed in each subcategory of the Bioinformatics Links Directory between 2005 and 2012

Name2005 Total2006 Total2007 Total2008 Total2009 Total2010 Total2011
2012a
ResourcesDatabasesToolsTotalResourcesDatabasesToolsTotal
Computer related
    Bio-* programming tools2020202020186121861218
    C/C++333332112112
    Databases222235156156
    Java444444314314
    Linux/Unix1212111111105141051410
    PERL555555415415
    PHP3311111111
    Statistics999999279279
    Web development6622222222
    Web services567710173161931518
    Workflows3156178
DNA
    Annotations293856576271176573165966
    Databases565654256
    DNA and genomic analysis171920232425512228512632
    Gene prediction173233343737113537112931
    Mapping and assembly121415151720321621321116
    Phylogeny reconstruction243743464953444553444149
    Sequence polymorphisms223239414244183948173947
    Sequence retrieval and submission232630323230162330172028
    Structure and sequence feature detection7111814214515016131015016339137149
    Tools for the bench445563657176276776246066
Education
    Bioinformatics-related news sources9999999977
    Community19192423231817219171119
    Courses, programs and workshops5555543311
    Directories and portals12151515151812618121518
    General1515141414141311412113
    Tutorials and directed learning resources9999910101099
Expression
    cDNA, EST, SAGE2429364448491103849182938
    Databases212120320
    Gene regulation5996119120128138316127146316119138
    Gene set analysis11241293013233
    Networks812162151712122
    Protein expression789172223131822121821
    Splicing916191921223172031619
    Transcript expression and microarrays5475891011081215101071225996110
Human genome
    Annotations223137383946253946243541
    Databases3113230232
    Ethics778886516415
    Genomics44331019321520321621
    Health and disease111419232729172331172432
    Other resources222529292931941730941629
    Sequence polymorphisms172533363846244349244450
Literature
    Databases7777
    Goldmines566665145145
    Open access resources222234134134
    Search tools6101212131468146612
    Text mining711152230314303443539
Model organisms
    Databases959594194
    Fish1111111111115611459
    Fly121617171721281323171624
    General resources20232728293231216313111630
    Microbes1931384553603124964295162
    Mouse and rat28323535364241721424112439
    Other organisms131821212122361322341320
    Other vertebrates1010101010111371113711
    Plants1116192125282101830292031
    Worm99999101461114813
    Yeast111518181821291021271221
Other molecules
    Carbohydrates5666677755
    Compounds26121141511516
    Databases262625126
    Enzymes2
    Metabolites347121121311314
    Peptides11
    Small molecules13669132131521517
Protein
    2D Structure prediction295158606365165665656
    3D Structural features537075851001210310612107110
    3D Structure comparison 783545505971166875166269
    3D Structure prediction4859607083418590218386
    3D Structure retrieval, viewing284551525658474758384253
    Annotation and function2635444753575576255661
    Biochemical features253740414646224347224145
    Databases11611171152117
    Do-it-all tools for protein6881314153121531114
    Domains and motifs50861121151211241411512914109123
    Identification, presentation and format1213141414141131411415
    Networks & Interactions, pathways, enzymes48668894107125321113137219113134
    Localization and targeting1930383839413394233437
    Molecular dynamics and docking19212734401444514849
    Phylogeny reconstruction283644455354454756444856
    Protein expression788810101271012811
    Proteomics212527333739343239342633
    Sequence comparison7141718181818
    Sequence data77891010163101539
    Sequence features142531333846224650234752
    Sequence retrieval212729293129162128172028
RNA
    Databases02412523225
    Functional RNAs9141926323705394454146
    General resources810101010114341152411
    Motifs9192122232501262712526
    Sequence retrieval111110111190279279
    Structure prediction, visualization and design24384754586203656836467
Sequence comparison
    Alignment editing and visualization1420212123250025252323
    Analysis of aligned sequences27435960626400656515758
    Comparative genomics192633353748274049263341
    Multiple sequence alignments193850565765106768116264
    Other alignment tools611111111120013131212
    Pairwise sequence alignments11222326333510343513031
    Similarity searching133147474950124952134549
Total unique resources, databases and tools in Bioinformatics Links Directory700+1000+1100+1200+140015001444801250187413445512051794

aA complete listing of all URLs listed in the Nucleic Acids Research 2012 Web Server Issue can be accessed online at: http://bioinformatics.ca/links_directory/narweb2012.

Number of NAR Web Servers from selected subcategories of the Bioinformatics Links Directory (http://bioinformatics.ca/links_directory/). The number of NAR Web Servers published from 2003–2012 for five selected subcategories of the Bioinformatics Links Directory: Protein: 3D Structural Features; Expression: Gene Regulation; DNA: Tools for the Bench; Sequence Comparison: Analysis of Aligned Sequences and RNA: Functional RNAs is shown. Historical summary of the number of resources, databases and web server tools listed in each subcategory of the Bioinformatics Links Directory between 2005 and 2012 aA complete listing of all URLs listed in the Nucleic Acids Research 2012 Web Server Issue can be accessed online at: http://bioinformatics.ca/links_directory/narweb2012. Indeed, many of these trends in subcategory activity reflect the introduction of a new research technology and/or an underlying research need. As originally discussed in 2009, yet still evident throughout Table 1, there are observable sharp spikes in the number of informatics tools available in a given biological area or specific to a particular task (6). Although between 2006 and 2007, there was a spike in ‘DNA: Structure and Sequence Feature Detection’, ‘Expression: Gene Regulation’ and ‘Expression: Transcript Expression and Microarrays’ most likely in response to the pervasive use of expression platforms and transcript analyses at that time, current spikes in ‘Human Genome: Genomics’, ‘Sequence Comparison: Comparative Genomics’ and ‘Sequence Comparison: Pairwise Sequence Alignments’ between 2010 and 2012 may be reflective of the introduction of next-generation sequencing technologies. Table 1 highlights all the trends between 2005 and 2012 across all subcategories of the Bioinformatics Links Directory (Table 1) (individual calendar year data are not available for 2003–2005 as the Bioinformatics Links Directory collaboration with NAR began in 2005 for all data before that). Of note are the aggressive coverage of databases, which began in 2011 and the need for additional subcategories such as ‘Other Molecules: Enzymes and Peptides’ in response to the growing diversity and depth in the field.

CONSTANT GROWTH

The number of bioinformatics links produced by the community has nonetheless continued to accumulate over time. When first released in 2002 under Bioinformatics.ca, there were only 385 links in the database. By 2005, when the Bioinformatics Links Directory began its collaboration with NAR, the Directory already contained 700+ web servers and resources (2). The Bioinformatics Links Directory has continued to grow (and diminish in size as web servers become extinct) with each successive year (Table 1). The Bioinformatics Links Directory now contains more than 1200 unique web servers. This year's NAR Web Server Issue introduces an additional 90 web servers and 12 server updates (Table 1). The complete listing of URLs cited in the 2012 Web Server Issue can be accessed online at the Nucleic Acids Research website, http://nar.oxfordjournals.org/, and at http://bioinformatics.ca/links_directory/narweb2012/. Growth has also resulted in change. In 2011, the Bioinformatics Links Directory introduced all the databases published by Nucleic Acids Research over their 2010–2011 window and completed a major reorganization of its existing links (8). Links were reclassified into one of three types reflecting the growing variety of bioinformatics materials being generated to support research: (i) Resource—a static resource whose intention is to convey bioinformatic information; (ii) Tool—a bioinformatic web server or downloadable software tool that can query, analyze, extract or modify input data and (iii) Database—a biological data store that can be queried. Under this new schema and with this expanded content, the Bioinformatics Links Directory now contains 1794 links comprising 455 databases, 134 resources and 1205 web server tools (Table 1). The Bioinformatics Links Directory has also initiated active curation of its content, removing dead content and correcting content errors, which has resulted in more accurate although occasionally smaller counts for 2012 (Table 1). Over time, the Bioinformatics Links Directory is expected to continue its growth and evolution in keeping with the current pace of research.

FUTURE CHALLENGES

With such a vast number of bioinformatics tools available for biological research, now more than ever, there is a need for a well-annotated and user-friendly ‘resourceome’ (1). In 2011, the Bioinformatics Links Directory introduced features to facilitate peer review of its links and link ownership (8). The application of Google +1 and other social media features, such as Twitter to the Bioinformatics Links Directory, provides users with the opportunity to share useful links; registered users may also post comments and review a link’s usefulness or start a discussion forum. Along with enhanced search capabilities and MeSH term and keyword tagging, such a peer-review system allows users to rapidly narrow down the links most applicable to their research need. Other features added provide link owners with the capacity to edit, manage and communicate on their own link without requiring input or support from the Bioinformatics Links Directory (8). Although necessary to remain current and to advance the utility of the Bioinformatics Links Directory, these improvements will only prove useful if driven by the community. As a community-driven repository, everyone in the research or bioinformatics community has the opportunity to help make the collection better and more meaningful. Anyone may (i) suggest a link through links@bioinformatics.ca or through the ‘Suggest a URL’ page on the Bioinformatics Links Directory; (ii) register for a free account and submit their own tool or database; (iii) rate a tool, resource or database; (iv) register for a free account and submit their review of a tool, resource or database; (v) enrich a tool or database with tutorials or user-oriented content; (vi) initiate a discussion forum on a problem related to the use of any given tool or database or (vii) share a useful tool, resource or database with their colleagues. After a decade of output from the NAR Web Server issues, it will be interesting to monitor the development, maturity and impact of these bioinformatics web servers and databases over the next decade.

FUNDING

Ontario Institute for Cancer Research and the government of Ontario, and from Genome Canada and the Ontario Genome Institute though a platform award to the The Center of Applied Genomics (TCAG) at the Hospital for Sick Children (SickKids). Funding for open access charge: Waived by Oxford University Press. Conflict of interest statement. None declared.
  8 in total

1.  Evolution in bioinformatic resources: 2009 update on the Bioinformatics Links Directory.

Authors:  Michelle D Brazas; Joseph Tadashi Yamada; B F Francis Ouellette
Journal:  Nucleic Acids Res       Date:  2009-06-15       Impact factor: 16.971

2.  Providing web servers and training in Bioinformatics: 2010 update on the Bioinformatics Links Directory.

Authors:  Michelle D Brazas; Joseph T Yamada; B F Francis Ouellette
Journal:  Nucleic Acids Res       Date:  2010-06-11       Impact factor: 16.971

3.  The 2011 Bioinformatics Links Directory update: more resources, tools and databases and features to empower the bioinformatics community.

Authors:  Michelle D Brazas; David S Yim; Joseph T Yamada; B F Francis Ouellette
Journal:  Nucleic Acids Res       Date:  2011-07       Impact factor: 16.971

4.  Time to organize the bioinformatics resourceome.

Authors:  Nicola Cannata; Emanuela Merelli; Russ B Altman
Journal:  PLoS Comput Biol       Date:  2005-12       Impact factor: 4.475

5.  A compilation of molecular biology web servers: 2006 update on the Bioinformatics Links Directory.

Authors:  Joanne A Fox; Scott McMillan; B F Francis Ouellette
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

6.  Conducting research on the web: 2007 update for the bioinformatics links directory.

Authors:  Joanne A Fox; Scott McMillan; B F Francis Ouellette
Journal:  Nucleic Acids Res       Date:  2007-06-22       Impact factor: 16.971

7.  The Bioinformatics Links Directory: a compilation of molecular biology web servers.

Authors:  Joanne A Fox; Stefanie L Butland; Scott McMillan; Graeme Campbell; B F Francis Ouellette
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

8.  Keeping pace with the data: 2008 update on the Bioinformatics Links Directory.

Authors:  Michelle D Brazas; Joanne A Fox; Timothy Brown; Scott McMillan; B F Francis Ouellette
Journal:  Nucleic Acids Res       Date:  2008-07-01       Impact factor: 16.971

  8 in total
  9 in total

1.  Ten simple rules for making research software more robust.

Authors:  Morgan Taschuk; Greg Wilson
Journal:  PLoS Comput Biol       Date:  2017-04-13       Impact factor: 4.475

2.  Expanding roles in a library-based bioinformatics service program: a case study.

Authors:  Meng Li; Yi-Bu Chen; William A Clintworth
Journal:  J Med Libr Assoc       Date:  2013-10

3.  OMICtools: an informative directory for multi-omic data analysis.

Authors:  Vincent J Henry; Anita E Bandrowski; Anne-Sophie Pepin; Bruno J Gonzalez; Arnaud Desfeux
Journal:  Database (Oxford)       Date:  2014-07-14       Impact factor: 3.451

4.  Bio-TDS: bioscience query tool discovery system.

Authors:  Etienne Z Gnimpieba; Menno S VanDiermen; Shayla M Gustafson; Bill Conn; Carol M Lushbough
Journal:  Nucleic Acids Res       Date:  2016-10-18       Impact factor: 16.971

5.  U-Index, a dataset and an impact metric for informatics tools and databases.

Authors:  Alison Callahan; Rainer Winnenburg; Nigam H Shah
Journal:  Sci Data       Date:  2018-03-20       Impact factor: 6.444

6.  A snapshot of 3649 Web-based services published between 1994 and 2017 shows a decrease in availability after 2 years.

Authors:  Ágnes Ősz; Lőrinc Sándor Pongor; Danuta Szirmai; Balázs Győrffy
Journal:  Brief Bioinform       Date:  2019-05-21       Impact factor: 11.622

7.  G-Links: a gene-centric link acquisition service.

Authors:  Kazuki Oshita; Masaru Tomita; Kazuharu Arakawa
Journal:  F1000Res       Date:  2014-11-19

8.  Navigating the changing learning landscape: perspective from bioinformatics.ca.

Authors:  Michelle D Brazas; B F Francis Ouellette
Journal:  Brief Bioinform       Date:  2013-03-20       Impact factor: 11.622

9.  Tools and data services registry: a community effort to document bioinformatics resources.

Authors:  Jon Ison; Kristoffer Rapacki; Hervé Ménager; Matúš Kalaš; Emil Rydza; Piotr Chmura; Christian Anthon; Niall Beard; Karel Berka; Dan Bolser; Tim Booth; Anthony Bretaudeau; Jan Brezovsky; Rita Casadio; Gianni Cesareni; Frederik Coppens; Michael Cornell; Gianmauro Cuccuru; Kristian Davidsen; Gianluca Della Vedova; Tunca Dogan; Olivia Doppelt-Azeroual; Laura Emery; Elisabeth Gasteiger; Thomas Gatter; Tatyana Goldberg; Marie Grosjean; Björn Grüning; Manuela Helmer-Citterich; Hans Ienasescu; Vassilios Ioannidis; Martin Closter Jespersen; Rafael Jimenez; Nick Juty; Peter Juvan; Maximilian Koch; Camille Laibe; Jing-Woei Li; Luana Licata; Fabien Mareuil; Ivan Mičetić; Rune Møllegaard Friborg; Sebastien Moretti; Chris Morris; Steffen Möller; Aleksandra Nenadic; Hedi Peterson; Giuseppe Profiti; Peter Rice; Paolo Romano; Paola Roncaglia; Rabie Saidi; Andrea Schafferhans; Veit Schwämmle; Callum Smith; Maria Maddalena Sperotto; Heinz Stockinger; Radka Svobodová Vařeková; Silvio C E Tosatto; Victor de la Torre; Paolo Uva; Allegra Via; Guy Yachdav; Federico Zambelli; Gert Vriend; Burkhard Rost; Helen Parkinson; Peter Løngreen; Søren Brunak
Journal:  Nucleic Acids Res       Date:  2015-11-03       Impact factor: 16.971

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.