Literature DB >> 25712261

Biological databases for human research.

Dong Zou1, Lina Ma1, Jun Yu2, Zhang Zhang3.   

Abstract

The completion of the Human Genome Project lays a foundation for systematically studying the human genome from evolutionary history to precision medicine against diseases. With the explosive growth of biological data, there is an increasing number of biological databases that have been developed in aid of human-related research. Here we present a collection of human-related biological databases and provide a mini-review by classifying them into different categories according to their data types. As human-related databases continue to grow not only in count but also in volume, challenges are ahead in big data storage, processing, exchange and curation.
Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.

Entities:  

Keywords:  Big data; Curation; Database; Database category; Human

Mesh:

Year:  2015        PMID: 25712261      PMCID: PMC4411498          DOI: 10.1016/j.gpb.2015.01.006

Source DB:  PubMed          Journal:  Genomics Proteomics Bioinformatics        ISSN: 1672-0229            Impact factor:   7.691


Introduction

As biological data accumulate at larger scales and increase at exponential paces, thanks principally to higher-throughput and lower-cost DNA sequencing technologies, the number of biological databases that have been developed to manage such data deluge is growing at ever-faster rates. The major objectives of biological databases are not only to store, organize and share data in a structured and searchable manner with the aim to facilitate data retrieval and visualization for humans, but also to provide web application programming interfaces (APIs) for computers to exchange and integrate data from various database resources in an automated manner. Therefore, developing databases to deal with gigantic volumes of biological data is a fundamentally essential task in bioinformatics. To be short, biological databases integrate enormous amounts of omics data, serving as crucially important resources and becoming increasingly indispensable for scientists from wet-lab biologists to in silico bioinformaticians. According to a report of 2014 Molecular Biology Database Collection in the journal Nucleic Acids Research, there are a sum of 1552 databases that are publicly accessible online [1]. It should be noted, however, that such count of publicly accessible databases is conservative. In fact, there are some databases providing online services without publication in peer-reviewed journal (e.g., The RNA Modification Database at http://mods.rna.albany.edu) or being developed by commercial companies (e.g., Ingenuity Pathway Analysis at http://www.ingenuity.com/products/ipa), making them underrepresented in the scientific community. Considering the continuously proliferating number of biological databases, it becomes increasingly daunting and time-consuming to navigate in the huge volume of databases of interest. The completion of the Human Genome Project in 2003 holds significant benefits for many fields from human evolution to personalized healthcare and precision medicine. In this report, we present a collection of biological databases relevant to human research and provide a mini-review by classifying them into different categories.

Database classification

Biological databases are developed for diverse purposes, encompass various types of data at heterogeneous coverage and are curated at different levels with different methods, so that there are accordingly several different criteria applicable to database classification.

Scope of data coverage

According to the scope of data coverage, biological databases can be classified as comprehensive and specialized databases. Comprehensive databases cover different types of data from numerous species and typical examples are GenBank [2], European Molecular Biology Laboratory (EMBL) [3], and DNA Data Bank of Japan (DDBJ) [4]. These three databases were established as the International Nucleotide Sequence Database Collaboration in 1988 to collect and disseminate DNA and RNA sequences. On the other hand, specialized databases contain specific types of data or data from specific organisms. For example, WormBase [5] is for nematode biology and genomics and RiceWiki [6] is for community curation of rice genes.

Level of biocuration

According to level of data curation, biological databases can roughly fall into primary and secondary or derivative databases. Primary databases contain raw data as archival repository such as the NCBI Sequence Read Archive (SRA) [7], whereas secondary or derivative databases contain curated information as added value, e.g., NCBI RefSeq [8].

Method of biocuration

As a consequence of the explosive growth of data, curation increasingly requires collective intelligence for collaborative data integration and annotation. Therefore, biological databases can also be classified as (1) expert-curated databases, e.g., RefSeq [8] and TAIR, [9] and (2) community-curated databases, which are curated in a collective and collaborative manner by a number of researchers, e.g., LncRNAWiki [10] and GeneWiki [11].

Type of data managed

According to the types of data managed in different databases, biological databases can roughly fall into the following categories: (1) DNA, (2) RNA, (3) protein, (4) expression, (5) pathway, (6) disease, (7) nomenclature, (8) literature, and (9) standard and ontology.

Human-related databases

Decoding the human genome bears great significance in, from a theoretical view, unveiling human evolutionary history, and from an application view, exploring personalized medicine against diverse diseases. Considering the heterogeneity in data type, scope and curation, biological databases can be classified into multiple categories under different criteria as presented above, making it easier for people to effectively characterize databases and identify the database(s) of interest. However, some databases are inaccessible over time or poorly maintained/updated or even never used [12]. In this study, therefore, we assemble a collection of human-related databases that are widely used and currently accessible via the Internet (Table 1). As database classification based on data type is informative and straightforward, we assign one major category to each database, albeit one database may correspond to multiple categories. In what follows, we focus on databases categorized in DNA, RNA, protein, expression, pathway and disease, respectively.
Table 1

Human-related biological databases∗

NameLinkBrief descriptionRefs.Category#
1000 Genomeshttp://www.1000genomes.orgA deep catalog of human genetic variation[17]DNA
AFNDhttp://www.allelefrequencies.netAllele Frequency Net Database[37]
dbSNPhttp://www.ncbi.nlm.nih.gov/snpDatabase of single nucleotide polymorphisms[13]
DEGhttp://www.essentialgene.orgDatabase of Essential Genes[38]
EGAhttp://www.ebi.ac.uk/egaEuropean Genome–phenome Archive[14]
Ensemblhttp://www.ensembl.orgEnsembl genome browser[39]
euGeneshttp://eugenes.orgGenomic information for eukaryotic organisms[40]
GeneCardshttp://www.genecards.orgIntegrated database of human genes[41]
IMG/HMPhttps://img.jgi.doe.gov/imgm_hmpHuman Microbiome MetaGenomes[15]
JASPARhttp://jaspar.genereg.netTranscription factor binding profile database[42]
JGAhttp://trace.ddbj.nig.ac.jp/jgaJapanese Genotype–phenotype Archive[43]
KEGGhttp://www.genome.jp/keggKyoto Encyclopedia of Genes and Genomes[44]
MITOMAPhttp://www.mitomap.orgHuman mitochondrial genome database[45]
NCBI RefSeqhttp://www.ncbi.nlm.nih.gov/refseqNCBI Reference Sequence Database[8]
PolymiRTShttp://compbio.uthsc.edu/miRSNPPolymorphism in miRNAs and their Target Sites[46]
UCSC Genome Browserhttp://genome.ucsc.eduUCSC Genome Browser database[47]



ChIPBasehttp://deepbase.sysu.edu.cn/chipbaseDatabase of transcriptional regulation of lncRNA and miRNA genes[48]RNA
DARNEDhttp://darned.ucc.ieDAtabase of RNa EDiting in humans[49]
DIANA-LncBasehttp://diana.imis.athena-innovation.gr/DianaTools/index.php?r=lncBase/indexmiRNA targets on lncRNAs[50]
GENCODEhttp://www.gencodegenes.orgEncyclopedia of genes and gene variants[17]
H-DBAShttp://www.h-invitational.jp/h-dbasHuman-transcriptome DataBase for Alternative Splicing[51]
HEXEventhttp://hexevent.mmg.uci.eduDatabase of Human EXon splicing Events[52]
LNCipediahttp://www.lncipedia.orgAnnotated human lncRNA sequences[53]
LncRNA2Targethttp://www.lncrna2target.orgDatabase of differentially-expressed genes after lncRNA knockdown or overexpression[54]
lncRNAdbhttp://www.lncrnadb.orglncRNA Database[20]
lncRNASNPhttp://bioinfo.life.hust.edu.cn/lncRNASNPDatabase of SNPs in lncRNAs[55]
LncRNAWikihttp://lncrna.big.ac.cnHuman lncRNA Wiki[10]
miRBasehttp://www.mirbase.orgmiRNA Database[21]
miRTarBasehttp://mirtarbase.mbc.nctu.edu.twExperimentally-validated miRNA–target interactions[56]
miRWalkhttp://mirwalk.uni-hd.deDatabase of miRNA–target interactions[57]
NONCODEhttp://www.noncode.orgDatabase of ncRNA genes[58]
NPInterhttp://www.bioinfo.org/NPInterDatabase of ncRNA interactions[59]
RADAR http://RNAedit.comRigorously Annotated Database of A-to-I RNA editing[60]
piRNABankhttp://pirnabank.ibab.ac.inDatabase of piwi-interacting RNAs[61]
RBPDBhttp://rbpdb.ccbr.utoronto.caDatabase of RNA-binding specificities[62]
RDBhttp://ndbserver.rutgers.eduThe nucleic acid database[63]
Rfamhttp://rfam.xfam.orgDatabase of ncRNA families[19]
RNAcentralhttp://rnacentral.orgInternational database of ncRNA sequences[18]
snoRNABasehttps://www-snorna.biotoul.frDatabase of human H/ACA and C/D box snoRNAs[64]
starBasehttp://starbase.sysu.edu.cnDatabase of ncRNA interaction networks[65]
TarBasehttp://diana.imis.athena-innovation.gr/DianaTools/index.php?r=tarbase/indexExperimentally-validated miRNA:gene interactions[66]
TargetScanhttp://www.targetscan.orgPredicted miRNA targets in mammals[67]



CATHhttp://cath.biochem.ucl.ac.ukProtein structure classification[68]Protein
CPLMhttp://cplm.biocuckoo.orgCompendium of Protein Lysine Modifications[69]
DIPhttp://dip.doe-mbi.ucla.eduDatabase of Interacting Proteins[70]
EKPDhttp://ekpd.biocuckoo.orgEukaryotic Kinase and Phosphatase Database[71]
HPRDhttp://www.hprd.orgHuman Protein Reference Database[72]
hUbiquitomehttp://bioinfo.bjmu.edu.cn/hubi/Ubiquitination sites and cascades[73]
InterProhttp://www.ebi.ac.uk/interproProtein sequence analysis and classification[74]
MEROPShttp://merops.sanger.ac.ukDatabase of proteolytic enzymes, their substrates, and inhibitors[75]
MINThttp://mint.bio.uniroma2.it/mintMolecular INTeraction Database[76]
ModBasehttp://salilab.org/modbaseDatabase of comparative protein structure models[77]
mUbiSiDahttp://reprod.njmu.edu.cn/mUbiSiDaMammalian Ubiquitination Site Database[78]
PANTHERhttp://www.pantherdb.orgProtein ANalysis THrough Evolutionary Relationships[79]
PDBhttp://www.rcsb.org/pdbProtein Data Bank for 3D structures of biological macromolecules[25]
PDBehttp://www.ebi.ac.uk/pdbeProtein Data Bank in Europe[80]
Pfamhttp://pfam.xfam.orgDatabase of conserved protein families and domains[23]
PhosSNPhttp://phossnp.biocuckoo.orgGenetic polymorphisms that influence protein phosphorylation[81]
PIRhttp://pir.georgetown.eduProtein Information Resource[82]
PROSITEhttp://www.expasy.org/prositeDatabase of protein domains, families and functional sites[83]
SysPTMhttp://lifecenter.sgst.cn/SysPTMPost-translational modifications[84]
TreeFamhttp://www.treefam.orgDatabase of phylogenetic trees of animal species[24]
UniPROBEhttp://thebrain.bwh.harvard.edu/uniprobeUniversal PBM Resource for Oligonucleotide Binding Evaluation[85]
UniProthttp://www.uniprot.orgUniversal protein resource[22]
UUCDhttp://uucd.biocuckoo.orgUbiquitin and Ubiquitin-like Conjugation Database[86]



ArrayExpresshttp://www.ebi.ac.uk/arrayexpressDatabase of functional genomics experiments[87]Expression
BioGPShttp://biogps.orgPortal for querying and organizing gene annotation resources[88]
Expression Atlashttp://www.ebi.ac.uk/gxaDifferential and baseline expression[27]
Human Protein Atlashttp://www.proteinatlas.orgTissue-based map of the human proteome[29]
MOPEDhttps://www.proteinspire.orgMulti-Omics Profiling Expression Database[89]
NCBI GEOhttp://www.ncbi.nlm.nih.gov/geoGene Expression Omnibus[26]
NREDhttp://nred.matticklab.comDatabase of lncRNA expression[90]
ONCOMINEhttps://www.oncomine.orgCancer microarray database[91]
PrimerBankhttp://pga.mgh.harvard.edu/primerbankPublic resource for PCR primers[92]
PRIDEhttp://www.ebi.ac.uk/pridePRoteomics IDEntifications[93]
TiGERhttp://bioinfo.wilmer.jhu.edu/tigerTissue-specific Gene Expression and Regulation[28]
WikiCellhttp://www.wikicell.orgUnified resource for Human transcriptomics research[94]



CPDBhttp://consensuspathdb.orgDatabase of human interaction networks[95]Pathway
HMDBhttp://www.hmdb.caHuman Metabolome Database[96]
KEGG PATHWAYhttp://www.genome.jp/kegg/pathway.htmlKEGG pathway maps[30]
MetaCychttp://metacyc.orgMetabolic pathway database[97]
Pathway Commonshttp://www.pathwaycommons.orgPathway commons[98]
PIDhttp://pid.nci.nih.govPathway Interaction Database[99]
Reactomehttp://www.reactome.orgCurated and peer-reviewed pathway database[100]
UniPathwayhttp://www.grenoble.prabi.fr/obiwarehouse/unipathwayUniversal Pathway[101]



AlzBasehttp://alz.big.ac.cn/alzBaseDatabase for gene dysregulation in Alzheimer’s disease[102]Disease
CADgenehttp://www.bioguo.org/CADgeneCoronary Artery Disease gene database[103]
COSMIChttp://cancer.sanger.ac.ukCatalog Of Somatic Mutations In Cancer[104]
DiseaseMethhttp://bioinfo.hrbmu.edu.cn/diseasemethHuman disease methylation database[105]
DisGeNEThttp://www.disgenet.org/web/DisGeNET/v2.1Gene–disease associations[106]
GOBOhttp://co.bmc.lu.se/goboGene expression-based Outcome for Breast cancer Online[107]
GWAS Centralhttp://www.gwascentral.orgA comprehensive resource for the comparison and interrogation of genome-wide association studies[108]
GWASdbhttp://jjwanglab.org/gwasdbHuman genetic variants identified by genome-wide association studies[109]
HbVarhttp://globin.cse.psu.edu/hbvarHemoglobin variants and thalassemias[110]
HGMDhttp://www.hgmd.orgHuman Gene Mutation Database[111]
ICGChttp://icgc.orgInternational Cancer Genome Consortium[33]
IDbaseshttp://structure.bmc.lu.se/idbaseImmunodeficiency-causing variations[112]
LncRNADiseasehttp://cmbi.bjmu.edu.cn/lncrnadiseaselncRNA and disease database[113]
LOVDhttp://www.lovd.nlLeiden open (source) Variation Database[114]
MalaCardshttp://www.malacards.orgHuman maladies and their annotations[115]
MethHChttp://methhc.mbc.nctu.edu.twDatabase of DNA methylation and gene expression in human cancer[116]
MethyCancerhttp://methycancer.psych.ac.cnDatabase of human DNA Methylation and cancer[117]
miR2Diseasehttp://www.miR2Disease.orgDatabase for miRNA deregulation in human disease[118]
MITOMAPhttp://www.mitomap.org/MITOMAPPolymorphisms and mutations in human mitochondrial DNA[119]
NHGRI GWAS Cataloghttp://www.genome.gov/gwastudiesCurated resource of SNP-trait associations[120]
OMIMhttp://omim.orgOnline Mendelian Inheritance in Man[121]
T2D@ZJUhttp://tcm.zju.edu.cn/t2dConnections associated with type 2 diabetes[122]
TCGAhttp://cancergenome.nih.govThe Cancer Genome Atlas[32]
Universal Mutation Databasehttp://www.umd.be/Locus-specific database[123]
ViRBasehttp://www.rna-society.org/virbaseVirus–host ncRNA associated interactions[124]



GOhttp://geneontology.orgGene ontology[125]Standard and ontology
HGNChttp://www.genenames.orgDatabase of human gene names[126]



Europe PMChttp://europepmc.orgLiterature database in Europe[127]Literature
PubMedhttp://www.ncbi.nlm.nih.gov/pubmedDatabase of biomedical literature from MEDLINE[128]
PubMed Centralhttp://www.ncbi.nlm.nih.gov/pmcFree full-text literature archive[129]

Note: ∗This collection, however, by no means pictures the whole range of human-related databases that are currently available. Primary databases (DDBJ/EMBL/GenBank) are not shown, as they contain raw data.

#A database may correspond to multiple categories and only one major category is shown here.

Human-related biological databases∗ Note: ∗This collection, however, by no means pictures the whole range of human-related databases that are currently available. Primary databases (DDBJ/EMBL/GenBank) are not shown, as they contain raw data. #A database may correspond to multiple categories and only one major category is shown here.

DNA databases

A DNA database centers on managing DNA data from many or some specific species. The primary function of human DNA databases includes establishment of the reference genome (e.g., NCBI RefSeq [8]), profiling of human genetic variation (e.g., dbSNP [13]), association of genotype with phenotype (e.g., EGA [14]), and identification of human microbiome metagenomes (e.g., IMG/HMP [15]). A representative example of DNA database is GenBank [2], a collection of all publicly-available DNA sequences (http://www.ncbi.nlm.nih.gov/genbank). Since its inception in 1982, GenBank grows at an extraordinary pace and as of December 2014, contains over 184 billion nucleotide bases in more than 179 million sequences (http://www.ncbi.nlm.nih.gov/genbank/statistics).

RNA databases

It is well acknowledged that only a tiny proportion of the human genome is transcribed into mRNAs, whereas the vast majority of the genome is transcribed into “dark matter”—non-coding RNAs (ncRNAs) that do not encode proteins [16], including microRNAs (miRNAs), small nucleolar RNAs (snoRNAs), piwiRNAs (piRNAs), and long non-coding RNA (lncRNA). Therefore, an increasing number of human RNA databases have been built for deciphering ncRNAs (e.g., GENCODE [17]), in particular lncRNAs that attract the rising interest (e.g., LncRNAWiki [10]), and characterizing their functions and interactions (e.g., RNAcentral [18]). A representative example of RNA database is RNAcentral [18]. It provides unified access to the ncRNA sequence data supplied by multiple databases including Rfam [19], lncRNAdb [20], and miRBase [21] (http://rnacentral.org).

Protein databases

The purpose of constructing protein databases includes collection of universal proteins (e.g., UniProt [22]), identification of protein families and domains (e.g., Pfam [23]), reconstruction of phylogenetic trees (e.g., TreeFam [24]), and profiling of protein structures (e.g., PDB [25]). A representative example of protein database is PDB, the main primary database for 3D structures of biological macromolecules determined by X-ray crystallography and NMR. Established in 1971, PDB contains 105,465 biological macromolecular structures as of 30 December 2014, in which 27,393 entries belong to human (http://www.rcsb.org/pdb). Another example is the Universal Protein Resource (UniProt). As a collaborative project between EMBL-EBI, Swiss Institute of Bioinformatics (SIB), and Protein Information Resource (PIR), UniProt provides a comprehensive, high-quality, and freely-accessible resource of protein sequence and functional information. Currently, UniProt includes three member databases: UniProt Knowledgebase (UniProtKB), UniProt Reference Clusters (UniRef), and UniProt Archive (UniParc). In addition, UniProtKB consists of two sections: Swiss-Prot (containing a collection of 547,357 manually-annotated and -reviewed proteins as of January 2015) and TrEMBL (containing a collection of 89,451,166 un-reviewed proteins as of January 2015) (http://www.uniprot.org).

Expression databases

Expression databases can be used for various purposes, including archiving expression data (e.g., GEO [26]), detecting differential and baseline expression (e.g., Expression Atlas [27]), exploring tissue-specific gene expression and regulation (e.g., TiGER [28]), and profiling expression information based on both RNA and protein data (e.g., Human Protein Atlas [29]). A representative case of expression database is Human Protein Atlas. As of 30 December 2014, it encompasses expression profiles for a large majority of human protein-coding genes based on both RNA (transcriptome analysis based on 213 tissue and cell line samples) and protein data (proteome analysis based on 24,028 antibodies) (http://www.proteinatlas.org).

Pathway databases

Pathway databases contain biological pathways for metabolic, signaling, and regulatory pathway analysis. A representative example is KEGG PATHWAY [30], a curated biological pathway resource on the molecular interaction and reaction networks. As the core of KEGG, KEGG PATHWAY integrates many entities that are stored in KEGG sibling databases, including genes, proteins, RNAs, chemical compounds, and chemical reactions (http://www.genome.jp/kegg/pathway.html).

Disease databases

There are at least 200 forms of cancer in the world, causing 14.6% of all human deaths (http://en.wikipedia.org/wiki/Cancer). Thus, obtaining complete cancer genomes and identifying molecular mutations and abnormal genes can provide new insights for cancer prevention, detection, and eventually, personalized treatment [31]. Toward this end, there are two well-known cancer projects, viz., The Cancer Genome Atlas (TCGA) [32] and International Cancer Genome Consortium (ICGC) [33]. TCGA, founded in 2006 by the National Cancer Institute and National Human Genome Research Institute at the National Institutes of Health, aims to collect a wide diversity of omics data (including exome, SNP, mRNA, miRNA, and methylation) for more than 20 different types of human cancer (http://cancergenome.nih.gov). Unlike TCGA, ICGC is a voluntary collaborative organization initiated in 2008 and open to all cancer and genomic researchers in the world. It aims to obtain a comprehensive description of genomic, transcriptomic, and epigenomic changes in 50 different tumor types and/or subtypes, which are of clinical and societal importance across the globe (http://icgc.org).

Perspectives

Here we summarize a collection of biological databases relevant to human research. This collection, however, by no means pictures the whole range of human-related databases that are currently available. As primary databases store raw data, databases in this collection are most derivative databases, which are built from primary databases and contain curated information for different data types, and thus would be of great usefulness for studying the human genome. In the era of big data, human-related biological databases continue to grow not only in count but also in volume, posing unprecedented challenges in data storage, processing, exchange, and curation. From this point, it would be necessary to establish a cloud computing platform to store and process such big data and facilitate construction/update of a secondary or derivative database [34]. As biological databases are physically distributed and heterogeneous in data type and format, it is additionally required to build web open APIs to ease data exchange and sharing among different resources [35]. The last but not the least is curation, which becomes an indispensable part in biological databases, principally because curation involves added value by standardization and quality control and accordingly enhances data interoperability and consistency [36]. Taken together, biological databases hold great utilities for human research and can be regarded as an indicator of our potential to translate big data into big discovery. Considering the current situation in China when compared to other countries, it is our hope that this report may raise the general awareness, albeit better improved nowadays, of the significant role of human-related biological databases not only for academic studies but also for clinical applications.

Competing interests

The authors declared that there are no competing interests.
  128 in total

Review 1.  The cancer genome.

Authors:  Michael R Stratton; Peter J Campbell; P Andrew Futreal
Journal:  Nature       Date:  2009-04-09       Impact factor: 49.962

2.  The Human Gene Mutation Database (HGMD) and its exploitation in the fields of personalized genomics and molecular evolution.

Authors:  Peter D Stenson; Edward V Ball; Matthew Mort; Andrew D Phillips; Katy Shaw; David N Cooper
Journal:  Curr Protoc Bioinformatics       Date:  2012-09

3.  Ongoing and future developments at the Universal Protein Resource.

Authors: 
Journal:  Nucleic Acids Res       Date:  2010-11-04       Impact factor: 16.971

4.  MITOMAP: a human mitochondrial genome database--2004 update.

Authors:  Marty C Brandon; Marie T Lott; Kevin Cuong Nguyen; Syawal Spolim; Shamkant B Navathe; Pierre Baldi; Douglas C Wallace
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

5.  Europe PMC: a full-text literature database for the life sciences and platform for innovation.

Authors: 
Journal:  Nucleic Acids Res       Date:  2014-11-06       Impact factor: 16.971

6.  ViRBase: a resource for virus-host ncRNA-associated interactions.

Authors:  Yanhui Li; Changliang Wang; Zhengqiang Miao; Xiaoman Bi; Deng Wu; Nana Jin; Liqiang Wang; Hao Wu; Kun Qian; Chunhua Li; Ting Zhang; Chunrui Zhang; Ying Yi; Hongyan Lai; Yongfei Hu; Lixin Cheng; Kwong-Sak Leung; Xiaobo Li; Fengmin Zhang; Kongning Li; Xia Li; Dong Wang
Journal:  Nucleic Acids Res       Date:  2014-10-01       Impact factor: 16.971

7.  COSMIC: exploring the world's knowledge of somatic mutations in human cancer.

Authors:  Simon A Forbes; David Beare; Prasad Gunasekaran; Kenric Leung; Nidhi Bindal; Harry Boutselakis; Minjie Ding; Sally Bamford; Charlotte Cole; Sari Ward; Chai Yin Kok; Mingming Jia; Tisham De; Jon W Teague; Michael R Stratton; Ultan McDermott; Peter J Campbell
Journal:  Nucleic Acids Res       Date:  2014-10-29       Impact factor: 16.971

8.  PID: the Pathway Interaction Database.

Authors:  Carl F Schaefer; Kira Anthony; Shiva Krupa; Jeffrey Buchoff; Matthew Day; Timo Hannay; Kenneth H Buetow
Journal:  Nucleic Acids Res       Date:  2008-10-02       Impact factor: 16.971

9.  TiGER: a database for tissue-specific gene expression and regulation.

Authors:  Xiong Liu; Xueping Yu; Donald J Zack; Heng Zhu; Jiang Qian
Journal:  BMC Bioinformatics       Date:  2008-06-09       Impact factor: 3.169

10.  starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data.

Authors:  Jun-Hao Li; Shun Liu; Hui Zhou; Liang-Hu Qu; Jian-Hua Yang
Journal:  Nucleic Acids Res       Date:  2013-12-01       Impact factor: 16.971

View more
  22 in total

1.  Universal concept signature analysis: genome-wide quantification of new biological and pathological functions of genes and pathways.

Authors:  Xu Chi; Maureen A Sartor; Sanghoon Lee; Meenakshi Anurag; Snehal Patil; Pelle Hall; Matthew Wexler; Xiao-Song Wang
Journal:  Brief Bioinform       Date:  2020-09-25       Impact factor: 11.622

2.  Quantum-effective exact multiple patterns matching algorithms for biological sequences.

Authors:  Kapil Kumar Soni; Akhtar Rasool
Journal:  PeerJ Comput Sci       Date:  2022-05-12

3.  NeuroMuscleDB: a Database of Genes Associated with Muscle Development, Neuromuscular Diseases, Ageing, and Neurodegeneration.

Authors:  Mohammad Hassan Baig; Iliyas Rashid; Prachi Srivastava; Khurshid Ahmad; Arif Tasleem Jan; Gulam Rabbani; Dukhwan Choi; George E Barreto; Ghulam Md Ashraf; Eun Ju Lee; Inho Choi
Journal:  Mol Neurobiol       Date:  2019-01-25       Impact factor: 5.590

4.  On bioinformatic resources.

Authors:  Runsheng Chen
Journal:  Genomics Proteomics Bioinformatics       Date:  2015-03-02       Impact factor: 7.691

Review 5.  Omics databases on kidney disease: where they can be found and how to benefit from them.

Authors:  Theofilos Papadopoulos; Magdalena Krochmal; Katryna Cisek; Marco Fernandes; Holger Husi; Robert Stevens; Jean-Loup Bascands; Joost P Schanstra; Julie Klein
Journal:  Clin Kidney J       Date:  2016-03-21

6.  A Network-Based Data Integration Approach to Support Drug Repurposing and Multi-Target Therapies in Triple Negative Breast Cancer.

Authors:  Francesca Vitali; Laurie D Cohen; Andrea Demartini; Angela Amato; Vincenzo Eterno; Alberto Zambelli; Riccardo Bellazzi
Journal:  PLoS One       Date:  2016-09-15       Impact factor: 3.240

7.  Text Mining Genotype-Phenotype Relationships from Biomedical Literature for Database Curation and Precision Medicine.

Authors:  Ayush Singhal; Michael Simmons; Zhiyong Lu
Journal:  PLoS Comput Biol       Date:  2016-11-30       Impact factor: 4.475

8.  Establishment of a integrative multi-omics expression database CKDdb in the context of chronic kidney disease (CKD).

Authors:  Marco Fernandes; Holger Husi
Journal:  Sci Rep       Date:  2017-01-12       Impact factor: 4.379

9.  BioImg.org: A Catalog of Virtual Machine Images for the Life Sciences.

Authors:  Martin Dahlö; Frédéric Haziza; Aleksi Kallio; Eija Korpelainen; Erik Bongcam-Rudloff; Ola Spjuth
Journal:  Bioinform Biol Insights       Date:  2015-09-10

Review 10.  VKORC1 and VKORC1L1: Why do Vertebrates Have Two Vitamin K 2,3-Epoxide Reductases?

Authors:  Johannes Oldenburg; Matthias Watzka; Carville G Bevans
Journal:  Nutrients       Date:  2015-07-30       Impact factor: 5.717

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.