| Literature DB >> 30976793 |
Fábio Madeira1, Young Mi Park1, Joon Lee1, Nicola Buso1, Tamer Gur1, Nandana Madhusoodanan1, Prasad Basutkar1, Adrian R N Tivey1, Simon C Potter1, Robert D Finn1, Rodrigo Lopez1.
Abstract
The EMBL-EBI provides free access to popular bioinformatics sequence analysis applications as well as to a full-featured text search engine with powerful cross-referencing and data retrieval capabilities. Access to these services is provided via user-friendly web interfaces and via established RESTful and SOAP Web Services APIs (https://www.ebi.ac.uk/seqdb/confluence/display/JDSAT/EMBL-EBI+Web+Services+APIs+-+Data+Retrieval). Both systems have been developed with the same core principles that allow them to integrate an ever-increasing volume of biological data, making them an integral part of many popular data resources provided at the EMBL-EBI. Here, we describe the latest improvements made to the frameworks which enhance the interconnectivity between public EMBL-EBI resources and ultimately enhance biological data discoverability, accessibility, interoperability and reusability.Entities:
Year: 2019 PMID: 30976793 PMCID: PMC6602479 DOI: 10.1093/nar/gkz268
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.New visualization of cross-reference links in EBI Search.
New and updated bioinformatics tools available through Job Dispatcher in 2019. The OpenAPI user interface for these tools is available from: https://www.ebi.ac.uk/Tools/common/tools/help
| Category | Tools |
|---|---|
| Multiple Sequence Alignment ( | Clustal Omega, Kalign, MAFFT, MUSCLE, T-Coffee, MView and WebPRANK |
| Pairwise Sequence Alignment ( | Needle, Stretcher, Water, Matcher, LALIGN, and GeneWise |
| Phylogeny Analysis ( | Simple Phylogeny |
| Protein Functional Analysis ( | InterProScan 5, PfamScan, Phobius, Pratt, RADAR, HMMER3 phmmer and HMMER3 hmmscan |
| RNA Analysis ( | Infernal cmscan and MapMi |
| Sequence Format Conversion ( | Seqret and MView |
| Sequence Operation ( | Seqcksum |
| Sequence Similarity Search ( | NCBI BLAST+, PSI-BLAST, FASTA, SSEARCH, FASTM/S/F, GGSEARCH, GLSEARCH, PSI-Search and PSI-Search2 |
| Sequence Statistics ( | SAPS, Pepinfo, Pepstats, Pepwindow, Cpgplot, Newcpgreport, Isochore, Dotmatcher, Dottup, Dotpath and Polydot |
| Sequence Translation ( | Transeq, Sixpack, Backtranseq and Backtranambig |
New and Updated Data resources available through Job Dispatcher in 2019
| Category | Datasets |
|---|---|
| UniProtKB protein sequences | UniProtKB, SwissProt, SwissProt Isoforms, TrEMBL, UniProtKB Taxonomic Subsets (13 subgroups, including: bacteria, archaea, eukaryota, etc.), Reference Proteomes, Representative Proteomes (15, 35, 55, 75), UniProt Reference (UniRef 50, 90 and 100), UniParc, Unimes and UniProtKB-PDB |
| Patent protein sequences | EPO, JPO, KIPO, UPSPTO |
| Structures protein sequences | PDBe and PSI structure targets |
| Protein families | Pfam, TIGRFAM, Superfamily, Gene3D, PIRSF and TreeFam |
| Other protein sequences | Enzyme Portal, IntAct, IPD-IMGT/HLA, IPD-KIR, IPD-MHC, MEROPS (MP, MPEP and MPRO), ChEMBL and Quest for Orthologs |
| ENA nucleotide sequences | ENA sequence releases and updates for Coding, Non-coding, Barcode, Geospatial and others (10 subgroups, including: Expressed Sequence Tag, Genome Survey Sequence, etc.) |
| Ensembl Genomes sequences | Genomes from Bacteria, Fungi, Plants, Metazoa and Protists |
| Structures of nucleotide sequences | PDBe |
| Other nucleotide sequences | IMGT/LIGM-DB, IMGT/HLA (CDS and genomic), IPD-KIR (CDS and genomic) and IPD-MHC (CDS and genomic) |
Data resources available through EBI Search in 2019
| Category | Data resources |
|---|---|
| Genomes and metagenomes | Ensembl Genomes, Ensembl, HGNC, DGVa, EGA, LRG, WormBase ParaSite, MGnify |
| Nucleotide sequences | ENA, RNAcentral, Rfam, NRNL1, NRNL2, IMGT/HLA, IPD-KIR, IPD-MHC |
| Protein sequences | UniProtKB, UniParc, UniRef, EPO, JPO, KIPO, USPTO, NRPL1, NRPL2 |
| Macromolecular structures | PDBe, EMDB |
| Bioactive molecules | ChEBI, ChEMBL, Ligands |
| Gene expression | ArrayExpress, Expression Atlases, GEO, dbGaP |
| Molecular interactions | IntAct |
| Reactions, pathways | Rhea, Reactome, BioModels, MetaboLights, MetabolomeExpress, Metabolomics Workbench |
| Protein families | InterPro, TreeFam, Pfam, MEROPS, GPCRDB |
| Protein expression data | PRIDE, GNPS, GPMdb, MassIVE, PeptideAtlas, LINCS, Paxdb, jPOST |
| Enzymes | IntEnz, Enzyme Portal |
| Literature | Europe PMC, Patent families |
| Samples and ontologies | Taxonomy, GO, EFO, SBO, MESH, BioSamples, Identifiers.org registry, ORCID data claims, OLS, bio.tools |
| Diseases | OMIM, Human diseases |