Literature DB >> 31062021

MAFFT-DASH: integrated protein sequence and structural alignment.

John Rozewicki1,2, Songling Li1,2, Karlou Mar Amada2, Daron M Standley1,2, Kazutaka Katoh1,2.   

Abstract

Here, we describe a web server that integrates structural alignments with the MAFFT multiple sequence alignment (MSA) tool. For this purpose, we have prepared a web-based Database of Aligned Structural Homologs (DASH), which provides structural alignments at the domain and chain levels for all proteins in the Protein Data Bank (PDB), and can be queried interactively or by a simple REST-like API. MAFFT-DASH integration can be invoked with a single flag on either the web (https://mafft.cbrc.jp/alignment/server/) or command-line versions of MAFFT. In our benchmarks using 878 cases from the BAliBase, HomFam, OXFam, Mattbench and SISYPHUS datasets, MAFFT-DASH showed 10-20% improvement over standard MAFFT for MSA problems with weak similarity, in terms of Sum-of-Pairs (SP), a measure of how well a program succeeds at aligning input sequences in comparison to a reference alignment. When MAFFT alignments were supplemented with homologous sequences, further improvement was observed. Potential applications of DASH beyond MSA enrichment include functional annotation through detection of remote homology and assembly of template libraries for homology modeling.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31062021      PMCID: PMC6602451          DOI: 10.1093/nar/gkz342

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  31 in total

1.  BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations.

Authors:  A Bahr; J D Thompson; J C Thierry; O Poch
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  PDP: protein domain parser.

Authors:  Nickolai Alexandrov; Ilya Shindyalov
Journal:  Bioinformatics       Date:  2003-02-12       Impact factor: 6.937

3.  3DCoffee: combining protein sequences and structures within multiple sequence alignments.

Authors:  Orla O'Sullivan; Karsten Suhre; Chantal Abergel; Desmond G Higgins; Cédric Notredame
Journal:  J Mol Biol       Date:  2004-07-02       Impact factor: 5.469

4.  Detecting local structural similarity in proteins by maximizing number of equivalent residues.

Authors:  Daron M Standley; Hiroyuki Toh; Haruki Nakamura
Journal:  Proteins       Date:  2004-11-01

5.  The iRMSD: a local measure of sequence alignment accuracy using structural information.

Authors:  Fabrice Armougom; Sébastien Moretti; Vladimir Keduas; Cedric Notredame
Journal:  Bioinformatics       Date:  2006-07-15       Impact factor: 6.937

6.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

7.  Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee.

Authors:  Fabrice Armougom; Sébastien Moretti; Olivier Poirot; Stéphane Audic; Pierre Dumas; Basile Schaeli; Vladimir Keduas; Cedric Notredame
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

8.  ADDA: a domain database with global coverage of the protein universe.

Authors:  Andreas Heger; Christopher Andrew Wilton; Ashwin Sivakumar; Liisa Holm
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

9.  MAFFT version 5: improvement in accuracy of multiple sequence alignment.

Authors:  Kazutaka Katoh; Kei-ichi Kuma; Hiroyuki Toh; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2005-01-20       Impact factor: 16.971

10.  OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy.

Authors:  G P S Raghava; Stephen M J Searle; Patrick C Audley; Jonathan D Barber; Geoffrey J Barton
Journal:  BMC Bioinformatics       Date:  2003-10-10       Impact factor: 3.169

View more
  139 in total

1.  Sequencing of Complete Chloroplast Genomes.

Authors:  Berthold Heinze
Journal:  Methods Mol Biol       Date:  2021

2.  Analysis of Protein Intermolecular Interactions with MAFFT-DASH.

Authors:  John Rozewicki; Songling Li; Kazutaka Katoh; Daron M Standley
Journal:  Methods Mol Biol       Date:  2021

3.  Mustguseal and Sister Web-Methods: A Practical Guide to Bioinformatic Analysis of Protein Superfamilies.

Authors:  Dmitry Suplatov; Yana Sharapova; Vytas Švedas
Journal:  Methods Mol Biol       Date:  2021

4.  Allosteric regulation of menaquinone (vitamin K2) biosynthesis in the human pathogen Mycobacterium tuberculosis.

Authors:  Ghader Bashiri; Laura V Nigon; Ehab N M Jirgis; Ngoc Anh Thu Ho; Tamsyn Stanborough; Stephanie S Dawes; Edward N Baker; Esther M M Bulloch; Jodie M Johnston
Journal:  J Biol Chem       Date:  2020-02-06       Impact factor: 5.157

5.  Wolbachia Endosymbiont of the Horn Fly (Haematobia irritans irritans): a Supergroup A Strain with Multiple Horizontally Acquired Cytoplasmic Incompatibility Genes.

Authors:  Mukund Madhav; Rhys Parry; Jess A T Morgan; Peter James; Sassan Asgari
Journal:  Appl Environ Microbiol       Date:  2020-03-02       Impact factor: 4.792

6.  Defining clinical subgroups and genotype-phenotype correlations in NBAS-associated disease across 110 patients.

Authors:  Christian Staufner; Bianca Peters; Matias Wagner; Seham Alameer; Ivo Barić; Pierre Broué; Derya Bulut; Joseph A Church; Ellen Crushell; Buket Dalgıç; Anibh M Das; Anke Dick; Nicola Dikow; Carlo Dionisi-Vici; Felix Distelmaier; Neslihan Ekşi Bozbulut; François Feillet; Emmanuel Gonzales; Nedim Hadzic; Fabian Hauck; Robert Hegarty; Maja Hempel; Theresia Herget; Christoph Klein; Vassiliki Konstantopoulou; Robert Kopajtich; Alice Kuster; Martin W Laass; Elke Lainka; Catherine Larson-Nath; Alexander Leibner; Eberhard Lurz; Johannes A Mayr; Patrick McKiernan; Karine Mention; Ute Moog; Neslihan Onenli Mungan; Korbinian M Riedhammer; René Santer; Irene Valenzuela Palafoll; Jerry Vockley; Dominik S Westphal; Arnaud Wiedemann; Saskia B Wortmann; Gaurav D Diwan; Robert B Russell; Holger Prokisch; Sven F Garbade; Stefan Kölker; Georg F Hoffmann; Dominic Lenz
Journal:  Genet Med       Date:  2019-11-25       Impact factor: 8.822

7.  Transposable elements expression in Rhinella marina (cane toad) specimens submitted to immune and stress challenge.

Authors:  Adriana Ludwig; Michelle Orane Schemberger; Camilla Borges Gazolla; Joana de Moura Gama; Iraine Duarte; Ana Luisa Kalb Lopes; Carolina Mathias; Desirrê Alexia Lourenço Petters-Vandresen; Michelle Louise Zattera; Daniel Pacheco Bruschi
Journal:  Genetica       Date:  2021-08-12       Impact factor: 1.082

8.  Cultivation of a Lytic Double-Stranded RNA Bacteriophage Infecting Microvirgula aerodenitrificans Reveals a Mutualistic Parasitic Lifestyle.

Authors:  Xiaoyao Cai; Fengjuan Tian; Li Teng; Hongmei Liu; Yigang Tong; Shuai Le; Tingting Zhang
Journal:  J Virol       Date:  2021-08-10       Impact factor: 5.103

9.  Assembly of the complete mitochondrial genome of an endemic plant, Scutellaria tsinyunensis, revealed the existence of two conformations generated by a repeat-mediated recombination.

Authors:  Jingling Li; Yicen Xu; Yuanyu Shan; Xiaoying Pei; Shunyuan Yong; Chang Liu; Jie Yu
Journal:  Planta       Date:  2021-07-24       Impact factor: 4.116

10.  Insight into early diversification of leucine-rich repeat receptor-like kinases provided by the sequenced moss and hornwort genomes.

Authors:  Chihiro Furumizu; Shinichiro Sawa
Journal:  Plant Mol Biol       Date:  2021-01-03       Impact factor: 4.076

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.