Literature DB >> 35108357

MetaLogo: a heterogeneity-aware sequence logo generator and aligner.

Yaowen Chen1, Zhen He1, Yahui Men1, Guohua Dong1, Shuofeng Hu1, Xiaomin Ying1.   

Abstract

Sequence logos are used to visually display conservations and variations in short sequences. They can indicate the fixed patterns or conserved motifs in a batch of DNA or protein sequences. However, most of the popular sequence logo generators are based on the assumption that all the input sequences are from the same homologous group, which will lead to an overlook of the heterogeneity among the sequences during the sequence logo making process. Heterogeneous groups of sequences may represent clades of different evolutionary origins, or genes families with different functions. Therefore, it is essential to divide the sequences into different phylogenetic or functional groups to reveal their specific sequence motifs and conservation patterns. To solve these problems, we developed MetaLogo, which can automatically cluster the input sequences after multiple sequence alignment and phylogenetic tree construction, and then output sequence logos for multiple groups and aligned them in one figure. User-defined grouping is also supported by MetaLogo to allow users to investigate functional motifs in a more delicate and dynamic perspective. MetaLogo can highlight both the homologous and nonhomologous sites among sequences. MetaLogo can also be used to annotate the evolutionary positions and gene functions of unknown sequences, together with their local sequence characteristics. We provide users a public MetaLogo web server (http://metalogo.omicsnet.org), a standalone Python package (https://github.com/labomics/MetaLogo), and also a built-in web server available for local deployment. Using MetaLogo, users can draw informative, customized and publishable sequence logos without any programming experience to present and investigate new knowledge on specific sequence sets.
© The Author(s) 2022. Published by Oxford University Press.

Entities:  

Keywords:  MetaLogo; logo alignment; multiple sequence logos; phylogenetic tree; web server

Mesh:

Year:  2022        PMID: 35108357      PMCID: PMC8921662          DOI: 10.1093/bib/bbab591

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  18 in total

1.  Observed Antibody Space: A Resource for Data Mining Next-Generation Sequencing of Antibody Repertoires.

Authors:  Aleksandr Kovaltsuk; Jinwoo Leem; Sebastian Kelm; James Snowden; Charlotte M Deane; Konrad Krawczyk
Journal:  J Immunol       Date:  2018-09-14       Impact factor: 5.422

2.  TreeCluster: Clustering biological sequences using phylogenetic trees.

Authors:  Metin Balaban; Niema Moshiri; Uyen Mai; Xingfan Jia; Siavash Mirarab
Journal:  PLoS One       Date:  2019-08-22       Impact factor: 3.240

3.  A comprehensive library of DNA-binding site matrices for 55 proteins applied to the complete Escherichia coli K-12 genome.

Authors:  K Robison; A M McGuire; G M Church
Journal:  J Mol Biol       Date:  1998-11-27       Impact factor: 5.469

4.  ggseqlogo: a versatile R package for drawing sequence logos.

Authors:  Omar Wagih
Journal:  Bioinformatics       Date:  2017-11-15       Impact factor: 6.937

5.  Dynamics of B cell repertoires and emergence of cross-reactive responses in patients with different severities of COVID-19.

Authors:  Zachary Montague; Huibin Lv; Jakub Otwinowski; William S DeWitt; Giulio Isacchini; Garrick K Yip; Wilson W Ng; Owen Tak-Yin Tsang; Meng Yuan; Hejun Liu; Ian A Wilson; J S Malik Peiris; Nicholas C Wu; Armita Nourmohammad; Chris Ka Pun Mok
Journal:  Cell Rep       Date:  2021-05-09       Impact factor: 9.423

6.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Authors:  Fabian Sievers; Andreas Wilm; David Dineen; Toby J Gibson; Kevin Karplus; Weizhong Li; Rodrigo Lopez; Hamish McWilliam; Michael Remmert; Johannes Söding; Julie D Thompson; Desmond G Higgins
Journal:  Mol Syst Biol       Date:  2011-10-11       Impact factor: 11.429

7.  Seq2Logo: a method for construction and visualization of amino acid binding motifs and sequence profiles including sequence weighting, pseudo counts and two-sided representation of amino acid enrichment and depletion.

Authors:  Martin Christen Frølund Thomsen; Morten Nielsen
Journal:  Nucleic Acids Res       Date:  2012-05-25       Impact factor: 16.971

8.  ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules.

Authors:  Haim Ashkenazy; Shiran Abadi; Eric Martz; Ofer Chay; Itay Mayrose; Tal Pupko; Nir Ben-Tal
Journal:  Nucleic Acids Res       Date:  2016-05-10       Impact factor: 16.971

9.  Deep Sequencing of B Cell Receptor Repertoires From COVID-19 Patients Reveals Strong Convergent Immune Signatures.

Authors:  Jacob D Galson; Sebastian Schaetzle; Rachael J M Bashford-Rogers; Matthew I J Raybould; Aleksandr Kovaltsuk; Gavin J Kilpatrick; Ralph Minter; Donna K Finch; Jorge Dias; Louisa K James; Gavin Thomas; Wing-Yiu Jason Lee; Jason Betley; Olivia Cavlan; Alex Leech; Charlotte M Deane; Joan Seoane; Carlos Caldas; Daniel J Pennington; Paul Pfeffer; Jane Osbourn
Journal:  Front Immunol       Date:  2020-12-15       Impact factor: 7.561

10.  Logomaker: beautiful sequence logos in Python.

Authors:  Ammar Tareen; Justin B Kinney
Journal:  Bioinformatics       Date:  2019-12-10       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.