Literature DB >> 31828235

Alignment-Free Sequence Analysis and Applications.

Jie Ren1, Xin Bai1,2, Yang Young Lu1, Kujin Tang1, Ying Wang3, Gesine Reinert4, Fengzhu Sun1,2.   

Abstract

Genome and metagenome comparisons based on large amounts of next generation sequencing (NGS) data pose significant challenges for alignment-based approaches due to the huge data size and the relatively short length of the reads. Alignment-free approaches based on the counts of word patterns in NGS data do not depend on the complete genome and are generally computationally efficient. Thus, they contribute significantly to genome and metagenome comparison. Recently, novel statistical approaches have been developed for the comparison of both long and shotgun sequences. These approaches have been applied to many problems including the comparison of gene regulatory regions, genome sequences, metagenomes, binning contigs in metagenomic data, identification of virus-host interactions, and detection of horizontal gene transfers. We provide an updated review of these applications and other related developments of word-count based approaches for alignment-free sequence analysis.

Entities:  

Keywords:  Markov chain; alignment; alignment-free; horizontal gene transfer; metagenomics; phylogeny; sequence comparison; virus-host interaction

Year:  2018        PMID: 31828235      PMCID: PMC6905628          DOI: 10.1146/annurev-biodatasci-080917-013431

Source DB:  PubMed          Journal:  Annu Rev Biomed Data Sci        ISSN: 2574-3414


  18 in total

1.  Inferring Phylogenomic Relationship of Microbes Using Scalable Alignment-Free Methods.

Authors:  Guillaume Bernard; Timothy G Stephens; Raúl A González-Pech; Cheong Xin Chan
Journal:  Methods Mol Biol       Date:  2021

2.  Sequence Comparison Without Alignment: The SpaM Approaches.

Authors:  Burkhard Morgenstern
Journal:  Methods Mol Biol       Date:  2021

3.  Specificity Analysis of Genome Based on Statistically Identical K-Words With Same Base Combination.

Authors:  Hyein Seo; Yong-Joon Song; Kiho Cho; Dong-Ho Cho
Journal:  IEEE Open J Eng Med Biol       Date:  2020-07-14

4.  Read-SpaM: assembly-free and alignment-free comparison of bacterial genomes with low sequencing coverage.

Authors:  Anna-Katharina Lau; Svenja Dörrer; Chris-André Leimeister; Christoph Bleidorn; Burkhard Morgenstern
Journal:  BMC Bioinformatics       Date:  2019-12-17       Impact factor: 3.169

Review 5.  Forest and Trees: Exploring Bacterial Virulence with Genome-wide Association Studies and Machine Learning.

Authors:  Jonathan P Allen; Evan Snitkin; Nathan B Pincus; Alan R Hauser
Journal:  Trends Microbiol       Date:  2021-01-14       Impact factor: 18.230

6.  CRAFT: Compact genome Representation toward large-scale Alignment-Free daTabase.

Authors:  Yang Young Lu; Jiaxing Bai; Yiwen Wang; Ying Wang; Fengzhu Sun
Journal:  Bioinformatics       Date:  2021-04-19       Impact factor: 6.931

Review 7.  Metagenomics: a path to understanding the gut microbiome.

Authors:  Sandi Yen; Jethro S Johnson
Journal:  Mamm Genome       Date:  2021-07-14       Impact factor: 2.957

8.  Large-scale network analysis captures biological features of bacterial plasmids.

Authors:  Mislav Acman; Lucy van Dorp; Joanne M Santini; Francois Balloux
Journal:  Nat Commun       Date:  2020-05-15       Impact factor: 14.919

9.  The number of k-mer matches between two DNA sequences as a function of k and applications to estimate phylogenetic distances.

Authors:  Sophie Röhling; Alexander Linne; Jendrik Schellhorn; Morteza Hosseini; Thomas Dencker; Burkhard Morgenstern
Journal:  PLoS One       Date:  2020-02-10       Impact factor: 3.240

Review 10.  Computer-Assisted and Data Driven Approaches for Surveillance, Drug Discovery, and Vaccine Design for the Zika Virus.

Authors:  Subhash C Basak; Subhabrata Majumdar; Ashesh Nandy; Proyasha Roy; Tathagata Dutta; Marjan Vracko; Apurba K Bhattacharjee
Journal:  Pharmaceuticals (Basel)       Date:  2019-10-16
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.