Literature DB >> 33606252

Applications of Community Detection Algorithms to Large Biological Datasets.

Itamar Kanter1, Gur Yaari2, Tomer Kalisky3.   

Abstract

Recent advances in data acquiring technologies in biology have led to major challenges in mining relevant information from large datasets. For example, single-cell RNA sequencing technologies are producing expression and sequence information from tens of thousands of cells in every single experiment. A common task in analyzing biological data is to cluster samples or features (e.g., genes) into groups sharing common characteristics. This is an NP-hard problem for which numerous heuristic algorithms have been developed. However, in many cases, the clusters created by these algorithms do not reflect biological reality. To overcome this, a Networks Based Clustering (NBC) approach was recently proposed, by which the samples or genes in the dataset are first mapped to a network and then community detection (CD) algorithms are used to identify clusters of nodes.Here, we created an open and flexible python-based toolkit for NBC that enables easy and accessible network construction and community detection. We then tested the applicability of NBC for identifying clusters of cells or genes from previously published large-scale single-cell and bulk RNA-seq datasets.We show that NBC can be used to accurately and efficiently analyze large-scale datasets of RNA sequencing experiments.

Keywords:  Big data; Community detection; Networks based clustering; Single-cell RNA sequencing

Mesh:

Year:  2021        PMID: 33606252     DOI: 10.1007/978-1-0716-1103-6_3

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  27 in total

1.  Biology: The big challenges of big data.

Authors:  Vivien Marx
Journal:  Nature       Date:  2013-06-13       Impact factor: 49.962

Review 2.  Gene set enrichment analysis: performance evaluation and usage guidelines.

Authors:  Jui-Hung Hung; Tun-Hsiang Yang; Zhenjun Hu; Zhiping Weng; Charles DeLisi
Journal:  Brief Bioinform       Date:  2011-09-07       Impact factor: 11.622

3.  Assessing allele-specific expression across multiple tissues from RNA-seq read data.

Authors:  Matti Pirinen; Tuuli Lappalainen; Noah A Zaitlen; Emmanouil T Dermitzakis; Peter Donnelly; Mark I McCarthy; Manuel A Rivas
Journal:  Bioinformatics       Date:  2015-03-27       Impact factor: 6.937

4.  The landscape of genomic imprinting across diverse adult human tissues.

Authors:  Yael Baran; Meena Subramaniam; Anne Biton; Taru Tukiainen; Emily K Tsang; Manuel A Rivas; Matti Pirinen; Maria Gutierrez-Arcelus; Kevin S Smith; Kim R Kukurba; Rui Zhang; Celeste Eng; Dara G Torgerson; Cydney Urbanek; Jin Billy Li; Jose R Rodriguez-Santana; Esteban G Burchard; Max A Seibold; Daniel G MacArthur; Stephen B Montgomery; Noah A Zaitlen; Tuuli Lappalainen
Journal:  Genome Res       Date:  2015-05-07       Impact factor: 9.043

5.  A map of human genome variation from population-scale sequencing.

Authors:  Gonçalo R Abecasis; David Altshuler; Adam Auton; Lisa D Brooks; Richard M Durbin; Richard A Gibbs; Matt E Hurles; Gil A McVean
Journal:  Nature       Date:  2010-10-28       Impact factor: 49.962

6.  Massively parallel single-cell RNA-seq for marker-free decomposition of tissues into cell types.

Authors:  Diego Adhemar Jaitin; Ephraim Kenigsberg; Hadas Keren-Shaul; Naama Elefant; Franziska Paul; Irina Zaretsky; Alexander Mildner; Nadav Cohen; Steffen Jung; Amos Tanay; Ido Amit
Journal:  Science       Date:  2014-02-14       Impact factor: 47.728

7.  Big Data: Astronomical or Genomical?

Authors:  Zachary D Stephens; Skylar Y Lee; Faraz Faghri; Roy H Campbell; Chengxiang Zhai; Miles J Efron; Ravishankar Iyer; Michael C Schatz; Saurabh Sinha; Gene E Robinson
Journal:  PLoS Biol       Date:  2015-07-07       Impact factor: 8.029

8.  Pan-cancer network analysis identifies combinations of rare somatic mutations across pathways and protein complexes.

Authors:  Mark D M Leiserson; Fabio Vandin; Hsin-Ta Wu; Jason R Dobson; Jonathan V Eldridge; Jacob L Thomas; Alexandra Papoutsaki; Younhun Kim; Beifang Niu; Michael McLellan; Michael S Lawrence; Abel Gonzalez-Perez; David Tamborero; Yuwei Cheng; Gregory A Ryslik; Nuria Lopez-Bigas; Gad Getz; Li Ding; Benjamin J Raphael
Journal:  Nat Genet       Date:  2014-12-15       Impact factor: 38.330

9.  Transcriptome and genome sequencing uncovers functional variation in humans.

Authors:  Tuuli Lappalainen; Michael Sammeth; Marc R Friedländer; Peter A C 't Hoen; Jean Monlong; Manuel A Rivas; Mar Gonzàlez-Porta; Natalja Kurbatova; Thasso Griebel; Pedro G Ferreira; Matthias Barann; Thomas Wieland; Liliana Greger; Maarten van Iterson; Jonas Almlöf; Paolo Ribeca; Irina Pulyakhina; Daniela Esser; Thomas Giger; Andrew Tikhonov; Marc Sultan; Gabrielle Bertier; Daniel G MacArthur; Monkol Lek; Esther Lizano; Henk P J Buermans; Ismael Padioleau; Thomas Schwarzmayr; Olof Karlberg; Halit Ongen; Helena Kilpinen; Sergi Beltran; Marta Gut; Katja Kahlem; Vyacheslav Amstislavskiy; Oliver Stegle; Matti Pirinen; Stephen B Montgomery; Peter Donnelly; Mark I McCarthy; Paul Flicek; Tim M Strom; Hans Lehrach; Stefan Schreiber; Ralf Sudbrak; Angel Carracedo; Stylianos E Antonarakis; Robert Häsler; Ann-Christine Syvänen; Gert-Jan van Ommen; Alvis Brazma; Thomas Meitinger; Philip Rosenstiel; Roderic Guigó; Ivo G Gut; Xavier Estivill; Emmanouil T Dermitzakis
Journal:  Nature       Date:  2013-09-15       Impact factor: 49.962

10.  Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex.

Authors:  Alex A Pollen; Tomasz J Nowakowski; Joe Shuga; Xiaohui Wang; Anne A Leyrat; Jan H Lui; Nianzhen Li; Lukasz Szpankowski; Brian Fowler; Peilin Chen; Naveen Ramalingam; Gang Sun; Myo Thu; Michael Norris; Ronald Lebofsky; Dominique Toppani; Darnell W Kemp; Michael Wong; Barry Clerkson; Brittnee N Jones; Shiquan Wu; Lawrence Knutsson; Beatriz Alvarado; Jing Wang; Lesley S Weaver; Andrew P May; Robert C Jones; Marc A Unger; Arnold R Kriegstein; Jay A A West
Journal:  Nat Biotechnol       Date:  2014-08-03       Impact factor: 54.908

View more
  1 in total

Review 1.  Emerging landscape of molecular interaction networks:Opportunities, challenges and prospects.

Authors:  Gauri Panditrao; Rupa Bhowmick; Chandrakala Meena; Ram Rup Sarkar
Journal:  J Biosci       Date:  2022       Impact factor: 2.795

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.