
Swarm: A federated cloud framework for large-scale variant analysis.

Amir Bahmani, Kyle Ferriter, Vandhana Krishnan, Arash Alavi, Amir Alavi, Philip S Tsao, Michael P Snyder, Cuiping Pan.

Abstract

Genomic data analysis across multiple cloud platforms is an ongoing challenge, especially when large amounts of data are involved. Here, we present Swarm, a framework for federated computation that promotes minimal data motion and facilitates crosstalk between genomic datasets stored on various cloud platforms. We demonstrate its utility via common inquiries of genomic variants across BigQuery on Google Cloud Platform (GCP), Athena on Amazon Web Services (AWS), Apache Presto, and MySQL. Compared to single-cloud platforms, the Swarm framework significantly reduced computational costs, run-time delays, and the risks of security breaches and privacy violations.

Year:  2021        PMID: 33979321      PMCID: PMC8143397          DOI: 10.1371/journal.pcbi.1008977

Source DB:  PubMed          Journal:  PLoS Comput Biol        ISSN: 1553-734X            Impact factor:   4.475


  10 in total

1.  The HUGO Gene Nomenclature Committee (HGNC).

Authors:  S Povey; R Lovering; E Bruford; M Wright; M Lush; H Wain
Journal:  Hum Genet       Date:  2001-10-24       Impact factor: 4.132

2.  On the future of genomic data.

Authors:  Scott D Kahn
Journal:  Science       Date:  2011-02-11       Impact factor: 47.728

3.  Cloud computing for genomic data analysis and collaboration. (Review)

Authors:  Ben Langmead; Abhinav Nellore
Journal:  Nat Rev Genet       Date:  2018-01-30       Impact factor: 53.242

4.  Cloud-based interactive analytics for terabytes of genomic variants data.

Authors:  Cuiping Pan; Gregory McInnes; Nicole Deflaux; Michael Snyder; Jonathan Bingham; Somalee Datta; Philip S Tsao
Journal:  Bioinformatics       Date:  2017-12-01       Impact factor: 6.937

5.  Tutorial: a guide to performing polygenic risk score analyses. (Review)

Authors:  Shing Wan Choi; Timothy Shin-Heng Mak; Paul F O'Reilly
Journal:  Nat Protoc       Date:  2020-07-24       Impact factor: 13.491

6.  Cloud computing for comparative genomics.

Authors:  Dennis P Wall; Parul Kudtarkar; Vincent A Fusaro; Rimma Pivovarov; Prasad Patil; Peter J Tonellato
Journal:  BMC Bioinformatics       Date:  2010-05-18       Impact factor: 3.169

7.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

8.  Privacy Risks from Genomic Data-Sharing Beacons.

Authors:  Suyash S Shringarpure; Carlos D Bustamante
Journal:  Am J Hum Genet       Date:  2015-10-29       Impact factor: 11.025

9.  Expanded encyclopaedias of DNA elements in the human and mouse genomes.

Authors:  Jill E Moore; Michael J Purcaro; Henry E Pratt; Charles B Epstein; Noam Shoresh; Jessika Adrian; Trupti Kawli; Carrie A Davis; Alexander Dobin; Rajinder Kaul; Jessica Halow; Eric L Van Nostrand; Peter Freese; David U Gorkin; Yin Shen; Yupeng He; Mark Mackiewicz; Florencia Pauli-Behn; Brian A Williams; Ali Mortazavi; Cheryl A Keller; Xiao-Ou Zhang; Shaimae I Elhajjajy; Jack Huey; Diane E Dickel; Valentina Snetkova; Xintao Wei; Xiaofeng Wang; Juan Carlos Rivera-Mulia; Joel Rozowsky; Jing Zhang; Surya B Chhetri; Jialing Zhang; Alec Victorsen; Kevin P White; Axel Visel; Gene W Yeo; Christopher B Burge; Eric Lécuyer; David M Gilbert; Job Dekker; John Rinn; Eric M Mendenhall; Joseph R Ecker; Manolis Kellis; Robert J Klein; William S Noble; Anshul Kundaje; Roderic Guigó; Peggy J Farnham; J Michael Cherry; Richard M Myers; Bing Ren; Brenton R Graveley; Mark B Gerstein; Len A Pennacchio; Michael P Snyder; Bradley E Bernstein; Barbara Wold; Ross C Hardison; Thomas R Gingeras; John A Stamatoyannopoulos; Zhiping Weng
Journal:  Nature       Date:  2020-07-29       Impact factor: 69.504

10.  The mutational constraint spectrum quantified from variation in 141,456 humans.

Authors:  Konrad J Karczewski; Laurent C Francioli; Grace Tiao; Beryl B Cummings; Jessica Alföldi; Qingbo Wang; Ryan L Collins; Kristen M Laricchia; Andrea Ganna; Daniel P Birnbaum; Laura D Gauthier; Harrison Brand; Matthew Solomonson; Nicholas A Watts; Daniel Rhodes; Moriel Singer-Berk; Eleina M England; Eleanor G Seaby; Jack A Kosmicki; Raymond K Walters; Katherine Tashman; Yossi Farjoun; Eric Banks; Timothy Poterba; Arcturus Wang; Cotton Seed; Nicola Whiffin; Jessica X Chong; Kaitlin E Samocha; Emma Pierce-Hoffman; Zachary Zappala; Anne H O'Donnell-Luria; Eric Vallabh Minikel; Ben Weisburd; Monkol Lek; James S Ware; Christopher Vittal; Irina M Armean; Louis Bergelson; Kristian Cibulskis; Kristen M Connolly; Miguel Covarrubias; Stacey Donnelly; Steven Ferriera; Stacey Gabriel; Jeff Gentry; Namrata Gupta; Thibault Jeandet; Diane Kaplan; Christopher Llanwarne; Ruchi Munshi; Sam Novod; Nikelle Petrillo; David Roazen; Valentin Ruano-Rubio; Andrea Saltzman; Molly Schleicher; Jose Soto; Kathleen Tibbetts; Charlotte Tolonen; Gordon Wade; Michael E Talkowski; Benjamin M Neale; Mark J Daly; Daniel G MacArthur
Journal:  Nature       Date:  2020-05-27       Impact factor: 69.504
