Literature DB >> 29741956

A Secure Alignment Algorithm for Mapping Short Reads to Human Genome.

Yongan Zhao1, Xiaofeng Wang1, Haixu Tang1.   

Abstract

The elastic and inexpensive computing resources such as clouds have been recognized as a useful solution to analyzing massive human genomic data (e.g., acquired by using next-generation sequencers) in biomedical researches. However, outsourcing human genome computation to public or commercial clouds was hindered due to privacy concerns: even a small number of human genome sequences contain sufficient information for identifying the donor of the genomic data. This issue cannot be directly addressed by existing security and cryptographic techniques (such as homomorphic encryption), because they are too heavyweight to carry out practical genome computation tasks on massive data. In this article, we present a secure algorithm to accomplish the read mapping, one of the most basic tasks in human genomic data analysis based on a hybrid cloud computing model. Comparing with the existing approaches, our algorithm delegates most computation to the public cloud, while only performing encryption and decryption on the private cloud, and thus makes the maximum use of the computing resource of the public cloud. Furthermore, our algorithm reports similar results as the nonsecure read mapping algorithms, including the alignment between reads and the reference genome, which can be directly used in the downstream analysis such as the inference of genomic variations. We implemented the algorithm in C++ and Python on a hybrid cloud system, in which the public cloud uses an Apache Spark system.

Entities:  

Keywords:  genome privacy; privacy-preserving techniques; read mapping

Mesh:

Year:  2018        PMID: 29741956      PMCID: PMC5998833          DOI: 10.1089/cmb.2017.0094

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  18 in total

Review 1.  A survey of sequence alignment algorithms for next-generation sequencing.

Authors:  Heng Li; Nils Homer
Journal:  Brief Bioinform       Date:  2010-05-11       Impact factor: 11.622

2.  Quantification of private information leakage from phenotype-genotype data: linking attacks.

Authors:  Arif Harmanci; Mark Gerstein
Journal:  Nat Methods       Date:  2016-02-01       Impact factor: 28.547

3.  Drafting the Genetic Privacy Act: science, policy, and practical considerations.

Authors:  G J Annas; L H Glantz; P A Roche
Journal:  J Law Med Ethics       Date:  1995       Impact factor: 1.718

4.  A new initiative on precision medicine.

Authors:  Francis S Collins; Harold Varmus
Journal:  N Engl J Med       Date:  2015-01-30       Impact factor: 91.245

5.  The case for cloud computing in genome informatics.

Authors:  Lincoln D Stein
Journal:  Genome Biol       Date:  2010-05-05       Impact factor: 13.583

6.  Is the $1000 Genome as Near as We Think? A Cost Analysis of Next-Generation Sequencing.

Authors:  Kirsten J M van Nimwegen; Ronald A van Soest; Joris A Veltman; Marcel R Nelen; Gert Jan van der Wilt; Lisenka E L M Vissers; Janneke P C Grutters
Journal:  Clin Chem       Date:  2016-09-14       Impact factor: 8.327

Review 7.  Forensic DNA Phenotyping: Predicting human appearance from crime scene material for investigative purposes.

Authors:  Manfred Kayser
Journal:  Forensic Sci Int Genet       Date:  2015-02-16       Impact factor: 4.882

8.  A community assessment of privacy preserving techniques for human genomes.

Authors:  Xiaoqian Jiang; Yongan Zhao; Xiaofeng Wang; Bradley Malin; Shuang Wang; Lucila Ohno-Machado; Haixu Tang
Journal:  BMC Med Inform Decis Mak       Date:  2014-12-08       Impact factor: 2.796

9.  Resolving individuals contributing trace amounts of DNA to highly complex mixtures using high-density SNP genotyping microarrays.

Authors:  Nils Homer; Szabolcs Szelinger; Margot Redman; David Duggan; Waibhav Tembe; Jill Muehling; John V Pearson; Dietrich A Stephan; Stanley F Nelson; David W Craig
Journal:  PLoS Genet       Date:  2008-08-29       Impact factor: 5.917

10.  Privacy Risks from Genomic Data-Sharing Beacons.

Authors:  Suyash S Shringarpure; Carlos D Bustamante
Journal:  Am J Hum Genet       Date:  2015-10-29       Impact factor: 11.025

View more
  1 in total

1.  Next-generation sequencing for constitutional variants in the clinical laboratory, 2021 revision: a technical standard of the American College of Medical Genetics and Genomics (ACMG).

Authors:  Catherine Rehder; Lora J H Bean; David Bick; Elizabeth Chao; Wendy Chung; Soma Das; Julianne O'Daniel; Heidi Rehm; Vandana Shashi; Lisa M Vincent
Journal:  Genet Med       Date:  2021-04-29       Impact factor: 8.822

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.