Characterizing regions in the human genome unmappable by next-generation-sequencing at the read length of 1000 bases.

Wentian Li, Jan Freudenberg.

Abstract

Repetitive and redundant regions of a genome are particularly problematic for mapping sequencing reads. In the present paper, we compile a list of the unmappable regions in the human genome, defined as follows: a region is unmappable if hypothetical 1 kb reads originating from it cannot be uniquely mapped by zero-mismatch alignment, considering both the forward and reverse strands. The resulting collection of unmappable regions covers 0.77% of the sequence of human autosomes and 8.25% of the sex chromosomes in the reference genome GRCh37/hg19 (1.23% overall). Not surprisingly, our unmappable regions overlap greatly with segmental duplications, transposable elements, and structural variants. About 99.8% of bases in our unmappable regions are part of either segmental duplications or transposable elements, and 98.3% overlap structural variant annotations. Notably, some of these regions overlap units with important biological functions, including 4% of protein-coding genes. In contrast, these regions have zero intersection with the ultraconserved elements and very low overlap with microRNAs, tRNAs, pseudogenes, CpG islands, tandem repeats, microsatellites, sensitive non-coding regions, and the mapping blacklist regions from the ENCODE project.
Copyright © 2014 Elsevier Ltd. All rights reserved.
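
The definition in the abstract (a read is unmappable if its exact sequence also matches somewhere else on either strand) can be illustrated on a toy sequence. The following is only a minimal sketch under stated assumptions, not the authors' pipeline: the function names `reverse_complement` and `unmappable_starts` are hypothetical, the input is a short in-memory string rather than GRCh37/hg19, and the read length is a parameter (the paper uses 1000 bases). A read that is its own reverse complement is assumed here to count as a single location.

```python
from collections import Counter

# Translation table for complementing DNA bases (uppercase A/C/G/T only).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def unmappable_starts(genome, read_len):
    """Return start positions of reads that do not map uniquely.

    A read of length `read_len` starting at position i is reported when its
    exact sequence (zero mismatches) also matches at another position, on
    either the forward or the reverse strand.
    """
    n = len(genome) - read_len + 1
    reads = [genome[i:i + read_len] for i in range(n)]
    forward_counts = Counter(reads)

    result = []
    for i, read in enumerate(reads):
        rc = reverse_complement(read)
        # Forward-strand matches, plus reverse-strand matches (places where
        # the reverse complement of the read appears on the forward strand).
        # Assumption: a palindromic read (rc == read) is one location.
        total = forward_counts[read]
        if rc != read:
            total += forward_counts[rc]
        if total > 1:
            result.append(i)
    return result


if __name__ == "__main__":
    # "GATTA" and "ATTAC" each occur twice on the forward strand.
    print(unmappable_starts("GATTACAGATTACC", 5))
    # "ACCG" at position 0 matches "CGGT" at position 8 on the reverse strand.
    print(unmappable_starts("ACCGTTTTCGGT", 4))
```

Storing every read in memory is feasible only for toy inputs; a genome-scale analysis would need an indexed or hashed representation, which this sketch does not attempt.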

Year: 2014    PMID: 25241312    DOI: 10.1016/j.compbiolchem.2014.08.015

Source DB: PubMed    Journal: Comput Biol Chem    ISSN: 1476-9271    Impact factor: 2.877


  4 in total

1.  Mappability and read length. (Review)

Authors:  Wentian Li; Jan Freudenberg
Journal:  Front Genet       Date:  2014-11-10       Impact factor: 4.599

2.  The ENCODE Blacklist: Identification of Problematic Regions of the Genome.

Authors:  Haley M Amemiya; Anshul Kundaje; Alan P Boyle
Journal:  Sci Rep       Date:  2019-06-27       Impact factor: 4.379

3.  Radiation Necrosis with Proton Therapy in a Patient with Aarskog-Scott Syndrome and Medulloblastoma.

Authors:  Vidya Puthenpura; Nicholas J DeNunzio; Xue Zeng; Drosoula Giantsoudi; Mariam Aboian; David Ebb; Kristopher T Kahle; Torunn I Yock; Asher M Marks
Journal:  Int J Part Ther       Date:  2021-07-29

4.  Next-generation sequencing to guide cancer therapy. (Review)

Authors:  Jeffrey Gagan; Eliezer M Van Allen
Journal:  Genome Med       Date:  2015-07-29       Impact factor: 11.117
