Literature DB >> 20037582

Nucleotide-resolution analysis of structural variants using BreakSeq and a breakpoint library.

Hugo Y K Lam1, Xinmeng Jasmine Mu, Adrian M Stütz, Andrea Tanzer, Philip D Cayting, Michael Snyder, Philip M Kim, Jan O Korbel, Mark B Gerstein.   

Abstract

Structural variants (SVs) are a major source of human genomic variation; however, characterizing them at nucleotide resolution remains challenging. Here we assemble a library of breakpoints at nucleotide resolution from collating and standardizing ~2,000 published SVs. For each breakpoint, we infer its ancestral state (through comparison to primate genomes) and its mechanism of formation (e.g., nonallelic homologous recombination, NAHR). We characterize breakpoint sequences with respect to genomic landmarks, chromosomal location, sequence motifs and physical properties, finding that the occurrence of insertions and deletions is more balanced than previously reported and that NAHR-formed breakpoints are associated with relatively rigid, stable DNA helices. Finally, we demonstrate an approach, BreakSeq, for scanning the reads from short-read sequenced genomes against our breakpoint library to accurately identify previously overlooked SVs, which we then validate by PCR. As new data become available, we expect our BreakSeq approach will become more sensitive and facilitate rapid SV genotyping of personal genomes.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 20037582      PMCID: PMC2951730          DOI: 10.1038/nbt.1600

Source DB:  PubMed          Journal:  Nat Biotechnol        ISSN: 1087-0156            Impact factor:   54.908


  49 in total

1.  Initial sequencing and analysis of the human genome.

Authors:  E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

2.  Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes.

Authors:  W James Kent; Robert Baertsch; Angie Hinrichs; Webb Miller; David Haussler
Journal:  Proc Natl Acad Sci U S A       Date:  2003-09-19       Impact factor: 11.205

3.  Detection of large-scale variation in the human genome.

Authors:  A John Iafrate; Lars Feuk; Miguel N Rivera; Marc L Listewnik; Patricia K Donahoe; Ying Qi; Stephen W Scherer; Charles Lee
Journal:  Nat Genet       Date:  2004-08-01       Impact factor: 38.330

4.  Frequency of new copy number variation in humans.

Authors:  Gert-Jan B van Ommen
Journal:  Nat Genet       Date:  2005-04       Impact factor: 38.330

5.  Sequence dependence of DNA conformational flexibility.

Authors:  A Sarai; J Mazur; R Nussinov; R L Jernigan
Journal:  Biochemistry       Date:  1989-09-19       Impact factor: 3.162

6.  Predicting DNA duplex stability from the base sequence.

Authors:  K J Breslauer; R Frank; H Blöcker; L A Marky
Journal:  Proc Natl Acad Sci U S A       Date:  1986-06       Impact factor: 11.205

7.  The influence of CCL3L1 gene-containing segmental duplications on HIV-1/AIDS susceptibility.

Authors:  Enrique Gonzalez; Hemant Kulkarni; Hector Bolivar; Andrea Mangano; Racquel Sanchez; Gabriel Catano; Robert J Nibbs; Barry I Freedman; Marlon P Quinones; Michael J Bamshad; Krishna K Murthy; Brad H Rovin; William Bradley; Robert A Clark; Stephanie A Anderson; Robert J O'connell; Brian K Agan; Seema S Ahuja; Rosa Bologna; Luisa Sen; Matthew J Dolan; Sunil K Ahuja
Journal:  Science       Date:  2005-01-06       Impact factor: 47.728

8.  Large-scale copy number polymorphism in the human genome.

Authors:  Jonathan Sebat; B Lakshmi; Jennifer Troge; Joan Alexander; Janet Young; Pär Lundin; Susanne Månér; Hillary Massa; Megan Walker; Maoyen Chi; Nicholas Navin; Robert Lucito; John Healy; James Hicks; Kenny Ye; Andrew Reiner; T Conrad Gilliam; Barbara Trask; Nick Patterson; Anders Zetterberg; Michael Wigler
Journal:  Science       Date:  2004-07-23       Impact factor: 47.728

9.  Recombination drives the evolution of GC-content in the human genome.

Authors:  Julien Meunier; Laurent Duret
Journal:  Mol Biol Evol       Date:  2004-02-12       Impact factor: 16.240

10.  PEMer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data.

Authors:  Jan O Korbel; Alexej Abyzov; Xinmeng Jasmine Mu; Nicholas Carriero; Philip Cayting; Zhengdong Zhang; Michael Snyder; Mark B Gerstein
Journal:  Genome Biol       Date:  2009-02-23       Impact factor: 13.583

View more
  99 in total

Review 1.  Annotating non-coding regions of the genome.

Authors:  Roger P Alexander; Gang Fang; Joel Rozowsky; Michael Snyder; Mark B Gerstein
Journal:  Nat Rev Genet       Date:  2010-07-13       Impact factor: 53.242

2.  Primate genome architecture influences structural variation mechanisms and functional consequences.

Authors:  Omer Gokcumen; Verena Tischler; Jelena Tica; Qihui Zhu; Rebecca C Iskow; Eunjung Lee; Markus Hsi-Yang Fritz; Amy Langdon; Adrian M Stütz; Pavlos Pavlidis; Vladimir Benes; Ryan E Mills; Peter J Park; Charles Lee; Jan O Korbel
Journal:  Proc Natl Acad Sci U S A       Date:  2013-09-06       Impact factor: 11.205

Review 3.  Direct mutation analysis by high-throughput sequencing: from germline to low-abundant, somatic variants.

Authors:  Michael Gundry; Jan Vijg
Journal:  Mutat Res       Date:  2011-10-12       Impact factor: 2.433

4.  Identification of large rearrangements in cancer genomes with barcode linked reads.

Authors:  Li C Xia; John M Bell; Christina Wood-Bouwens; Jiamin J Chen; Nancy R Zhang; Hanlee P Ji
Journal:  Nucleic Acids Res       Date:  2018-02-28       Impact factor: 16.971

5.  Personal genome sequencing: current approaches and challenges.

Authors:  Michael Snyder; Jiang Du; Mark Gerstein
Journal:  Genes Dev       Date:  2010-03-01       Impact factor: 11.361

Review 6.  On the sequence-directed nature of human gene mutation: the role of genomic architecture and the local DNA sequence environment in mediating gene mutations underlying human inherited disease.

Authors:  David N Cooper; Albino Bacolla; Claude Férec; Karen M Vasquez; Hildegard Kehrer-Sawatzki; Jian-Min Chen
Journal:  Hum Mutat       Date:  2011-09-02       Impact factor: 4.878

7.  Extensive genomic and transcriptional diversity identified through massively parallel DNA and RNA sequencing of eighteen Korean individuals.

Authors:  Young Seok Ju; Jong-Il Kim; Sheehyun Kim; Dongwan Hong; Hansoo Park; Jong-Yeon Shin; Seungbok Lee; Won-Chul Lee; Sujung Kim; Saet-Byeol Yu; Sung-Soo Park; Seung-Hyun Seo; Ji-Young Yun; Hyun-Jin Kim; Dong-Sung Lee; Maryam Yavartanoo; Hyunseok Peter Kang; Omer Gokcumen; Diddahally R Govindaraju; Jung Hee Jung; Hyonyong Chong; Kap-Seok Yang; Hyungtae Kim; Charles Lee; Jeong-Sun Seo
Journal:  Nat Genet       Date:  2011-07-03       Impact factor: 38.330

8.  FusorSV: an algorithm for optimally combining data from multiple structural variation detection methods.

Authors:  Timothy Becker; Wan-Ping Lee; Joseph Leone; Qihui Zhu; Chengsheng Zhang; Silvia Liu; Jack Sargent; Kritika Shanker; Adam Mil-Homens; Eliza Cerveira; Mallory Ryan; Jane Cha; Fabio C P Navarro; Timur Galeev; Mark Gerstein; Ryan E Mills; Dong-Guk Shin; Charles Lee; Ankit Malhotra
Journal:  Genome Biol       Date:  2018-03-20       Impact factor: 13.583

9.  Towards a comprehensive structural variation map of an individual human genome.

Authors:  Andy W Pang; Jeffrey R MacDonald; Dalila Pinto; John Wei; Muhammad A Rafiq; Donald F Conrad; Hansoo Park; Matthew E Hurles; Charles Lee; J Craig Venter; Ewen F Kirkness; Samuel Levy; Lars Feuk; Stephen W Scherer
Journal:  Genome Biol       Date:  2010-05-19       Impact factor: 13.583

10.  Systematic inference of copy-number genotypes from personal genome sequencing data reveals extensive olfactory receptor gene content diversity.

Authors:  Sebastian M Waszak; Yehudit Hasin; Thomas Zichner; Tsviya Olender; Ifat Keydar; Miriam Khen; Adrian M Stütz; Andreas Schlattl; Doron Lancet; Jan O Korbel
Journal:  PLoS Comput Biol       Date:  2010-11-11       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.