Joseph T Glessner1,2, Xiao Chang3, Yichuan Liu3, Jin Li3, Munir Khan3, Zhi Wei4, Patrick M A Sleiman3,5, Hakon Hakonarson3,5. 1. Department of Pediatrics, Children's Hospital of Philadelphia, 3401 Civic Center Blvd, Philadelphia, PA, 19104, USA. glessner@email.chop.edu. 2. Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, 3400 Civic Center Blvd, Philadelphia, PA, 19104, USA. glessner@email.chop.edu. 3. Department of Pediatrics, Children's Hospital of Philadelphia, 3401 Civic Center Blvd, Philadelphia, PA, 19104, USA. 4. New Jersey Institute of Technology, Newark, NJ, 07102, USA. 5. Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, 3400 Civic Center Blvd, Philadelphia, PA, 19104, USA.
Abstract
BACKGROUND: Not all cells in a given individual are identical in their genomic makeup. Mosaicism describes such a phenomenon where a mixture of genotypic states in certain genomic segments exists within the same individual. Mosaicism is a prevalent and impactful class of non-integer state copy number variation (CNV). Mosaicism implies that certain cell types or subset of cells contain a CNV in a segment of the genome while other cells in the same individual do not. Several studies have investigated the impact of mosaicism in single patients or small cohorts but no comprehensive scan of mosaic CNVs has been undertaken to accurately detect such variants and interpret their impact on human health and disease. RESULTS: We developed a tool called Montage to improve the accuracy of detection of mosaic copy number variants in a high throughput fashion. Montage directly interfaces with ParseCNV2 algorithm to establish disease phenotype genome-wide association and determine which genomic ranges had more or less than expected frequency of mosaic events. We screened for mosaic events in over 350,000 samples using 1% allele frequency as the detection limit. Additionally, we uncovered disease associations of multiple phenotypes with mosaic CNVs at several genomic loci. We additionally investigated the allele imbalance observations genome-wide to define non-diploid and non-integer copy number states. CONCLUSIONS: Our novel algorithm presents an efficient tool with fast computational runtime and high levels of accuracy of mosaic CNV detection. A curated mosaic CNV callset of 3716 events in 2269 samples is presented with comparability to previous reports and disease phenotype associations. The new algorithm can be freely accessed via: https://github.com/CAG-CNV/MONTAGE .
BACKGROUND: Not all cells in a given individual are identical in their genomic makeup. Mosaicism describes such a phenomenon where a mixture of genotypic states in certain genomic segments exists within the same individual. Mosaicism is a prevalent and impactful class of non-integer state copy number variation (CNV). Mosaicism implies that certain cell types or subset of cells contain a CNV in a segment of the genome while other cells in the same individual do not. Several studies have investigated the impact of mosaicism in single patients or small cohorts but no comprehensive scan of mosaic CNVs has been undertaken to accurately detect such variants and interpret their impact on human health and disease. RESULTS: We developed a tool called Montage to improve the accuracy of detection of mosaic copy number variants in a high throughput fashion. Montage directly interfaces with ParseCNV2 algorithm to establish disease phenotype genome-wide association and determine which genomic ranges had more or less than expected frequency of mosaic events. We screened for mosaic events in over 350,000 samples using 1% allele frequency as the detection limit. Additionally, we uncovered disease associations of multiple phenotypes with mosaic CNVs at several genomic loci. We additionally investigated the allele imbalance observations genome-wide to define non-diploid and non-integer copy number states. CONCLUSIONS: Our novel algorithm presents an efficient tool with fast computational runtime and high levels of accuracy of mosaic CNV detection. A curated mosaic CNV callset of 3716 events in 2269 samples is presented with comparability to previous reports and disease phenotype associations. The new algorithm can be freely accessed via: https://github.com/CAG-CNV/MONTAGE .
Entities:
Keywords:
Copy number variation; Genomics; Mosaic; Mosaicism
Authors: Michael J McConnell; Michael R Lindberg; Kristen J Brennand; Julia C Piper; Thierry Voet; Chris Cowing-Zitron; Svetlana Shumilina; Roger S Lasken; Joris R Vermeesch; Ira M Hall; Fred H Gage Journal: Science Date: 2013-11-01 Impact factor: 47.728
Authors: Qian Liu; Justyna A Karolak; Christopher M Grochowski; Theresa A Wilson; Jill A Rosenfeld; Carlos A Bacino; Seema R Lalani; Ankita Patel; Amy Breman; Janice L Smith; Sau Wai Cheung; James R Lupski; Weimin Bi; Pawel Stankiewicz Journal: Genomics Date: 2020-05-06 Impact factor: 5.736
Authors: Mitchell J Machiela; Weiyin Zhou; Joshua N Sampson; Michael C Dean; Kevin B Jacobs; Amanda Black; Louise A Brinton; I-Shou Chang; Chu Chen; Constance Chen; Kexin Chen; Linda S Cook; Marta Crous Bou; Immaculata De Vivo; Jennifer Doherty; Christine M Friedenreich; Mia M Gaudet; Christopher A Haiman; Susan E Hankinson; Patricia Hartge; Brian E Henderson; Yun-Chul Hong; H Dean Hosgood; Chao A Hsiung; Wei Hu; David J Hunter; Lea Jessop; Hee Nam Kim; Yeul Hong Kim; Young Tae Kim; Robert Klein; Peter Kraft; Qing Lan; Dongxin Lin; Jianjun Liu; Loic Le Marchand; Xiaolin Liang; Jolanta Lissowska; Lingeng Lu; Anthony M Magliocco; Keitaro Matsuo; Sara H Olson; Irene Orlow; Jae Yong Park; Loreall Pooler; Jennifer Prescott; Radhai Rastogi; Harvey A Risch; Fredrick Schumacher; Adeline Seow; Veronica Wendy Setiawan; Hongbing Shen; Xin Sheng; Min-Ho Shin; Xiao-Ou Shu; David VanDen Berg; Jiu-Cun Wang; Nicolas Wentzensen; Maria Pik Wong; Chen Wu; Tangchun Wu; Yi-Long Wu; Lucy Xia; Hannah P Yang; Pan-Chyr Yang; Wei Zheng; Baosen Zhou; Christian C Abnet; Demetrius Albanes; Melinda C Aldrich; Christopher Amos; Laufey T Amundadottir; Sonja I Berndt; William J Blot; Cathryn H Bock; Paige M Bracci; Laurie Burdett; Julie E Buring; Mary A Butler; Tania Carreón; Nilanjan Chatterjee; Charles C Chung; Michael B Cook; Michael Cullen; Faith G Davis; Ti Ding; Eric J Duell; Caroline G Epstein; Jin-Hu Fan; Jonine D Figueroa; Joseph F Fraumeni; Neal D Freedman; Charles S Fuchs; Yu-Tang Gao; Susan M Gapstur; Ana Patiño-Garcia; Montserrat Garcia-Closas; J Michael Gaziano; Graham G Giles; Elizabeth M Gillanders; Edward L Giovannucci; Lynn Goldin; Alisa M Goldstein; Mark H Greene; Goran Hallmans; Curtis C Harris; Roger Henriksson; Elizabeth A Holly; Robert N Hoover; Nan Hu; Amy Hutchinson; Mazda Jenab; Christoffer Johansen; Kay-Tee Khaw; Woon-Puay Koh; Laurence N Kolonel; Charles Kooperberg; Vittorio Krogh; Robert C Kurtz; Andrea LaCroix; Annelie Landgren; Maria Teresa Landi; Donghui Li; Linda M Liao; Nuria Malats; Katherine A McGlynn; Lorna H McNeill; Robert R McWilliams; Beatrice S Melin; Lisa Mirabello; Beata Peplonska; Ulrike Peters; Gloria M Petersen; Ludmila Prokunina-Olsson; Mark Purdue; You-Lin Qiao; Kari G Rabe; Preetha Rajaraman; Francisco X Real; Elio Riboli; Benjamín Rodríguez-Santiago; Nathaniel Rothman; Avima M Ruder; Sharon A Savage; Ann G Schwartz; Kendra L Schwartz; Howard D Sesso; Gianluca Severi; Debra T Silverman; Margaret R Spitz; Victoria L Stevens; Rachael Stolzenberg-Solomon; Daniel Stram; Ze-Zhong Tang; Philip R Taylor; Lauren R Teras; Geoffrey S Tobias; Kala Viswanathan; Sholom Wacholder; Zhaoming Wang; Stephanie J Weinstein; William Wheeler; Emily White; John K Wiencke; Brian M Wolpin; Xifeng Wu; Jay S Wunder; Kai Yu; Krista A Zanetti; Anne Zeleniuch-Jacquotte; Regina G Ziegler; Mariza de Andrade; Kathleen C Barnes; Terri H Beaty; Laura J Bierut; Karl C Desch; Kimberly F Doheny; Bjarke Feenstra; David Ginsburg; John A Heit; Jae H Kang; Cecilia A Laurie; Jun Z Li; William L Lowe; Mary L Marazita; Mads Melbye; Daniel B Mirel; Jeffrey C Murray; Sarah C Nelson; Louis R Pasquale; Kenneth Rice; Janey L Wiggs; Anastasia Wise; Margaret Tucker; Luis A Pérez-Jurado; Cathy C Laurie; Neil E Caporaso; Meredith Yeager; Stephen J Chanock Journal: Am J Hum Genet Date: 2015-03-05 Impact factor: 11.025
Authors: Juan R González; Benjamín Rodríguez-Santiago; Alejandro Cáceres; Roger Pique-Regi; Nathaniel Rothman; Stephen J Chanock; Lluís Armengol; Luis A Pérez-Jurado Journal: BMC Bioinformatics Date: 2011-05-17 Impact factor: 3.169