Literature DB >> 27974498

Statistical Methods for Identifying Sequence Motifs Affecting Point Mutations.

Yicheng Zhu1, Teresa Neeman2, Von Bing Yap3, Gavin A Huttley1.   

Abstract

Mutation processes differ between types of point mutation, genomic locations, cells, and biological species. For some point mutations, specific neighboring bases are known to be mechanistically influential. Beyond these cases, numerous questions remain unresolved, including: what are the sequence motifs that affect point mutations? How large are the motifs? Are they strand symmetric? And, do they vary between samples? We present new log-linear models that allow explicit examination of these questions, along with sequence logo style visualization to enable identifying specific motifs. We demonstrate the performance of these methods by analyzing mutation processes in human germline and malignant melanoma. We recapitulate the known CpG effect, and identify novel motifs, including a highly significant motif associated with A[Formula: see text]G mutations. We show that major effects of neighbors on germline mutation lie within [Formula: see text] of the mutating base. Models are also presented for contrasting the entire mutation spectra (the distribution of the different point mutations). We show the spectra vary significantly between autosomes and X-chromosome, with a difference in T[Formula: see text]C transition dominating. Analyses of malignant melanoma confirmed reported characteristic features of this cancer, including statistically significant strand asymmetry, and markedly different neighboring influences. The methods we present are made freely available as a Python library https://bitbucket.org/pycogent3/mutationmotif.
Copyright © 2017 by the Genetics Society of America.

Entities:  

Keywords:  5-methyl-cytosine; bioinformatics; context dependent mutation; germline mutation; log-linear model; mutation spectrum; sequence motif analysis; somatic mutation

Mesh:

Year:  2016        PMID: 27974498      PMCID: PMC5289855          DOI: 10.1534/genetics.116.195677

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  40 in total

Review 1.  Isochores and the evolutionary genomics of vertebrates.

Authors:  G Bernardi
Journal:  Gene       Date:  2000-01-04       Impact factor: 3.688

2.  Transcription-coupled TA and GC strand asymmetries in the human genome.

Authors:  M Touchon; S Nicolay; A Arneodo; Y d'Aubenton-Carafa; C Thermes
Journal:  FEBS Lett       Date:  2003-12-18       Impact factor: 4.124

Review 3.  Oxidative DNA damage: mechanisms, mutation, and disease.

Authors:  Marcus S Cooke; Mark D Evans; Miral Dizdaroglu; Joseph Lunec
Journal:  FASEB J       Date:  2003-07       Impact factor: 5.191

Review 4.  Evidence for ecological speciation and its alternative.

Authors:  Dolph Schluter
Journal:  Science       Date:  2009-02-06       Impact factor: 47.728

5.  Evidence for recent, population-specific evolution of the human mutation rate.

Authors:  Kelley Harris
Journal:  Proc Natl Acad Sci U S A       Date:  2015-03-02       Impact factor: 11.205

6.  A comprehensive catalogue of somatic mutations from a human cancer genome.

Authors:  Erin D Pleasance; R Keira Cheetham; Philip J Stephens; David J McBride; Sean J Humphray; Chris D Greenman; Ignacio Varela; Meng-Lay Lin; Gonzalo R Ordóñez; Graham R Bignell; Kai Ye; Julie Alipaz; Markus J Bauer; David Beare; Adam Butler; Richard J Carter; Lina Chen; Anthony J Cox; Sarah Edkins; Paula I Kokko-Gonzales; Niall A Gormley; Russell J Grocock; Christian D Haudenschild; Matthew M Hims; Terena James; Mingming Jia; Zoya Kingsbury; Catherine Leroy; John Marshall; Andrew Menzies; Laura J Mudie; Zemin Ning; Tom Royce; Ole B Schulz-Trieglaff; Anastassia Spiridou; Lucy A Stebbings; Lukasz Szajkowski; Jon Teague; David Williamson; Lynda Chin; Mark T Ross; Peter J Campbell; David R Bentley; P Andrew Futreal; Michael R Stratton
Journal:  Nature       Date:  2009-12-16       Impact factor: 49.962

7.  Genome-wide patterns and properties of de novo mutations in humans.

Authors:  Laurent C Francioli; Paz P Polak; Amnon Koren; Androniki Menelaou; Sung Chun; Ivo Renkens; Cornelia M van Duijn; Morris Swertz; Cisca Wijmenga; Gertjan van Ommen; P Eline Slagboom; Dorret I Boomsma; Kai Ye; Victor Guryev; Peter F Arndt; Wigard P Kloosterman; Paul I W de Bakker; Shamil R Sunyaev
Journal:  Nat Genet       Date:  2015-05-18       Impact factor: 38.330

8.  COSMIC: exploring the world's knowledge of somatic mutations in human cancer.

Authors:  Simon A Forbes; David Beare; Prasad Gunasekaran; Kenric Leung; Nidhi Bindal; Harry Boutselakis; Minjie Ding; Sally Bamford; Charlotte Cole; Sari Ward; Chai Yin Kok; Mingming Jia; Tisham De; Jon W Teague; Michael R Stratton; Ultan McDermott; Peter J Campbell
Journal:  Nucleic Acids Res       Date:  2014-10-29       Impact factor: 16.971

9.  An expanded sequence context model broadly explains variability in polymorphism levels across the human genome.

Authors:  Varun Aggarwala; Benjamin F Voight
Journal:  Nat Genet       Date:  2016-02-15       Impact factor: 38.330

10.  A Simple Model-Based Approach to Inferring and Visualizing Cancer Mutation Signatures.

Authors:  Yuichi Shiraishi; Georg Tremmel; Satoru Miyano; Matthew Stephens
Journal:  PLoS Genet       Date:  2015-12-02       Impact factor: 5.917

View more
  10 in total

1.  Quantifying Influences on Intragenomic Mutation Rate.

Authors:  Helmut Simon; Gavin Huttley
Journal:  G3 (Bethesda)       Date:  2020-08-05       Impact factor: 3.154

2.  Signals of Variation in Human Mutation Rate at Multiple Levels of Sequence Context.

Authors:  Rachael C Aikens; Kelsey E Johnson; Benjamin F Voight
Journal:  Mol Biol Evol       Date:  2019-05-01       Impact factor: 16.240

3.  Context-Dependent Mutation Dynamics, Not Selection, Explains the Codon Usage Bias of Most Angiosperm Chloroplast Genes.

Authors:  Brian R Morton
Journal:  J Mol Evol       Date:  2021-12-21       Impact factor: 2.395

4.  Substitution rate heterogeneity across hexanucleotide contexts in noncoding chloroplast DNA.

Authors:  Brian R Morton
Journal:  G3 (Bethesda)       Date:  2022-07-29       Impact factor: 3.542

5.  Non-random genetic alterations in the cyanobacterium Nostoc sp. exposed to space conditions.

Authors:  Yuguang Liu; Patricio Jeraldo; William Herbert; Samantha McDonough; Bruce Eckloff; Jean-Pierre de Vera; Charles Cockell; Thomas Leya; Mickael Baqué; Jin Jen; Dirk Schulze-Makuch; Marina Walther-Antonio
Journal:  Sci Rep       Date:  2022-07-22       Impact factor: 4.996

6.  The cytidine deaminase under-representation reporter (CDUR) as a tool to study evolution of sequences under deaminase mutational pressure.

Authors:  Maxwell Shapiro; Stephen Meier; Thomas MacCarthy
Journal:  BMC Bioinformatics       Date:  2018-05-02       Impact factor: 3.169

7.  EvoLSTM: context-dependent models of sequence evolution using a sequence-to-sequence LSTM.

Authors:  Dongjoon Lim; Mathieu Blanchette
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

8.  Machine Learning Techniques for Classifying the Mutagenic Origins of Point Mutations.

Authors:  Yicheng Zhu; Cheng Soon Ong; Gavin A Huttley
Journal:  Genetics       Date:  2020-03-19       Impact factor: 4.562

9.  A Bayesian Framework for Inferring the Influence of Sequence Context on Point Mutations.

Authors:  Guy Ling; Danielle Miller; Rasmus Nielsen; Adi Stern
Journal:  Mol Biol Evol       Date:  2020-03-01       Impact factor: 16.240

10.  Fast neutron mutagenesis in soybean enriches for small indels and creates frameshift mutations.

Authors:  Skylar R Wyant; M Fernanda Rodriguez; Corey K Carter; Wayne A Parrott; Scott A Jackson; Robert M Stupar; Peter L Morrell
Journal:  G3 (Bethesda)       Date:  2022-02-04       Impact factor: 3.542

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.