| Literature DB >> 35470453 |
Natasha Berthold1,2,3, Julia Pytte2,3, Cynthia M Bulik4,5,6, Monika Tschochner1, Sarah E Medland7, Patrick Anthony Akkari2,8,9,10.
Abstract
Anorexia nervosa (AN) is a devastating disorder with evidence of underexplored heritability. Twin and family studies estimate heritability (h2 ) to be 57%-64%, and genome-wide association studies (GWAS) reveal significant genetic correlations with psychiatric and anthropometric traits and a total of nine genome-wide significant loci. Whether significantly associated single nucleotide polymorphisms identified by GWAS are causal or tag true causal variants, remains to be elucidated. We propose a novel method for bridging this knowledge gap by fine-mapping short structural variants (SSVs) in and around GWAS-identified loci. SSV fine-mapping of loci associated with complex disorders such as schizophrenia, amyotrophic lateral sclerosis, and Alzheimer's disease has uncovered genetic risk markers, phenotypic variability between patients, new pathological mechanisms, and potential therapeutic targets. We analyze previous investigations' methods and propose utilizing an evaluation algorithm to prioritize 10 SSVs for each of the top two AN GWAS-identified loci followed by Sanger sequencing and fragment analysis via capillary electrophoresis to characterize these SSVs for case/control association studies. Success of previous SSV analyses in complex disorders and effective utilization of similar methodologies supports our proposed method. Furthermore, the structural and spatial properties of the 10 SSVs identified for each of the top two AN GWAS-associated loci, cell adhesion molecule 1 (CADM1) and NCK interacting protein with SH3 domain (NCKIPSD), are similar to previous studies. We propose SSV fine-mapping of AN-associated loci will identify causal genetic architecture. Deepening understandings of AN may lead to novel therapeutic targets and subsequently increase quality-of-life for individuals living with the illness. PUBLIC SIGNIFICANCE STATEMENT: Anorexia nervosa is a severe and complex illness, arising from a combination of environmental and genetic factors. Recent studies estimate the contribution of genetic variability; however, the specific DNA sequences and how they contribute remain unknown. We present a novel approach, arguing that the genetic variant class, short structural variants, could answer this knowledge gap and allow development of biologically targeted therapeutics, improving quality-of-life and patient outcomes for affected individuals.Entities:
Keywords: anorexia nervosa; feeding and eating disorders; genetic association studies; genome-wide association study; heritability; mental disorders; microsatellite repeats
Mesh:
Substances:
Year: 2022 PMID: 35470453 PMCID: PMC9545787 DOI: 10.1002/eat.23716
Source DB: PubMed Journal: Int J Eat Disord ISSN: 0276-3478 Impact factor: 5.791
The eight newly identified genetic loci associated with anorexia nervosa
| CHR | Lead SNP | Nearest gene | Functions |
|
|---|---|---|---|---|
| 3 | rs9821797 |
| Growth and cellular signaling in dendrites and sarcomeres; stress fiber formation (Cho et al., | 6.99 × 10−15 |
| 11 | rs6589488 |
| Cellular adhesion; neural network formation; synaptic formation and number (Jin et al., | 6.31 × 10−11 |
| 2 | rs2287348 |
|
| 5.62 × 10−9 |
| 10 | rs2008387 |
| Alkylating agent removal (Yu et al., | 1.73 × 10−8 |
| 3 | rs9874207 |
| Transcription factor (Siper et al., | 2.05 × 10−8 |
| 1 | rs10747478 |
| RNA splicing in neuronal cell maturation (Romanelli et al., | 3.13 × 10−8 |
| 5 | rs370838138 |
| Sodium dependent intercellular adhesion (Kools et al., | 3.17 × 10−8 |
| 3 | rs13100344 |
| Catlysation of 5‐formylcytadine at position 34 of methionine transfer RNA (Nakano et al., | 4.12 × 10−8 |
Note: Eight genetic loci were identified in the 2019 ANGI GWA by gene proximity to each lead SNP. Lead SNP was determined as the most strongly associated. p‐Value was considered significant (after Bonferroni adjustment) at ≤.05. The major functions for each of the nearest genes to the lead SNP have been described in column 4.
Abbreviations: ASB3, ankyrin repeat and SOCS box containing 3; CADM1, cell adhesion molecule 1; CDH10, cadherin 10; CHR, chromosome; ERLEC1, endoplasmic reticulum lectin 1; FOXP1, forkhead box P1; MGMT, O‐6‐methylguanine‐DNA methyltransferase; NCKIPSD, NCK interacting protein with SH3 domain; NSUN3, NOP2/Sun RNA methyltransferase 3; PTBP2, polypyrimidine tract binding protein 2; RNA, ribonucleic acid; rs, reference SNP accession number; SNP, single nucleotide polymorphism.
Initial short structural variants prioritized by the short structural variant evaluation algorithm for future characterization and investigation in anorexia nervosa case/control studies
| Gene | rs Number | Symbol | Gene feature |
|---|---|---|---|
|
| rs11358670 | 28T | Intronic variant |
|
| rs58589028 | 29T | Intronic variant |
|
| rs61694033 | 25A | Intronic variant |
|
| rs72085573 | 20T | 5′ Intergenic region |
|
| rs140815983 | 15A | 3′ Intergenic region |
|
| rs147798460 | 32T | 3′ Intergenic region |
|
| rs148209064 | 33A | Intronic variant |
|
| rs386374979 | 29T | 3′ UTR downstream contiguous variant |
|
| rs747352768 | 11TGG | Exonic variant (Coding exon 8) |
|
| rs991408884 | 33T | 3′ UTR variant |
|
| rs71074264 | 24T | Intronic variant ( |
|
| rs71627345 | 21A | Intronic variant ( |
|
| rs375474983 | 5ACAA | Intronic variant |
|
| rs377051084 | 9AGGG | Intronic variant |
|
| rs545029045 | 20AC | Intronic variant |
|
| rs757842104 | 31T | Downstream variant |
|
| rs34837885 | 8AAAT | Intronic variant |
|
| rs35746542 | 24T | Intronic variant |
|
| rs67509214 | 28A | Intronic variant |
|
| rs71074266 | 22A | Intronic variant |
Note: The 20 SSVs prioritized by the SSV evaluation algorithm (designed by Saul et al., 2016) as candidates for further investigation to elucidate potential roles in AN risk. Ten SSVs have been prioritized for each genetic candidate loci, NCKIPSD, and CADM1. The column titled “Gene” refers to which candidate loci the SSV was reported for. The column labeled “rs Number” refers to the unique identifier supplied by the current human reference genome for that variant. The column titled “Symbol” refers to the most frequently occurring variation of that SSV according to the Allele Frequency Aggregator Project. The final column, titled “Gene Feature,” refers to the functional property of the region of the genome in which SSV is situated. The symbol, rs number and gene feature listed here are as reported in the current human reference genome GRCh38.p13.
Abbreviations: A, adenine; C, cytosine; CADM1, cell adhesion molecule 1; G, guanine; IP6K2, inositol hexakisophosphate kinase 2; LINC02585, long intergenic non‐protein coding RNA 02585; NCKIPSD, NCK interacting protein with SH3 domain; rs, reference SNP accession number; T, thymine; UTR, untranslated region.