| Literature DB >> 35637795 |
Raquel Vicario-Feliciano1, Cristal I Hernández-Hernández1, Ivonne C Camacho-Pastor2, Juan C Martínez-Cruzado1.
Abstract
Background Wilson's disease (WD) is an autosomal recessive progressive, disabling, life-threatening disease. Although early diagnosis and treatment can halt disease progression and reverse disability, diagnosis is often challenging, with a mean diagnostic delay of approximately two years. At least 98% of WD-causing variants are in the ATPase copper transporting beta (ATP7B) gene. Identifying ATP7B mutations that cause WD in Puerto Rico will allow newborn screening for WD, as well as preventive, life-saving treatment. Methodology TaqMan genotyping assays were performed on 174 random volunteers in southwestern Puerto Rico and on three independent WD cases for rs367956522 and rs140708492, single-nucleotide polymorphisms (SNPs) composing a WD-causing haplotype. A polymerase chain reaction followed by Sanger DNA sequencing confirmed the case genotypes. Bioinformatics analyses were performed on ATP7B polymorphisms present in The 1000 Genomes Project (1KGP) database for Puerto Rico. Results rs367956522 is always inherited together with rs140708492 but not vice versa. The three independent WD cases were homozygous for both SNPs, but the evidence strongly suggested that rs367956522 is the pathogenic variant. The 1KGP database revealed the presence of only one other likely pathogenic ATP7B variant, rs191312027 (Gly869Arg). Together, both variants may be responsible for causing WD in one of every 14,156 Puerto Ricans. Both are likely of European origin. Conclusions Genotyping probes for both variants are readily commercially available. Thus, rapid, inexpensive newborn screening for rs367956522 and rs191312027 is strongly recommended. Although these two variants may account for all or the vast majority of WD cases in Puerto Rico, other ATP7B polymorphisms described or not described in this study might also be pathogenic.Entities:
Keywords: cost-effectiveness; european origin; low-copper diet; newborn screening; preventive treatment; puerto rico; taqman genotyping assay; the 1000 genomes project; wilson’s disease
Year: 2022 PMID: 35637795 PMCID: PMC9129266 DOI: 10.7759/cureus.24446
Source DB: PubMed Journal: Cureus ISSN: 2168-8184
Municipality distribution and genotypes of random samples.
| Municipality | n | # heterozygous for rs140708492 only | # heterozygous for rs140708492 and rs367956522 |
| Añasco | 2 | 1 | --- |
| Cabo Rojo | 5 | --- | --- |
| Comerío | 1 | --- | --- |
| Guánica | 11 | --- | --- |
| Guayanilla | 9 | --- | --- |
| Hormigueros | 1 | --- | --- |
| Lajas | 13 | --- | --- |
| Lares | 1 | --- | --- |
| Maricao | 2 | --- | --- |
| Maunabo | 1 | --- | --- |
| Mayagüez | 7 | --- | --- |
| Naranjito | 1 | 1 | --- |
| Peñuelas | 3 | --- | --- |
| Ponce | 4 | --- | --- |
| Sabana Grande | 16 | 2 | --- |
| San Germán | 23 | --- | --- |
| San Juan | 2 | --- | --- |
| Vega Baja | 1 | --- | --- |
| Yauco | 71 | --- | 1 |
| Total | 174 | 4 | 1 |
Figure 1Chromatograms of the DNA region encompassing the splicing acceptor site of intron 11 in the ATP7B gene.
(A)Control. The AG acceptor site is located at positions 294 and 295 in the sequence. (B) The heterozygous mother of the third WD patient. Both alleles (A (green) and G (black)) of rs367956522 appear at position 294.(C) The third WD patient. Homozygosity for the minor (G) allele at position 294 is observed.
WD = Wilson’s disease
Polymorphisms inside the five main ATP7B transcripts.
∩UTR = untranslated region
*According to build GRCh37 (hg19). Underlined positions indicate polymorphisms with a degree of likelihood for being pathogenic (see Discussion).
∆Identification number of the single nucleotide polymorphism in the National Center for Biotechnology Information database.
†CTCF = CCCTC-binding factor. Position of amino acid replacements refers to the largest protein.
‡Worldwide Minor Allele Frequency in The 1000 Genomes Project database.
§CIP = conflicting interpretations in pathogenicity; b = benign, lb = likely benign, us = uncertain significance, lp = likely pathogenic, p = pathogenic.
#1 to 5 refer to the ATP7B proteins from the longest to shortest.
¥SIFT = sorting intolerant from tolerant program. APT = affect protein function, low = low confidence prediction.
||Out of 208 Puerto Rican chromosome 13 copies.
**Polymorphism found only in Puerto Rico.
ψAbsent in the ATP7B protein 1381 aa long.
ΩAbsent in the ATP7B protein 1258 aa long.
| mRNA region∩ | Position on Chr13* | dbSNP ID∆ | Effect† | WMAF‡ | ClinVar interpretation§ | Polyphen-2 prediction# | SIFT prediction¥ | Frequency in PR|| |
| 5’ UTR | 52585591 | rs148013251 | CTCF binding site | 0.37 | CIP: 5b, 1lb, 1us | --- | --- | 77 |
| 5’ UTR | 52585548 | rs2277448 | CTCF binding site | 0.48 | Benign | --- | --- | 80 |
| 5’ UTR | 52585504 | rs184427928 | Unknown | <0.01 | Not reported | --- | --- | 1 |
| Protein coding | 52549258 | rs184868522 | Met33Thr | <0.01 | Uncertain significance | 1. Benign 2, 3, 4. Possibly damaging 5. Probably damaging | 1, 2, 4. APT (low) 3, 5. Tolerate (low) | 2 |
| Protein coding | 52548320 | rs201200863** | Pro346Serψ | <0.01 | Not reported | Benign | Tolerated (low) | 1 |
| Protein coding | 52548158 | rs199807461 | Thr400Ala | <0.01 | CIP: 1lb, 2us | 1, 2, 5. Benign 3, 4. Possibly damaging | Tolerated | 1 |
| Protein coding | 52548140 | rs1801243 | Ser406Ala | 0.38 | Benign | Benign | Tolerated | 75 |
| Protein coding | 52544809 | rs150018860 | Synonymous | <0.01 | Likely benign | --- | --- | 1 |
| Protein coding | 52544805 | rs1801244 | Val456Leu | 0.38 | Benign | Benign | Tolerated (low) | 78 |
| Protein coding | 52542667 | rs145798966 | Synonymous | <0.01 | CIP: 3b, 5lb, 2us | --- | --- | 1 |
| Protein coding | 52524488 | rs1061472 | Lys832Arg | 0.50 | Benign | 1, 2, 3, 4. Benign 5.Possibly damaging | Tolerated | 99 |
| Protein coding | 52524268 | rs191312027 | Gly869Arg | <0.01 | CIP: 8p, 5lp, 1us | Probably damaging | APT | 1 |
| Protein coding | 52523808 | rs732774 | Arg952LysΩ | 0.47 | Benign | Benign | Tolerated | 98 |
| Protein coding | 52520507 | rs1801246 | Synonymous | 0.02 | Benign | --- | --- | 7 |
| Protein coding | 52520501 | rs200656411** | Synonymous | <0.01 | Likely benign | --- | --- | 1 |
| Protein coding | 52520471 | rs1801247 | Synonymous | 0.04 | Benign | --- | --- | 10 |
| Protein coding | 52520435 | rs1801248 | Synonymous | 0.01 | CIP: 8b, 1lb | --- | --- | 11 |
| Protein coding | 52516568 | rs59120265 | Synonymous | 0.01 | Benign | --- | --- | 3 |
| Protein coding | 52516529 | rs373081328 | Synonymous | <0.01 | CIP: 1b, 4lb, 1us | --- | --- | 1 |
| Protein coding | 52515354 | rs1801249 | Val1140Ala | 0.46 | Benign | Benign | Tolerated (low) | 96 |
| Protein coding | 52513266 | rs7334118 | His1207Arg | 0.03 | CIP: 7b, 2lb, 1us | 1, 3, 4, 5. Benign 2. Probably damaging | Tolerated | 11 |
| Protein coding | 52509077 | rs189601972** | Gly1405Ser | <0.01 | CIP: 1lp, 7us | 1, 2, 3, 5. Probably damaging 4. Possibly damaging | APT (low) | 1 |
| Protein coding | 52508989 | rs60986317 | Thr1434Met | 0.02 | CIP: 1b, 3lb, 8us | 1, 4. Possibly damaging 2, 3, 5. Probably damaging | Tolerated (low) | 1 |
| Protein coding | 52508988 | rs116091486 | Synonymous | 0.01 | Benign/Likely benign | --- | --- | 3 |
| Protein coding | 52508979 | rs73202048 | Synonymous | <0.01 | Benign/Likely benign | --- | --- | 4 |
| Protein coding | 52508895 | rs199859839** | Synonymous | <0.01 | Likely benign | --- | --- | 1 |
| 3’ UTR | 52508877 | rs73498144 | CTCF binding site | <0.01 | Likely benign | --- | --- | 2 |
| 3’ UTR | 52508823 | rs566159129** | CTCF binding site | <0.01 | Not reported | --- | --- | 1 |
| 3’ UTR | 52508739 | rs563069856 | Unknown | <0.01 | Not reported | --- | --- | 1 |
| 3’ UTR | 52508702 | rs115420019 | Unknown | 0.03 | 1b, 1us | --- | --- | 7 |
| 3’ UTR | 52507883 | rs79747858 | Unknown | 0.01 | Likely benign | --- | --- | 1 |
| 3’ UTR | 52507856 | rs533209080** | Unknown | <0.01 | Uncertain significance | --- | --- | 1 |
| 3’ UTR | 52507816 | rs564499990 | Unknown | <0.01 | Uncertain significance | --- | --- | 1 |
| 3’ UTR | 52507720 | rs1051332 | Unknown | 0.31 | Benign | --- | --- | 70 |
| 3’ UTR | 52507710 | rs79490882 | Unknown | 0.04 | Benign | --- | --- | 7 |
| 3’ UTR | 52507370 | rs191892694** | Unknown | <0.01 | Not reported | --- | --- | 1 |
| 3’ UTR | 52507175 | rs77770386 | Unknown | 0.02 | Benign | --- | --- | 5 |
| 3’ UTR | 52507148 | rs9535794 | Unknown | 0.02 | Likely benign | --- | --- | 9 |
| 3’ UTR | 52507110 | rs928169 | Unknown | 0.48 | Benign | --- | --- | 101 |
Figure 2Median-joining haplotype subnetwork of all haplotypes whose most recent hypothetical ancestors differ from the haplotype shared by five individuals by not more than four mutations.
Haplotypes are represented by circles with size proportional to the number of chromosomes they represent. Haplotypes represented by one chromosome are shown in yellow, and those represented by more than one chromosome in green. The 1KGP ID of each sample to which each chromosome belongs is shown next to its corresponding circle: three or four-digit numbers represent the ID of Puerto Rican samples (e.g., 1402 = HG01402), while non-Puerto Rican chromosomes are identified by letters (e.g., GBR# = chromosomes from British in England and Scotland, see text). The haplotype with the pathological mutation rs367956522 (1413) is highlighted in red and a bigger, bold font. Hypothetical ancestor haplotypes are represented by small blue circles except for the root ancestor haplotype, which is highlighted as a small red circle. Crosshatches denote mutations distinguishing haplotypes from one another. Crosshatches representing rs140708492 are highlighted by asterisks. Only one mutation distinguishes the haplotype shared by five chromosomes from the root, rs140708492, which forms part of four of the seven reticulations.