Literature DB >> 32955204

Power and Pitfalls of Computational Methods to Identify New Genes Responsible for Acute Liver Failure of Indeterminate Etiology in Adults.

Abdelkader Heddar1,2, Micheline Misrahi1,2.   

Abstract

Entities:  

Mesh:

Year:  2020        PMID: 32955204      PMCID: PMC7447419          DOI: 10.14309/ctg.0000000000000180

Source DB:  PubMed          Journal:  Clin Transl Gastroenterol        ISSN: 2155-384X            Impact factor:   4.396


× No keyword cloud information.
We have read with great interest the recent article by Rakela et al. (1) describing genetic alterations detected by whole exome sequencing in a cohort of 26 adult patients with acute liver failure of indeterminate etiology (ALF-IE). The authors conclude that 12 variants of 11 genes were significantly associated with (ALF-IE), particularly the stop codon of SEPRINB11 identified in 23 of the 26 patients and the missense variant of CYP2D6 detected in 13 patients. However, we have several key concerns regarding the results of the study. First, Rakela et al. reported that the SERPINB11 (c.268G>T, p.Glu90Ter) variant, recorded as rs4940595 in the database of human single nucleotide polymorphism, was detected in most of the cohort. However, a careful examination of this variant using VarSome (2) and the latest update of the Broad Institute's gnomAD (3), a large database of human polymorphisms, reveals that this SERPINB11 variant has a global allele frequency (AF) as high as 63% for all ethnic subgroups (Tables 1 and 2). Thus, we cannot conclude that the variant is frequently associated with (ALF-IE). According to the pipeline of analysis described in the study (AF threshold of 0.01 in human polymorphism databases), this result can be explained by the fact that authors probably used the old version of gnomAD, in which all the variants and polymorphisms were not aligned with the new version of the genome and not the updated version of gnomAD. This update issue is also true for the second missense variant of CYP2D6, which has neither a high frequency nor an overall frequency for all ethnic subgroups in the updated gnomAD database (Tables 1 and 2).
Table 1.

Extensive information on the variants detected by Rakela et al.

GenersNchrGRCh38REFALTCanonical RNA transcriptDNAProteinVariant typegnomAD EgnomAD G
SERPINB11rs4940595231863712604GTNM_080475.4c.268G>Tp.Glu90TerNon-sense0.63
rs494059501863712604GANM_080475.4c.268G>Ap.Glu90LysMissense0.0000279
rs494059501863712604GCNM_080475.4c.268G>Cp.Glu90GlnMissense0.00000699
CYP2D6rs1135840162242126611CGNM_000106.6c.1457G>Cp.Ser486ThrMissense0.577
ANTXRLrs709174981046330066CANM_001278688.2c.1878C>Ap.Pro626=Synonymous0.000007
rs70917491046330066CGNM_001278688.2c.1878C>Gp.Pro626=Synonymous0.194
rs70917491046330066CTNM_001278688.2c.1878C>Tp.Pro626=Synonymous0.0103
MUC6rs7765723129111016887GANM_005961.3c.5914C>Tp.Pro1972SerMissense0.000020.0118
OR6J1rs1753430141422634064AGNM_001348233.1c.748T>Cp.Ser250ProMissense0.41
GNALrs20189854831811689671_GGCCCTENST00000334049.11c.113_118dupTGGCCCp.Leu38_Ala39dupInsertion0.08420.141
rs2018985481811689671GGCCCT_ENST00000334049.11c.113_118delTGGCCCp.Leu38_Ala39delInframe deletion0.001040.0168
AADACL3rs30108777112719616CTNM_001103170.3c.310C>Tp.Pro104SerMissense0.151
MCL1rs1158094631150578851GANM_021960.5c.680C>Tp.Ala227ValMissense0.008420.0086
CYP2D7rs56404506112242141186CTNM_001348386.3c.1196G>Ap.Arg399HisMissense0.281
CYP2D7rs1800754152242141587GANM_001348386.3c.932C>Tp.Ser311LeuMissense0.48
rs18007542242141587GTNM_001348386.3c.932C>Ap.Ser311TerNon-sense
RRP36rs2008868319643021676_GCCGGG (6bp)ENST00000244496.6c.43_48dupGGGGCCp.Gly15_Ala16dupInsertion0.0000750.00104
rs200886831643021676GCCGGG (6bp)_ENST00000244496.6c.43_48delGGGGCCp.Gly15_Ala16delInframe deletion0.04190.285
rs200886831643021676_GC…GG (12bp)ENST00000244496.6c.37_48dupGGGGCCGGGGCCp.Gly13_Ala16dupInsertion0.000014
rs200886831643021676GC…GG (12bp)_ENST00000244496.6c.37_48delp.Gly13_Ala16delInframe deletion0.0001870.000961
KIAA1161rs48797826934372875GCNM_020702.5c.69C>Gp.Tyr23TerNon-sense0.251

ALT, alternative allele; Chr, chromosome; gnomAD, Allele frequency according to gnomAD exome database; gnomAD G, Allele frequency according to gnomAD genome database; GRCh38, position of the variant according to the last annotation of the human genome; N, number of patients with the variants in the cohort described by Rakela et al; REF, reference allele; rs: the identifier (ID) of the variant according to the last version of the human database of single nucleotide polymorphism (dbSNP 151)

Table 2.

AF in GnomAD genome (V3) database of the 2 variants of SERPINB11 and CYP2D6 reported by Rakela et al. in a cohort of adult patients with acute liver failure of indeterminate etiology

PopulationSERPINB11: c.268G>T, p.Glu90Ter (rs4940595)CYP2D6: c.1457G>C, p.Ser486Thr (rs1135840)
Allele countAllele numberHomozygotesAFAllele countAllele numberHomozygotesAF
African21,03341,9325,3290.50226,14840,8228,8810.641
Amish6049001990.6714668841280.527
Ashkenazi Jewish2,1713,3227230.6542,1383,3127140.646
East Asian1,2063,1242320.3862,1623,0607820.707
European (Finnish)6,98710,4382,3140.6695,21710,3881,3300.502
European (Non-Finnish)46,72164,53216,8580.72435,74864,03010,4320.558
Latino8,12413,6462,4300.5956,61513,5481,7340.488
South Asian1,9043,0285970.6291,6632,9724890.559
Other1,3622,1504380.6331,2232,1263690.575
Total90,112143,07229,1200.62981,380141,14224,8590.577

AF, allele frequency.

Extensive information on the variants detected by Rakela et al. ALT, alternative allele; Chr, chromosome; gnomAD, Allele frequency according to gnomAD exome database; gnomAD G, Allele frequency according to gnomAD genome database; GRCh38, position of the variant according to the last annotation of the human genome; N, number of patients with the variants in the cohort described by Rakela et al; REF, reference allele; rs: the identifier (ID) of the variant according to the last version of the human database of single nucleotide polymorphism (dbSNP 151) AF in GnomAD genome (V3) database of the 2 variants of SERPINB11 and CYP2D6 reported by Rakela et al. in a cohort of adult patients with acute liver failure of indeterminate etiology AF, allele frequency. Furthermore, SERPINB11 is a unitary pseudogene in human genome (4). The pseudogene refers to a gene that has lost its functional counterpart during evolution because of a mutagenic event resulting in a premature stop codon. These mutagenic effects are often located in the initial portion of the coding gene, as for SERPINB11. Interestingly, SERPINB11 has the particularity of being polymorphic in humans (4). Hence, the rs4940595 (G/T) variant distinguishes between individuals bearing a functional copy of SERPINB11 with a wild-type c.268G allele and those bearing a nonfunctional copy, with the mutated c.268G>T allele, introducing a stop codon. The extremely high frequency of the mutated allele of SERPINB11 in the general population in different ethnic groups makes the link between this variant and ALF-IE in adults highly unlikely. Finally, 5 of the 9 other variants located in MUC6, OR6J1, AADACL3, CYP2D7, and KIAA1161 also occur frequently in the gnomAD genome, with an AF up to 40%. For the remaining 4 variants, the results can be considered inconclusive because they correspond to multiallelic polymorphisms which must be referenced precisely, theirs ID being insufficient (Table 1). For instance, rs200886831 of RRP36 detected in 9 patients, refers to 4 different alleles and corresponds to in-frame deletion or insertion of either 6 or 12 bp. One of the 4 alleles has an AF of 45% in gnomAD, whereas the other 3 are extremely rare. Thus, the conclusion raised by Rakela et al. on the significant association of the 12 variants with ALF-IE in adults is incompatible with careful examination of recent databases. A careful processing of exome and genome data is necessary to find out genes causing ALF-IE in adults.

CONFLICTS OF INTEREST

Guarantor of the article: Micheline Misrahi, MD, PhD. Specific author contribution: A.H. performed reanalysis and interpretation of published computational data and wrote the letter. M.M. supervised the study, wrote, and revised the manuscript. Financial support: None to report. Potential competing interests: None to report.
  3 in total

1.  Identification and analysis of unitary pseudogenes: historic and contemporary gene losses in humans and other primates.

Authors:  Zhengdong D Zhang; Adam Frankish; Toby Hunt; Jennifer Harrow; Mark Gerstein
Journal:  Genome Biol       Date:  2010-03-08       Impact factor: 13.583

2.  Whole Exome Sequencing Among 26 Patients With Indeterminate Acute Liver Failure: A Pilot Study.

Authors:  Jorge Rakela; Jody Rule; Daniel Ganger; Julie Lau; Julie Cunningham; Mrunal Dehankar; Saurabh Baheti; William M Lee
Journal:  Clin Transl Gastroenterol       Date:  2019-10       Impact factor: 4.488

3.  VarSome: the human genomic variant search engine.

Authors:  Christos Kopanos; Vasilis Tsiolkas; Alexandros Kouris; Charles E Chapple; Monica Albarca Aguilera; Richard Meyer; Andreas Massouras
Journal:  Bioinformatics       Date:  2019-06-01       Impact factor: 6.937

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.