| Literature DB >> 31307431 |
Friedrich Kueppers1, Mark D Andrake2, Qifang Xu2, Roland L Dunbrack2, Joannah Kim3, Christopher L Sanders3.
Abstract
BACKGROUND: Alpha 1 Antitrypsin (AAT) is a key serum proteinase inhibitor encoded by SERPINA1. Sequence variants of the gene can cause Alpha 1 Antitrypsin Deficiency (AATD), a condition associated with lung and liver disease. The majority of AATD cases are caused by the 'Z' and 'S' variants - single-nucleotide variations (SNVs) that result in amino acid substitutions of E342K and E264V. However, SERPINA1 is highly polymorphic, with numerous potentially clinically relevant variants reported. Novel variants continue to be discovered, and without reports of pathogenicity, it can be difficult for clinicians to determine the best course of treatment.Entities:
Keywords: Alpha 1 Antitrypsin Deficiency; Protein modeling; Rare variants; SERPINA1
Mesh:
Substances:
Year: 2019 PMID: 31307431 PMCID: PMC6631921 DOI: 10.1186/s12881-019-0852-5
Source DB: PubMed Journal: BMC Med Genet ISSN: 1471-2350 Impact factor: 2.103
Fig. 1Structure of AAT indicating the location of missense residues. The AAT protein (PDB code 1OPH) is shown in ribbon representation coloring according to secondary structural elements (alpha helices shown in orange, beta strands shown in light blue), and the position of missense changes showing the wildtype residue in sphere representation and labeled with the residue name and position. The purple ribbon protein is trypsinogen. The stretch of amino acids that comprise the reactive center loop are shown in green ribbon representation. A = front view; B = rear view (rotated 180 degrees about the x-axis). AAT, Alpha 1 Antitrypsin
Fig. 2Primary and secondary structure of AAT. While the top row sequence represents Uniprot numbering including the signal peptide, the bottom sequence row uses the conventional numbering for AAT found in the broad literature, and used throughout this manuscript. Color coding of residues are according to chemical properties, and display of secondary structural elements (arrow for β-strand, curl for α-helix as extracted from PDB code 1OPH) are shown above the sequences using the software package ESPript [13]. The positions of the novel alleles reported in this manuscript are marked by an asterisk below the residue, and include the variant residue found. Those that are probably or possibly deleterious are colored red, and those that are possibly or probably neutral are colored green. Numbering of the variants, apart from I9N, does not include the 24 amino acid precursor. AAT, Alpha 1 Antitrypsin
Overview of patient characteristics and rare/novel sequence variants identified by next-generation sequencing
| Patient ID | AAT serum level (mg/dL) | CRP (mg/dL) | Previous phenotypes | Age (yrs) | Sex | NGS DNA analysis | ||
|---|---|---|---|---|---|---|---|---|
| Novel mutant nucleotide changes | Consequence | Genotype | ||||||
| Splice variants | ||||||||
| 2250 | 70 | 5.9 | – | 65 | F | Novel splice variant: G > C at position +1 of intron 1C | E376D – M3 allele | |
| 24023 | 62.8 | – | 71 | M | 917 + 1G > A | Affects the normal mRNA splicing. | E376D – M3 allele | |
| Deletions | ||||||||
| 10724 | 52 | 7.3 | M3 | 49 | M | A347fs: Novel 1 bp deletion (1112delC) at position 347 | Frameshift that extends the protein by 5 amino acids | E376D – M3 allele |
| Stop codons | ||||||||
| 6326e | 2 | 0 | Z/M1 | 57 | F | Q156X: C > T at Chr14:94849037 (GRCh.37.p13) c.538C > T | Insertion of stop codon at position 156 | Q156X E342K – Z allele V213A – M1 allele |
| 6376e | 98 | 7 | I | 54 | M | Q156X: C > T at Chr14:94849037 (GRCh.37.p13) c.538C > T | Insertion of stop codon at position 156 | Q156X R39C – I allele |
| 19771 | 91.4 | – | M | 57 | G192Fs: 1 bp deletion at Chr14:94847477 (GRCh37.p13) c.647_647delG | Predicted to produce a premature stop codon at amino acid 214, leading to a premature termination on exon III | ||
| Single-nucleotide variations | ||||||||
| CA97 | 112 | – | M3 M2/4 | 65 | F | GAG > AAG | E204K – rarea E376D – M3 allele R101H – M2/M4 allele | |
| 1144 | 72 | – | M1 | 61 | M | CCC > TCC | P289S – rarea V213A – M1 allele | |
| 2343 | 86 | 3.6 | M1 | 60 | F | ATC > AAC | I9N [includes precursor] – rarea V213A – M1 allele | |
| 4293d | 66 | 0.5 | M1 | 54 | M | CCC > CTC | Q0Bellingham – insertion of stop codon at position 156 | P28L – rarea K217X – Q0Bellingham V213A – M1 allele |
| 5564d | 67 | 1.6 | M1 | 51 | F | CCC > CTC | Q0Bellingham – insertion of stop codon at position 156 | P28L – rarea K217X – Q0Bellingham V213A – M1 allele |
| 4668 | 78 | 2.2 | M3 | 62 | M | ATC > AAC | I50N (PiTijarafe) – rare E376D – M3 allele | |
| 9533 | 73 | 0.9 | – | 60 | M | ATG > ACG | M385 T – rarec M allele | |
| 10889 | – | 12.5 | M3 | 34 | F | CAG > CGG | Q40R – rarea E376D – M3 allele | |
| 12642 | 89 | 0.6 | M3 | 66 | F | GAC > CTC | D341V – rareb E376D – M3 allele P = L118 – no amino acid change | |
| 14271 | 47 | 0.6 | Z/M1 | 61 | F | ATG > ACG | M221 T – rarea V213A – M1 allele E342K – Z allele | |
| 15230 | 34 | 1 | Z/M1 | 72 | M | GTG > GAG | V210E (Ncambodia/Pierre-bénite) – rarea V213A – M1 allele E342K – Z allele | |
| 17,657 | 160 | 1.5 | M3/M4 | 87 | M | AAG > GAG | K174E – NOVELa E376D – M3 allele R101H – M2/M4 allele | |
| 21034 | 121.2 | – | – | 47 | F | CCC > CAC | P369H – rare E264V – S allele | |
| 21636 | 88.4 | – | – | 58 | M | GTG > ATG | V333 M – rarec E376D – M3 allele R101H – M4 allele | |
| 23523 | 118.6 | – | – | 48 | F | GCA > CCA | A325P (Nvestenanova) – rarea R223C – F allele E376D – M3 allele R101H – M4 allele | |
| 24319 | 79.3 | – | – | 57 | F | GCC > GAC | A142D – rarec E264V – S allele V213A – M1 allele | |
| 76430 | 74.8 | – | – | 59 | M | CAC > TAC | H262Y – rarec M allele | |
All SNVs are reported without the 24 amino acid precursor unless otherwise stated
aSNVs have been previously reported in dbSNP (E204K – rs199422208; P289S – rs779938258; Q40R – rs763483402; M221 T – rs766260108; K174E – rs766034720; I9N [includes precursor] – rs1296175763; P28L – rs944607375)
bSNVs have been previously reported in dbSNP and linked to AATD (D314V – rs864622046)
cSNV previously reported in the literature (20–26) (H262Y – rs149537225; V333 M – 373,630,097; A142D – rs142942004; V210E – rs746197812; M385 T – rs1488213352; A325P – rs376024688; I50N – rs1275309068)
dFamilial samples from brother (4293) and sister (5564)
eFamilial samples from brother (6376) and sister (6326)
bp Base pair, dbSNP Database of single nucleotide polymorphisms, SNV Single nucleotide variation
Summary of computational analysis of rare/novel SNVs
| Patient ID | Novel mutation | Amino acid change | Analysis | ||||
|---|---|---|---|---|---|---|---|
| SVM probability | ΔΔG (FoldX) | PolyPhen-2 Score | Comparison with previous computational characterizations | Comments on side chain structure | |||
| Probably deleterious mutations | |||||||
| 1144 | P289Sa | Proline (P; non-polar side chain) > serine (S; polar uncharged side chain) |
|
|
| • Giacopuzzi et al. (2018) [ • VEST3: • iFISH: • MutationAssessor: 4.425 (high) • SIFT: | • Tightly packed side chain buried in the same hydrophobic region as M221, in the breach at the top of the A-sheet, and the beginning of the RCL • Serine is not tolerated sterically as it is larger – likely causing disruption to role of this strategic portion of the protein |
| 4668 | I50N | Isoleucine (I; hydrophobic side chain) > asparagine (N; polar uncharged side chain) |
|
|
| Giacopuzzi et al. (2018) [ • REVEL: • VEST3: 0.706 • iFISH: • MutationAssessor: 4.41 (high) • SIFT: | • Highly conserved residue • Polar side chain introduced to a very hydrophobic core of the protein • Will destabilize hydrophobic core |
| 12,642 | D341V | Aspartic acid (D; negatively charged side chain) >valine (V; hydrophobic side chain) |
|
|
| Giacopuzzi et al. (2018) [ • REVEL: • VEST3: 0.765 • iFISH: • MutationAssessor: 4.06 (high) • SIFT: | • Conserved residue • Borderline significant change in protein stability • In a buried location found at the “Breach” region of the protein at the base of the RCL loop • Change to valine would eliminate aspartic acid hydrogen bonding to adjacent K343 and possibly affect RCL conformation |
| 14,271 | M221 T | Methionine (M; hydrophobic side chain) >threonine (T; polar uncharged side chain) |
|
|
| Giacopuzzi et al. (2018) [ • REVEL: • VEST3: 0.778 • iFISH: • MutationAssessor: 4.74 (high) • SIFT: | • Highly conserved residue • Tightly packed side chain buried in hydrophobic region in the breach at top of α-sheet and beginning of RCL • Threonine would be sterically tolerated due to smaller size but would not have same impact as Methionine on tight packing in strategic area of protein |
| 15,230 | V210E | Valine (V; hydrophobic side chain) >glutamic acid (E; negatively charged side chain) |
|
|
| Giacopuzzi et al. (2018) [ • REVEL: • VEST3: 0.618 • iFISH: • MutationAssessor: 3.745 (high) • SIFT: | • Not a highly conserved residue • Residue participates in tight hydrophobic packing near the tip of a β–strand hairpin • Introduction of glutamic acid could cause charge repulsion with D211 and disrupt packing of β-hairpin or could introduce a new h-bond with nearby N390 |
| 4293†& 5564† | P28La | Proline (P; non-polar side chain) >leucine (L; hydrophobic side chain) |
|
|
| Giacopuzzi et al. (2018) [ • REVEL: 0.387 • VEST3: 0.404 • iFISH: 0.7976 • MutationAssessor: 2.86 (medium) • SIFT: | • Highly conserved residue • P28 is near the N-terminus and the side chain packs against P23. • Change to the larger hydrophobic leucine would be sterically permissible as the side chain is surface-accessible. • Possible that the wildtype Proline is necessary to kink the helix for the tight packing to occur, and the conformation of N-terminal helix interaction with the rest of the protein |
| 21,034 | P369H | Proline (P; non-polar side chain) >histidine (H; positively charged side chain) |
|
|
| Giacopuzzi et al. (2018) [ • REVEL: • VEST3: • iFISH: • MutationAssessor: 4.755 (high) • SIFT: | • Buried location found at end of the RCL loop • Change to histidine would disrupt packing and affect RCL conformation |
| 24,319 | A142D | Alanine (A; small hydrophobic side chain) >aspartic acid (D; negatively charged side chain) |
|
|
| Giacopuzzi et al. (2018) [ • REVEL: • VEST3: 0.694 • iFISH: • MutationAssessor: 3.51 (high) • SIFT: Silva et al., (2016) [ • PolyPhen-2: | • Highly conserved residue • Change to aspartic acid could be sterically problematic as larger charged side chain is introduced to a hydrophobic pocket and could destabilize it |
| Possibly deleterious mutations | |||||||
| 9533 | M385 T | Methionine (M; hydrophobic side chain) >threonine (T; polar uncharged side chain) |
|
| 0.134 | Giacopuzzi et al. (2018) [ • REVEL: • VEST3: 0.738 • iFISH: • MutationAssessor: 1.97 (medium) • SIFT: 0.094 | • Conserved residue • Residue in buried core of protein and makes at least 12 hydrophobic contacts in the core • Change to threonine would shorten the side chain and disrupt core packing; note the significant stability drop |
| 21,636 | V333 M | Valine (V; hydrophobic side chain) >methionine (M; hydrophobic side chain) |
| −0.25 |
| Giacopuzzi et al. (2018) [ • REVEL: • VEST3: 0.676 • iFISH: • MutationAssessor: 1.985 (medium) • SIFT: 0.079 Silva et al., (2016) [ • PolyPhen-2: 0.53 | • Buried location with low ASA; found within the beta-sheet region • Larger/longer side-chain • Methionine may sterically clash in the buried location |
| Possibly neutral mutations | |||||||
| 2343 | I9N [includes precursor]a | Isoleucine (I; hydrophobic side chain) > asparagine (N; polar uncharged side chain) | 0.3387 | N/A | 0.517 | Giacopuzzi et al. (2018) [ • REVEL: 0.453 • VEST3: 0.291 • iFISH: 0.3779 • MutationAssessor: 1.1 (low) • SIFT: | • Not included in Fig. |
| 10,889 | Q40Ra | Glutamine (Q; polar uncharged side chain) >arginine (R; positively charged side chain) |
| −0.35 | 0.018 | Giacopuzzi et al. (2018) [ • REVEL: 0.311 • VEST3: 0.092 • iFISH: 0.5284 • MutationAssessor: 1.515 (low) • SIFT: 0.24 | • Conserved residue • Change to the larger Arginine side chain would present steric problems, despite its accessibility • Q40 hydrogen bonds to V302 and while the larger side chain could also hydrogen bond, it could disrupt packing of the helix that holds V302 |
| 17,657 | K174Ea | Lysine (K; positively charged side chain) > glutamic acid (E; negatively charged side chain) |
| 0.21 | 0.030 | Giacopuzzi et al. (2018) [ • REVEL: • VEST3: 0.681 • iFISH: • MutationAssessor: 2.24 (medium) • SIFT: 0.208 | • Moderately conserved residue • Change to side chain is sterically tolerated as a smaller side chain is introduced in to a solvent-accessible loop of the protein |
| 76,430 | H262Y | Histidine (H; positively charged side chain) >tyrosine (Y; largely hydrophobic side chain, but hydroxyl group can participate in hydrogen bonds or also be phosphorylated) |
| −0.68 | 0.040 | Giacopuzzi et al. (2018) [ • REVEL: 0.086 • VEST3: 0.144 • iFISH: 0.5173 • MutationAssessor: 1.54 (low) • SIFT: Silva et al. (2016) [ • Polyphen-2: 0.06 (21) | • Highly conserved residue • Tightly packed side chain buried • Histidine is involved in 3 hydrogen bonds – to backbone atoms of residues N265, E266, K234 • Tyrosine might not be tolerated sterically because it is larger |
| Probably neutral mutations | |||||||
| CA97 | E204Ka | Glutamic acid (E; negatively charged side chain) >lysine (K; positively charged side chain) | 0.1021 | − 0.70 | 0.000 | Giacopuzzi et al. (2018) [ • REVEL: 0.457 • VEST3: 0.648 • iFISH: 0.1155 • MutationAssessor: − 0.625 (low) • SIFT: 0.921 | • Not a conserved residue, larger lysine chain is sterically tolerated & can make similar contacts. • Little change in protein stability is predicted. Although variant could affect RCL from a distance |
| 23,523 | A325P | Alanine (A; small hydrophobic side chain) >proline (P; non-polar side chain) | 0.0878 | 0.72 | 0.000 | Giacopuzzi et al. (2018) [ • REVEL: 0.214 • VEST3: 0.143 • iFISH: 0.4862 • MutationAssessor: 0.265 (medium) • SIFT: 0.411 | • Not conserved • Insignificant change in protein stability • In a surface-accessible loop, that could play a role in an alternate trypsin binding site |
All sequence variants are reported without the 24 amino acid precursor unless otherwise stated; avariants previously reported in the dbSNP only
Computational analysis (excludes MutationAssessor): numbers in bold, deleterious score; numbers in normal text, neutral score; RCL Reactive center loop, SVM Single vector machine, ASA Accessible surface area
Results of benchmarking analysis
| Species | Sequence Identity (%) | SVM probability | ΔΔG (FoldX) | PolyPhen-2 Score | |
|---|---|---|---|---|---|
| Pathogenic sequence variants (ClinVar) | |||||
| P369L |
| 100 |
|
|
|
| P369S | Homo sapiens | 100 |
|
|
|
| P369T | Homo sapiens | 100 |
|
|
|
| M358R | Homo sapiens | 100 |
| 0.431 | 0.0190 |
| E342K | Homo sapiens | 100 |
|
|
|
| E264V | Homo sapiens | 100 |
|
|
|
| D256V | Homo sapiens | 100 |
|
|
|
| G225R | Homo sapiens | 100 |
|
|
|
| R223C | Homo sapiens | 100 |
| −0.350 |
|
| I92N | Homo sapiens | 100 |
|
|
|
| G67E | Homo sapiens | 100 |
|
|
|
| S53F | Homo sapiens | 100 |
|
|
|
| L41P | Homo sapiens | 100 |
|
|
|
| R39C | Homo sapiens | 100 |
|
|
|
| A336T | Homo sapiens | 100 |
|
|
|
| G115S | Homo sapiens | 100 |
|
|
|
| F52S | Homo sapiens | 100 |
|
|
|
| Benign sequence variants (ClinVar) | |||||
| E376D | Homo sapiens | N/A | 0.2991 |
| 0 |
| E363K | Homo sapiens | N/A | 0.4172 | −0.900 | 0.11 |
| A284S | Homo sapiens | N/A | 0.3445 | −0.240 | 0.139 |
| V213A | Homo sapiens | N/A | 0.1161 | −0.100 | 0 |
| R101H | Homo sapiens | N/A | 0.0576 | −0.533 | 0 |
| Benign sequence variants (Primate neutral variants) | |||||
| P21Q | Hylobates sp. ECACC | 95 |
|
| 0.014 |
| F23L | Papio anubis | 92 | 0.3762 | 0.38 | 0 |
| T27A |
| 98 | 0.4118 |
| 0 |
| N29K | Hylobates sp. ECACC | 95 | 0.4365 | −0.63 | 0.178 |
| N29S | Papio anubis | 92 | 0.0839 | 0.13 | 0 |
| T48S | Hylobates sp. ECACC | 95 | 0.4991 | 0.51 | 0 |
| I50V | Gorilla gorilla | 98 |
|
|
|
| D74S | Chlorocebus sabaeus | 92 | 0.0588 | 0.38 | 0 |
| N81H | Pongo abelii | 96 | 0.462 | 0.27 | 0.007 |
| I92V | Pongo abelii; Hylobates sp. ECACC | 96; 95 |
|
| 0.006 |
| Q105K | Papio anubis | 92 | 0.2178 | −0.5 | 0.001 |
| N116S | Gorilla gorilla | 98 |
|
| 0.311 |
| K136N | Papio anubis | 92 | 0.4292 | 0.07 | 0 |
| E141D | Pongo abelii | 96 | 0.4052 | 0.77 | 0.002 |
| G148E | Chlorocebus sabaeus | 92 | 0.219 | −0.46 | 0 |
| D159N | Papio anubis | 92 | 0.165 | −1.21 | 0 |
| Q212E | Pongo abelii | 96 | 0.2568 | 0.35 | 0 |
| V213A | Hylobates sp. ECACC | 95 | 0.1161 | −0.1 | 0 |
| Q230Y | Papio anubis | 92 | 0.0803 | 0.49 | 0 |
| Q230H | Hylobates sp. ECACC | 95 | 0.0545 | 0.41 | 0 |
| K233E | Papio anubis | 92 | 0.0787 | 0.58 | 0 |
| D270E | Hylobates sp. ECACC | 95 | 0.2543 | −0.03 | 0 |
| I271V | Gorilla gorilla | 98 | 0.1015 | 0.52 | 0 |
| D280N | Pongo abelii | 96 | 0.1407 |
| 0 |
| S285N | Chlorocebus sabaeus | 92 | 0.0554 | −1.6 | 0 |
| S292A | Chlorocebus sabaeus | 92 | 0.3206 | −0.14 | 0.003 |
| S301R | Pongo abelii | 96 | 0.1328 | −0.54 | 0.011 |
| S301T | Hylobates sp. ECACC | 95 | 0.0724 | 0 | 0 |
| S313G | Hylobates sp. ECACC | 95 |
|
| 0.05 |
| E324D | Chlorocebus sabaeus | 92 | 0.0978 |
| 0 |
| A332V | Homo sapiens | 99 | 0.1255 |
|
|
| I360V | Gorilla gorilla | 98 | 0.1547 |
| 0 |
| L383H | Hylobates sp. ECACC | 95 |
|
|
|
| M385I | Papio anubis | 92 |
|
| 0.001 |
| M385 V | 99; 96 |
|
| 0.001 | |
All variants are reported without the 24 amino acid precursor unless otherwise stated
Scores in bold, pathogenic; scores in italics, possibly pathogenic; scores in normal text, benign. SVM Support vector machine
Measurements of benchmarking predictions
| TPR (sensitivity) | TNR (specificity) | PPV | NPV | ACC | BACC | |
|---|---|---|---|---|---|---|
| SVM | 1.0 | 0.825 | 0.720 | 1.0 | 0.879 | 0.913 |
| Polyphen2 | 0.944 | 0.925 | 0.85 | 0.974 | 0.931 | 0.935 |
TNR True negative rate, TPR True positive rate, PPV Positive predictive value, NPV Negative predictive value, ACC Total accuracy, BACC Balanced accuracy
Fig. 3Structure of AAT zoomed in on locations of interest. Presented are some of the missense residues predicted to negatively affect the stability of the folded protein. Several of these missense changes are in the tightly packed core of the protein, such as the P289S variant packed tightly near the M221T variant location