| Literature DB >> 35314694 |
Praveen Anand1, Patrick J Lenehan2, Michiel Niesen2, Unice Yoo2, Dhruti Patwardhan1, Marcelo Montorzi2,3, A J Venkatakrishnan4, Venky Soundararajan5,6.
Abstract
Acute cardiac injury has been observed in a subset of COVID-19 patients, but the molecular basis for this clinical phenotype is unknown. It has been hypothesized that molecular mimicry may play a role in triggering an autoimmune inflammatory reaction in some individuals after SARS-CoV-2 infection. Here we investigate if linear peptides contained in proteins that are primarily expressed in the heart also occur in the SARS-CoV-2 proteome. Specifically, we compared the library of 136,704 8-mer peptides from 144 human proteins (including splicing variants) to 9926 8-mers from all the viral proteins in the reference SARS-CoV-2 proteome. No 8-mers were exactly identical between the reference human proteome and the reference SARS-CoV-2 proteome. However, there were 45 8-mers that differed by only one amino acid when compared to the reference SARS-CoV-2 proteome. Interestingly, analysis of protein-coding mutations from 141,456 individuals showed that one of these 8-mers from the SARS-CoV-2 Replicase polyprotein 1a/1ab (KIALKGGK) is identical to an MYH6 peptide encoded by the c.5410 C > A (Q1804K) genetic variation, which has been observed at low prevalence in Africans/African Americans (0.08%), East Asians (0.3%), South Asians (0.06%), and Latino/Admixed Americans (0.003%). Furthermore, analysis of 4.85 million SARS-CoV-2 genomes from over 200 countries shows that viral evolution has already resulted in 20 additional 8-mer peptides that are identical to human heart-enriched proteins encoded by reference sequences or genetic variants. Whether such mimicry contributes to cardiac inflammation during or after COVID-19 illness warrants further experimental evaluation. We suggest that SARS-CoV-2 variants harboring peptides identical to human cardiac proteins should be investigated as "viral variants of cardiac interest".Entities:
Year: 2022 PMID: 35314694 PMCID: PMC8935120 DOI: 10.1038/s41420-022-00914-9
Source DB: PubMed Journal: Cell Death Discov ISSN: 2058-7716
Fig. 1Identification of mimicked peptides between SARS-CoV-2 and human proteins.
a Identification of cardiac-specific proteins based on analysis of bulk RNAseq and single-cell RNA-seq data and identification of SARS-CoV-2 proteins. b Comparison of peptide libraries of human cardiac proteins and SARS-CoV-2 proteins.
List of peptide pairs from SARS-CoV-2 proteins and human cardiac proteins that have a Hamming distance less than or equal to 1.
| SARS-CoV-2 Proteins | Human cardiac-enriched proteins | ||||
|---|---|---|---|---|---|
| Protein name | Amino acid positions | Amino acid sequence | Protein name | Isoforms containing sequence | Amino acid sequence |
| Spike glycoprotein WT/2 P | 491–498 | PLQSYGFQ | KLHL41 | 1, 2 | PLQSYFFQ |
| Spike glycoprotein WT/2 P | 856–863 | NGLTVLPP | FHOD3 | 1, 2, 3, 4 | IGLTVLPP |
| Spike glycoprotein WT/2 P | 857–864 | GLTVLPPL | FHOD3 | 1, 2, 3, 4 | GLTVLPPP |
| Spike glycoprotein WT/2 P | 1087–1094 | AHFPREGV | CMYA5 | 1 | AHFPAEGV |
| Replicase polyprotein 1ab | 4937–4944 | KYAISAKN | TTN | 1, 3, 2, 5, 12, 13, 7, 8, 4, 9, 10, 11 | KYIISAKN |
| Replicase polyprotein 1ab | 5604–5611 | LQGPPGTG | MYLK3 | 1, 4 | KQGPPGTG |
| Replicase polyprotein 1ab | 5605–5612 | QGPPGTGK | MYLK3 | 1, 4 | QGPPGTGR |
| Replicase polyprotein 1ab | 5813–5820 | NRPQIGVV | CASQ2 | 1, 2 | FRPQIGVV |
| Replicase polyprotein 1ab | 5814–5821 | RPQIGVVR | CASQ2 | 1, 2 | RPQIGVVN |
| Replicase polyprotein 1ab | 5955–5962 | DTKFKTEG | TTN | 1, 3, 2, 5, 12, 13, 7, 8, 4, 9, 10, 11 | DTKFKTTG |
| Replicase polyprotein 1ab | 5955–5963 | DTKFKTEGL | TTN | 1, 3, 2, 5, 12, 13, 7, 8, 4, 9, 10, 11 | DTKFKTTGL |
| Replicase polyprotein 1ab | 5956–5963 | TKFKTEGL | TTN | 1, 3, 2, 5, 12, 13, 7, 8, 4, 9, 10, 11 | TKFKTTGL |
| Replicase polyprotein 1ab | 6516–6523 | KPVPEVKI | CMYA5 | 1 | KPSPEVKI |
| Replicase polyprotein 1ab | 6516–6523 | KPVPEVKI | TTN | 1, 2, 5, 12, 13, 7, 8, 4, 11 | KPVPEEKI |
| Replicase polyprotein 1a/1ab | 207–214 | RAGKASCT | TTN | 1, 2, 5, 12, 13, 7, 8, 4, 11 | EAGKASCT |
| Replicase polyprotein 1a/1ab | 208–215 | AGKASCTL | TTN | 1, 2, 5, 12, 13, 7, 8, 4, 11 | AGKASCTT |
| Replicase polyprotein 1a/1ab | 345–352 | GTENLTKE | TNNI3K | 1, 3, 4 | GTESLTKE |
| Replicase polyprotein 1a/1ab | 459–466 | VNINIVGD | ENO3 | 1, 3, 2 | VNIQIVGD |
| Replicase polyprotein 1a/1ab | 492–499 | KGLDYKAF | CASQ2 | 2 | KKLDYKAF |
| Replicase polyprotein 1a/1ab | 512–519 | TKGKAKKG | MYH7 | 1 | GKGKAKKG |
| Replicase polyprotein 1a/1ab | 513–520 | KGKAKKGA | MYH7 | 1 | KGKAKKGS |
| Replicase polyprotein 1a/1ab | 879–886 | VIKTLQPV | LMOD3 | 1 | VIKTLKPV |
| Replicase polyprotein 1a/1ab | 963–970 | GATSAALQ | ANKRD2 | 1, 2 | GAQSAALQ |
| Replicase polyprotein 1a/1ab | 1143–1150 | VLLAPLLS | HJV | 1, 2, 3 | TLLAPLLS |
| Replicase polyprotein 1a/1ab | 1144–1151 | LLAPLLSA | HJV | 1, 2, 3 | LLAPLLSG |
| Replicase polyprotein 1a/1ab | 1197–1204 | KQVEQKIA | GOT1 | 1, 2 | KKVEQKIA |
| Replicase polyprotein 1a/1ab | 2246–2253 | STAALGVL | SLC4A3 | 1, 2, 3 | STAVLGVL |
| Replicase polyprotein 1a/1ab | 2533–2540 | KGSLPINV | TTN | 1, 2, 5, 12, 13, 7, 8, 4, 11 | KGSLPITV |
| Replicase polyprotein 1a/1ab | 2550–2557 | EESSAKSA | MYH7B | 1 | EESKAKSA |
| Replicase polyprotein 1a/1ab | 2630–2637 | LSTFISAA | TENM2 | 1, 2 | LSTFFSAA |
| Replicase polyprotein 1a/1ab | 2757–2764 | KIALKGGK | MYH6 | 1 | QIALKGGK |
| Replicase polyprotein 1a/1ab | 2757–2764 | KIALKGGK | MYH7 | 1 | QIALKGGK |
| Replicase polyprotein 1a/1ab | 2758–2765 | IALKGGKI | MYH6 | 1 | IALKGGKK |
| Replicase polyprotein 1a/1ab | 2758–2765 | IALKGGKI | MYH7 | 1 | IALKGGKK |
| Replicase polyprotein 1a/1ab | 3908–3915 | FEKMVSLL | MYH6 | 1 | EEKMVSLL |
| Replicase polyprotein 1a/1ab | 3908–3915 | FEKMVSLL | MYH7 | 1 | EEKMVSLL |
| Replicase polyprotein 1a/1ab | 3909–3916 | EKMVSLLS | MYH6 | 1 | EKMVSLLQ |
| Replicase polyprotein 1a/1ab | 3909–3916 | EKMVSLLS | MYH7 | 1 | EKMVSLLQ |
| Replicase polyprotein 1a/1ab | 4137–4144 | VKLQNNEL | TBX20 | 1 | VKLTNNEL |
| Putative ORF9c protein | 47–54 | AAVGELLL | ASB10 | 1, 2, 3 | AAVVELLL |
| ORF7b protein | 14–21 | LAFLLFLV | TMEM182 | 2, 1, 3 | LAGLLFLV |
| ORF7a protein | 43–50 | NSPFHPLA | FLNC | 1, 2 | NSPFHVLA |
| Nucleoprotein | 192–199 | NSSRNSTP | CMYA5 | 1 | NSSRSSTP |
| Nucleoprotein | 374–381 | KKADETQA | MYPN | 1 | EKADETQA |
| Nucleoprotein | 375–382 | KADETQAL | MYPN | 1 | KADETQAR |
List of identical cardiac peptides found in SARS-CoV-2 variants in GISAID.
| No. of GISAID entries with identical match | SARS-CoV-2 gene | Mimicked peptide | Approx. start-end in SCOV2 protein | Cardiac protein | Cardiac protein start-end | Exact IEDB epitope match | Identical IEDB epitope seq (HLA/antigen infor) [≥90% identity] |
|---|---|---|---|---|---|---|---|
| 4501 | NSP3 | STAVLGVL | 1429–1436 | SLC4A3 | 746–753 | No | HEAQAVLGVLL (HLA-B*40:01); HEAQAVLGVLL (HLA-B*40:01) |
| 1322 | NSP2 | GTESLTKE | 166–173 | TNNI3K | 294–301 | No | NA |
| 580 | N | NSSRSSTP | 193–200 | CMYA5 | 88–95 | No | TINSSRSSQESY (B-cell epitopes, MHC ligands) |
| 205 | NS9c | AAVVELLL | 48–55 | ASB10 | 307–314 | No | MVDPQLDGPQLAALA AVVELGSFDA () |
| 118 | NSP2 | KGKAKKGS | 334–341 | MYH7 | 635–642 | Yes | AGADAPIEKGKGKAKKGSS (MHC ligand) |
| 33 | NSP3 | VIKTLKPV | 60–67 | LMOD3 | 514–521 | No | KVAIKTLKPGTMS (HLA-A*02:01); |
| TKVAIKTLKPGTMSPE (HLA-A*02:01) | |||||||
| 17 | NSP3 | KKVEQKIA | 380–387 | GOT1 | 55–62 | GEKVEQKIEGKWVNEKKAQEDKLQ | |
| (MHC Class I, II, B-cell epitope) | |||||||
| 14 | NS7a | NSPFHVLA | 44–51 | FLNC | 1727–1734 | No | NA |
| 11 | N | EKADETQA | 375–382 | MYPN | 89–96 | No | ADETQALPQRQKKQQ (HLA Class II) |
| 9 | NSP3 | TLLAPLLS | 304–311 | HJV | 409–416 | NFNQHEVLLAPLLS (B-cell epitope and MHC ligand) | |
| 6 | NSP3 | LSTFFSAA | 1811–1818 | TENM2 | 1036–1043 | No | AGTLSTFFGVPLVLT (HLA class II MHC restriction) |
| 6 | NSP3 | LLAPLLSG | 327–334 | HJV | 410–417 | No | ENFNQHEVLLAPLLS |
| (B cell and MHC) | |||||||
| 6 | Spike | IGLTVLPP | 854–861 | FHOD3 | 971–978 | No | GFIKQYGDCLGDIAA RDLICAQKFNGLTVL PPLLTDEMIAQYT (T cell, B cell and MHC ligands) |
| 2 | NSP13 | QGPPGTGR | 282–289 | MYLK3 | 373–380 | No | ILYGPPGTGK (HLA-A*03:01) |
| 1 | NSP15 | KPVPEEKI | 65–72 | TTN | 10277–10284 | No | EAPLYVVDKPVPEESE (HLA-DRB1*04:01) |
| 1 | NSP3 | KGSLPITV | 1716–1722 | TTN | 5437–5444 | No | SLPITVYYAV (T cell, B cell and MHC ligands) |
The NSP proteins are cleaved products of the replicase polyprotein.
List of mutated cardiac peptide n-mers from human genetic variants identical to SARS-CoV-2 variants.
| Human cardiac gene | rsID | Mutation consequence | Mutated cardiac peptide | Wild-type cardiac peptide | No. of GISAID genomes (Pango lineage distribution in %) | SARS-CoV-2 gene | IEDB epitope exact match | IEDB epitope info (≥90 % seq identity) |
|---|---|---|---|---|---|---|---|---|
| FLNC | rs374848954 | p.Val1732Leu | NSPFHLLA | NSPFHVLA | 1661 (AY.4: 30.8% AY.44: 10.6%; B.1.1.7: 9.4%; AY.43: 6.85%; | NS7a | Yes | NSPFH (HLA-A*01:01) |
| LMOD3 | rs370869958 | p.Lys519Arg | VIKTLRPV | VIKTLKPV | 85 (B.1.617.2: 24.7%; AY.43: 20.98%; AY.4: 8.64%; B.1.1: 6.17%; B.1.1.7: 6.17%; P.1: 6.17%;) | NSP3 | No | NA |
| TMEM182 | rs774398171 | p.Gly215Val | LAVLLFLV | LAGLLFLV | 21 AY.3: 57.14%; B.1.243: 14.28%; B.1.1.7: 9.52%; B.1: 4.76%) | NS7b | No | NA |
| MYLK3 | rs771870674 | p.Arg380Cys | QGPPGTGC | QGPPGTGR | 1 (B.1) | NSP13 | No | GPPGTGKSHFAIGLA (B cell, T cell, MHC ligand) |
The NSP proteins are cleaved products of the replicase polyprotein.