Literature DB >> 33613009

Aggregation hot spots in the SARS-CoV-2 proteome may constitute potential therapeutic targets for the suppression of the viral replication and multiplication.

Abstract

The emergence of novel coronavirus SARS-CoV-2 is responsible for causing coronavirus disease-19 (COVID-19) imposing serious threat to global public health. Infection of SARS-CoV-2 to the host cell is characterized by direct translation of positive single stranded (+ ss) RNA to form large polyprotein polymerase 1ab (pp1ab), which acts as precursor for a number of nonstructural and structural proteins that play vital roles in replication of viral genome and biosynthesis of new virus particles. The maintenance of viral protein homeostasis is essential for continuation of viral life cycle in the host cell. To test whether the protein homeostasis of SARS-CoV-2 can be disrupted by inducing specific protein aggregation, we made an effort to examine whether the viral proteome contains any aggregation prone regions (APRs) that can be explored for inducing toxic protein aggregation specifically in viral proteins and without affecting the host cell. This curiosity leads to the identification of several (> 70) potential APRs in SARS-CoV-2 proteome. The length of the APRs ranges from 5 to 25 amino acid residues. Nearly 70% of total APRs investigated are relatively smaller and found to be in the range of 5-10 amino acids. The maximum number of ARPs (> 50) was observed in pp1ab. On the other hand, the structural proteins such as, spike (S), nucleoprotein (N), membrane (M) and envelope (E) proteins also possess APRs in their primary structures which altogether constitute 30% of the total APRs identified. Our findings may provide new windows of opportunities to design specific peptide-based, anti-SARS-CoV-2 therapeutic molecules against COVID-19.

Entities: Chemical

Keywords: Aggregation prone regions; COVID-19; Protein aggregations; SARS-CoV-2

Year: 2021 PMID： 33613009 PMCID： PMC7882052 DOI： 10.1007/s42485-021-00057-y

Source DB: PubMed Journal: J Proteins Proteom ISSN： 0975-8151

Introduction

The coronavirus disease 19 (COVID-19) is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), first emerged in Wuhan, China now imposing a serious threat to human life all across the globe and responsible for disruption of social and economic integrity worldwide (Arabi et al. 2020; Cucinotta and Vanelli 2020; Nicola et al. 2020; Tandon 2020). So far, patients are mostly being managed by supportive treatment using lopinavir/ritonavir, ribavirin, beta-interferon, glucocorticoid and remdesivir (Antinori et al. 2020;Chan et al. 2020; Jean et al. 2020; Salvi and Patankar 2020;Srinivas et al. 2020;Sternberg et al. 2020]. In the meantime, some novel vaccine candidates and different pharmacological approaches are under investigation [as reviewed in (Scarabelet al. 2021)]. The BNT162b2- BioNTech/Pfizer and mRNA-1273-Moderna vaccines have completed their trials and been approved by FDA and EMA (Scarabelet al. 2021). In India, the use of Covishield (developed by University of Oxford, AstraZeneca and produced by SII, India) has been approved and the other indigenous vaccine candidate Covaxin (ICMR-NIV-Bharat Biotech) is in phase-III trials (ICMR report). Various other strategies are under investigation (Sanders et al. 2020) and almost 100 different vaccine candidates have been proposed (Zhang et al. 2020) and these strategies are being validated through clinical studies and trials (Cao et al. 2020; Hung et al. 2020; Wang et al. 2020; Chen et al. 2021). However, the post-efficacy strategies for the successful vaccine candidates are the prime requirement for the mass use of these vaccines (Kim et al. 2021). Therefore, in addition to the available options, it becomes imperative to search novel therapeutic targets to curtail virus infection and multiplication. The replication cycle of SARS-CoV-2 in host cell is marked with highly synchronized processes of protein expression, protein folding, and assembly of viral genome along with structural proteins lead to formation of new virus particles (Sims et al. 2008; Fehr and Perlman 2015; Chen et al. 2020; Lukassen et al. 2020; Lunget al. 2020; Malik 2020). Maintenance of protein homeostasis in a eukaryotic cell is achieved by an integrated mechanism of protein biosynthesis, folding and attainment of native structure, and the degradation of misfolded proteins (Balchin et al. 2016; Chiti and Dobson 2017; Klaips et al. 2018; Zhong et al. 2019). After the entry into the host cell, viruses employ various strategies to hijack and regulate various biochemical and molecular activities, such as transcription and translation machineries, of the host cell to produce new viral proteins and enzymes essential for multiplication of the virus (Chen et al. 2020; Malik 2020; Salvi and Patankar 2020). Translation of viral genome represents a key event required for the establishment of infection and multiplication of SARS-CoV-2. We started our prediction using the primary structures of proteins emerging from all the known open reading frames (ORFs) of the SARS-CoV-2. The genome structure of SARS-CoV-2 contains at least six ORFs. The first ORF (known as ORF1a/b) constitutes approximately two‐thirds of the total genome length and encodes 16 nonstructural proteins (NSPS1‐16) (Gordonet al. 2020; Malik 2020). There is a − 1 frame shift between ORF1a and ORF1b, leading to production of two polypeptides: polypeptide 1a (pp1a) and polypeptide 1ab(pp1ab) having 7096 amino acid residues. These polypeptides are proteolytically cleaved to form 16 polypeptides segments that ultimately give rise nonstructural proteins (NSPS). Chymotrypsin‐like protease (3CLpro) which is virally encoded act at specific sites and help in the formation of NSPS. Other ORFs are situated at the 3′-terminus of ORF1 constitute just 1/3rd of the viral genome and encode four major structural proteins namely, spike (S), membrane (M), envelope (E), and nucleocapsid (N) proteins. The NSPS play specific roles during infection such as, degradation of host cell mRNA, inhibition of interferon (IFN) signaling, blocking the host innate immune response, promoting cytokine expression, etc. These biochemical functions of NSPS are crucial for establishment of viral infection and multiplication. The four structural proteins are vital for virion assembly and formation of new viral particles. The S protein forms a homotrimer and then form spikes on the viral surface that are responsible for initial attachment to the host receptors (Pillay 2020). The M protein has three trans-membrane domains and it shapes the virions, promotes membrane curvature, and binds to the nucleocapsid. The E protein plays a role in virus assembly and release, and it is involved in viral pathogenesis. The N protein contains two domains, which bind with virus RNA genome through an integrated action S, E and M proteins. It has often been observed that the protein aggregation frequently disrupts the protein homeostasis leading to development of various disease conditions. Protein aggregation is generally driven by specific amino acid sequences which are interspersed within the primary structure of proteins and polypeptides, known as aggregation-prone regions (APRs). The synthetic analogs of such APRs sequences contain the ability to self-assemble to form aggregates rich β-sheet structures. Further, these APRs are shown to interact with similar sequences present in parent proteins and peptides through homologous interaction and induce aggregation. Hence, these APRs have been successfully explored for the targeted disruption of protein homeostasis. Several recent studies have confirmed that the presence of synthetic analogs of these sequence-stretches (i.e., APRs) effectively block the folding of the original proteins and render them for degradation by the proteasomal degradation machinery of the host cell (Beerten et al. 2012; De Baets et al. 2014; Gallardo et al. 2016; Ganesan et al. 2016; Khodaparast et al. 2018).To explore the possibility of targeted protein aggregation to curtail SARS-CoV-2 infection, we screened the viral proteome to find out presence of APRs. Our initial studies suggest that the primary structures of many of the key proteins such as, polyprotein polymerase 1ab (pp1ab), envelope protein (E), nucleoprotein (N), membrane (M) protein, etc., are marked by the presence of small amino acid sequence-stretches possessing high aggregation propensity. On the other hand, it has been observed that the peptide (APR)-induced protein aggregation turns out to be a highly ordered and specific process. Since these APRs form essential elements of the native proteins, in unfolded state (immediately after translation), can interact with synthetic analogs of APRs and induce aggregation of entire proteins and finally subject the protein molecules for degradation rather than their folding into functional proteins.

Methods

Prediction of the potential aggregation prone regions (APRs) in the SARS-CoV-2 proteome

The complete genome of Wuhan-Hu-1 (NC_045512.2) was downloaded from NCBI nucleotide database. The aggregation propensity of all the SARS-CoV-2 proteins primary structure was assessed by using in silico predictions. These primary structures of the proteins were sequentially submitted to different computation algorithms namely FoldAmyloid (http://bioinfo.protres.ru/FoldAmyloid/) (Garbuzynskiy et al. 2010), TANGO (http://tango.crg.es/) (Fernandez-Escamilla et al. 2004), AGGRESCAN (http://bioinf.uab.es/aggrescan/) (Conchillo-Soleet al. 2007), and AMYLPRED (http://aias.biol.uoa.gr/AMYLPRED/input.php) (Frousios et al. 2009) with the default setting. The scores were compared with classical aggregating peptide i.e., Amyloid beta (Aβ) peptide.

Result and discussion

Prediction of APRs in different proteins emerging from different ORFs of SARS-CoV-2

Table 1 summarizes the locations of the predicted APRs in different structural and nonstructural proteins of SARS-CoV-2. The APRs are found to be asymmetrically distributed in the different regions of all the proteins investigated. As mentioned earlier, pp1a and pp1ab are the two large polypeptides that formed from direct translation of virus genome after its entry into host cells. Given the fact that 2/3rd proportion of the total virus genome is utilized for the synthesis of NSPS, they are very crucial for the continuation of virus replication cycle (Masters 2006; Chen et al. 2020). NSP-1 is the first non-structural protein formed from pp1ab, obstructs translation of host mRNA by interfering with the 40S ribosomal subunit (Raj 2021). The primary structure of NSP-1 contains 180 residues and it was found to be free from any APRs in it. Similarly, the region corresponding to NSP-13 spanning from 5325 to 5925 residues does not contain any aggregation prone regions in it. In the polypeptide segments corresponding to NSP-2 to NSP-12 and NSP-14 to NSP-16 contain several APRs. NSP-2, 637 residues in its primary structure, is the second nonstructural protein and found to have 6 potential APRs ranging from 6 to 13 residues in length. The maximum numbers of APRs are found in the segment spanning from 3570 to 3859 residues, which corresponds to NSP-6. The total APRs in this region constitute > 35% residues of the total protein. Along with NSP-3 and NSP-4, NSP-6 plays vital role in creation of cytoplasmic double-membrane vesicles essential for viral replication. On the other hand, NSP-6 also plays important role in preventing delivery of the viral components to lysosomes of the host cell and hence protects the virus from lysosomal inactivation (Gordon et al. 2020). Hence, truncating NSP-6 would be helpful in enhancing host mediated destruction of the virus. Similarly, the polypeptide regions corresponding to NSP-3 and NSP-4 consist of large number of APRs (Fig. 1).

Table 1

Location of newly identified aggregation prone regions in different proteins of SARS-CoV-2

	Amino acid sequences	Positions	Residues	Amino acid Sequence of APRs	Length of APRs
Polyprotein polymerase 1ab
Nsp1	MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQHLKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETLGVLVPHVGEIPVAYRKVLLRKNGNKGAGGHSYGADLKSFDLGDELGTDPYEDFQENWNTKHSSGVTRELMRELNGG	1–180	180	Nil
Nsp2	AVTRYVDNNFCGPDGYPLDCIKDFLARAGKSMCTLSEQLDYIESKRGVYCCRDHEHEIAWFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFPLNSKVKVIQPRVEKKKTEGFMGRIRSVYPVASPQECNNMHLSTLMKCNHCDEVSWQTCDFLKATCEHCGTENLVIEGPTTCGYLPTNAVVKMPCPACQDPEIGPEHSVADYHNHSNIETRLRKGGRTRCFGGCVFAYVGCYNKRAYWVPRASADIGSGHTGITGDNVETLNEDLLEILSRERVNINIVGDFHLNEEVAIILASFSASTSAFIDTIKSLDYKSFKTIVESCGNYKVTKGKPVKGAWNIGQQRSVLTPLCGFPSQAAGVIRSIFARTLDAANHSIPDLQRAAVTILDGISEQSLRLVDAMVYTSDLLTNSVIIMAYVTGGLVQQTSQWLSNLLGTTVEKLRPIFEWIEAKLSAGVEFLKDAWEILKFLITGVFDIVKGQIQVASDNIKDCVKCFIDVVNKALEMCIDQVTIAGAKLRSLNLGEVFIAQSKGLYRQCIRGKEQLQLLMPLKAPKEVTFLEGDSHDTVLTSEEVVLKNGELEALETPVDSFTNGAIVGTPVCVNGLMLLEIKDKEQYCALSPGLLATNNVFRLKGG	181–818	637	₄₀₉CVFAYV₄₁₅ ₄₇₃VAIILASF₄₈₀ ₅₆₅AAVTIL₅₇₀ ₅₉₅VIIMAYVTG₆₀₃ ₆₄₅AWEILKFLITGVF₆₅₇ ₆₇₅VKCFIDVV₆₈₂	6 8 6 9 13 8
Nsp3	APTKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEKCSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSGYLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIAEIPKEEVKPFITESKPSVEQRKQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSIISNEKQEILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSSKTPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLSLREVRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYVLPNDDTLRVEAFEYYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCTEIDPKLDNYYKKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWSTKPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYFFTLLLQLCTFTRSTNSRIKASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIIIWFLLLSVCLGSLIYSTAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLETIQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQLFFSYFAVHFISNSWLMWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGG	819–2763	1945	₁₁₇₃VYLAVF₁₁₇₈ ₁₂₉₅VLTAVV₁₃₀₀ ₁₅₇₀VFTTV₁₅₇₄ ₁₆₇₆LATALLT₁₆₈₂ ₁₇₁₀FCALILAY₁₇₁₇ ₂₁₇₁YFFTLLL₂₁₇₇ ₂₂₂₉IIIWFLLLSVCLGSLI₂₂₄₄ ₂₃₂₄VAEWFLAYILFTRFFYV₂₃₄₀ ₂₃₆₃WLMWLIINLV₂₃₇₂ ₂₃₈₄YIFFASFYYVW₂₃₉₄ ₂₅₃₈INVIVF₂₅₄₃ ₂₇₀₉IALIWNV₂₇₁₅	6 6 5 7 8 7 16 17 10 11 6 7
Nsp4	KIVNNWLKQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFANKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGDFLHFLPRVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVLEGSVAYESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCVSTSGRWVLNNDYYRSLPGVFCGVDAVNLLTNMFTPLIQPIGALDISASIVAGGIVAIVVTCLAYYFMRFRRAFGEYSHVVAFNTLLFLMSFTVLCLTPVYSFLPGVYSVIYLYLTFYLTNDVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLAKALNDFSNSGSDVLYQPPQTSITSAVLQ	2764–3263	500	₂₇₇₆VTLVFLFVAAIFYLI₂₇₉₀ ₂₈₅₃LIAAVIT₂₈₅₉ ₂₉₇₅VVTTF₂₉₇₉ ₃₀₅₂IVAIVVTCLAYYF₃₀₆₄ ₃₀₇₇VAFNTLLFLMSFTVLCL₃₀₉₄ ₃₁₀₄VYSVIYLYLTFYL₃₁₁₆ ₃₁₃₈FWITIAYIICI₃₁₄₈ ₃₁₅₃FYWFF₃₁₅₇ ₃₁₈₀LCTFLL₃₁₈₅	15 7 5 13 17 13 11 5 6
Nsp5	SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQ	3264–3569	306	₃₄₆₃ITVNVLAWLYAAVI₃₄₇₆	14
Nsp6	SAVKRTIKGTHHWLLLTILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRIMTWLDMVDTSLSGFKLKDCVMYASAVVLLILMTARTVYDDGARRVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLARGIVFMCVEYCPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQ	3570–3859	290	₃₅₈₂WLLLTILTSLLVLV₃₅₉₅ ₃₆₁₆MGIIAMSAFAMMFV₃₆₂₉ ₃₆₃₅FLCLFL₃₆₄₀ ₃₆₄₄LATVAYFNMVY₃₆₅₄ ₃₆₈₃VMYASAVVLLILMT₃₆₉₆ ₃₇₀₉WTLMNVLTLVY₃₇₁₉ ₃₇₃₃MWALIISV₃₇₄₀ ₃₇₄₇VVTTVMFLA₃₇₅₅ ₃₇₅₈IVFMCV₃₇₆₃ ₃₇₇₉IMLVYCFLGYFCTCYF₃₇₉₄	14 14 6 11 14 11 8 9 6 16
Nsp7	SKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQ	3860–3942	83	₃₈₇₀VVLLSVL₃₈₇₆ ₃₉₁₁MVSLLSVLL₃₉₁₉	7 9
Nsp8	AIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNLAWPLIVTALRANSAVKLQ	3943–4140	198	₁₂₈LMVVI₁₃₂ ₁₈₄LIVTAL₁₈₉	5 6
Nsp9	NNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQ	4141–4253	113	₄₁₈₀FVLALL₄₁₈₅	6
Nsp10	AGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQ	4254–4392	139	₄₂₆₆VLSFCAFA₄₂₇₃	8
Nsp12	SADAQSFLNRVCGVSAARLTPCGTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFSNYQHEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAMRNAGIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAESHVDTDLTKPYIKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLFSTVFPPTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTVLQ	4393–5324	932	₄₅₉₃IVGVL₄₅₉₇ ₄₇₆₃LLVYA₄₇₆₇ ₄₈₆₁LLFVV₄₈₆₅	5 5 5
Nsp13	AVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAETLKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVYRGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKVGILCIMSDRDLYDKLQFTSLEIPRRNVATLQ	5325–5925	601
Nsp14	AENVTGLFKDCSKVITGLHPTQAPTHLSVDTKFKTEGLCVDIPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNAHVASCDAIMTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHMVVKAALLADKFPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKFTDGVCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFVNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRLYLDAYNMMISAGFSLWVYKQFDTYNLWNTFTRLQ	5926–6452	527	₆₁₀₆VVFVLW₆₁₁₁ ₆₃₀₆VCLFW₆₃₁₀ ₆₄₃₁FSLWVY₆₄₃₆	6 5 6
Nsp15	SLENVAFNVVNKGHFDGQQGEVPVSIINNTVYTKVDGVDVELFENKTTLPVNVAFELWAKRNIKPVPEVKILNNLGVDIAANTVIWDYKRDAPAHISTIGVCSMTDIAKKPTETICAPLTVFFDGRVDGQVDLFRNARNGVLITEGSVKGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQQLPETYFTQSRNLQEFKPRSQMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESPFELEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTIDYTEISFMLWCKDGHVETFYPKLQ	6453–6798	346	₆₄₅₇VAFNVV₆₄₆₂ ₆₅₇₁LTVFF₆₅₇₅ ₆₇₇₉ISFMLW₆₇₈₄	6 5 6
Nsp16	SSQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSATLPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTYICGFIQQKLALGGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGCNYLGKPREQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQINDMILSLLSKGRLIIRENNRVVISSDVLVNN	6799–7096	298	₆₉₄₇FFTYICGFI₆₉₅₅ ₆₉₈₅FAWWTAFV₆₉₉₂ ₇₀₆₉ILSLL₇₀₇₃	9 8 5
Spike Protein	MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFRSSVHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIRGWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVYSSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQGFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFLLKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITNLCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCFTNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYNYLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPYRVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFGRDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAIHADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPRRARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFGGFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFNGLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSEPVLKGVKLHYT	1–1273	1273	₂FVFLVL₇ ₁₄₀FLGVYY₁₄₅ ₅₁₀VVVLSF₅₁₅ ₁₀₆₀VVFL₁₀₆₃ ₁₁₂₈VVIGIV₁₁₃₃ ₁₂₁₅YIWLGFIAGLIAIVMVTI₁₂₃₂	6 6 6 4 6 18
E-protein	MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCCNIVNVSLVKPSFYVYSRVKNLNSSRVPDLLV	1–75	75	₁₇VLLFLAFVVFLLVTLAIL₃₄	18
M-protein	MADSNGTITVEELKKLLEQWNLVIGFLFLTWICLLQFAYANRNRFLYIIKLIFLWLLWPVTLACFVLAAVYRINWITGGIAIAMACLVGLMWLSYFIASFRLFARTRSMWSFNPETNILLNVPLHGTILTRPLLESELVIGAVILRGHLRIAGHHLGRCDIKDLPKEITVATSRTLSYYKLGASQRVAGDSGFAAYSRYRIGNYKLNTDHSSSSDNIALLVQ	1–222	222	₂₂LVIGFLFLTWICLLQFA₃₈ ₅₁LIFLWLL₅₇ ₆₀VTLACFVLAAVY₇₁ ₈₀IAIAMACLVGLMWLSYFI₉₇ ₁₃₈LVIGAVIL₁₄₅	17 7 12 18 8
N-protein	MSDNGPQNQRNAPRITFGGPSDSTGSNQNGERSGARSKQRRPQGLPNNTASWFTALTQHGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGKMKDLSPRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSRGTSPARMAGNGGDAALALLLLDRLNQLESKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGRRGPEQTQGNFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYTGAIKLDDKDPNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQTVTLLPAADLDDFSKQLQQSMSSADSTQA	1–419	419	₁₀₈WYFYYL₁₁₃ ₁₂₉GIIWV₁₃₃ ₂₁₉LALLLL₂₂₄	6 5 6
Aβ (1–42) peptide	DAEFRHDSGYEVHHQKLVFFAEDVGSNKGAIIGLMVGGVVIA	1–42	42	₁₇LVFFA₂₁ ₃₀AIIGLMVGGVVI₄₁	5 12

Fig. 1

Identification of aggregation prone regions (APRs) in the major proteins of SARS-CoV-2. The aggregation score and propensity in the predicted APRs found to be equivalent to the Abeta peptide, which serves as a classical β-structured aggregates

Location of newly identified aggregation prone regions in different proteins of SARS-CoV-2 409CVFAYV415 473VAIILASF480 565AAVTIL570 595VIIMAYVTG603 645AWEILKFLITGVF657 675VKCFIDVV682 6 8 6 9 13 8 1173VYLAVF1178 1295VLTAVV1300 1570VFTTV1574 1676LATALLT1682 1710FCALILAY1717 2171YFFTLLL2177 2229IIIWFLLLSVCLGSLI2244 2324VAEWFLAYILFTRFFYV2340 2363WLMWLIINLV2372 2384YIFFASFYYVW2394 2538INVIVF2543 2709IALIWNV2715 6 6 5 7 8 7 16 17 10 11 6 7 2776VTLVFLFVAAIFYLI2790 2853LIAAVIT2859 2975VVTTF2979 3052IVAIVVTCLAYYF3064 3077VAFNTLLFLMSFTVLCL3094 3104VYSVIYLYLTFYL3116 3138FWITIAYIICI3148 3153FYWFF3157 3180LCTFLL3185 15 7 5 13 17 13 11 5 6 3582WLLLTILTSLLVLV3595 3616MGIIAMSAFAMMFV3629 3635FLCLFL3640 3644LATVAYFNMVY3654 3683VMYASAVVLLILMT3696 3709WTLMNVLTLVY3719 3733MWALIISV3740 3747VVTTVMFLA3755 3758IVFMCV3763 3779IMLVYCFLGYFCTCYF3794 14 14 6 11 14 11 8 9 6 16 3870VVLLSVL3876 3911MVSLLSVLL3919 7 9 128LMVVI132 184LIVTAL189 5 6 4593IVGVL4597 4763LLVYA4767 4861LLFVV4865 5 5 5 6106VVFVLW6111 6306VCLFW6310 6431FSLWVY6436 6 5 6 6457VAFNVV6462 6571LTVFF6575 6779ISFMLW6784 6 5 6 6947FFTYICGFI6955 6985FAWWTAFV6992 7069ILSLL7073 9 8 5 Spike Protein 2FVFLVL7 140FLGVYY145 510VVVLSF515 1060VVFL1063 1128VVIGIV1133 1215YIWLGFIAGLIAIVMVTI1232 6 6 6 4 6 18 22LVIGFLFLTWICLLQFA38 51LIFLWLL57 60VTLACFVLAAVY71 80IAIAMACLVGLMWLSYFI97 138LVIGAVIL145 17 7 12 18 8 108WYFYYL113 129GIIWV133 219LALLLL224 6 5 6 17LVFFA21 30AIIGLMVGGVVI41 5 12 Identification of aggregation prone regions (APRs) in the major proteins of SARS-CoV-2. The aggregation score and propensity in the predicted APRs found to be equivalent to the Abeta peptide, which serves as a classical β-structured aggregates Apart from pp1ab, the structural proteins also contain sequence stretches of high significant aggregation score. There were six potential APRs identified in S protein, however, the length of APRs use to be relatively shorter except the APR present at N-terminus of the protein. The E-protein is the smallest structural protein (75 residues) and known to play essential role in the virus morphogenesis (Liu et al. 2007), consists of a single potential APR. The M-protein constitutes an essential component of virus along with other structural proteins and plays a central role in virus morphogenesis and assembly via its interactions with other viral proteins (Neuman et al. 2011). It consists of five APRs ranging from 7 to 18 amino acid residues. The N-protein consists of relatively less number of shorter APRs compared to other structural proteins. Among all the four structural proteins the S-protein and M-proteins are comparatively richer in the APRs compared to N and E-proteins. The score of individual APRs range from 20 to 100. However, most of the APRs have aggregation score above 50, indicative of less chance to give false positive values. The lengths of most of the APRs identified in the SARS-CoV-2 proteome are in the range of 5–8 residues (Table 1). Most of the APRs in pp1ab possess are found to be relatively shorter in length compared to the one observed in structural proteins. It is observed that the shorter APR peptides (of ≈ 6 residues) found to be giving better prediction reliability compared to the larger one. On the other hand, it has also been established that the longer APRs possesses greater tendency to display false positives compared to the shorter ones. It has been observed that APRs of shorter length possess high aggregation propensity and interact more efficiently with the identical sequences in the large peptides or proteins compared to longer APRs. The legitimacy of the predicted APRs is based on the reliability of mathematical and statistical lucidities. The computational algorithm TANGO uses a statistical mechanics approach to make predictions of different secondary structures present in different regions for a given proteins (Pande 2004). The algorithm assumes a particular amino acid sequence (of at least five consecutive residues) is aggregation-prone if it has high propensity to form β-sheet structure and when this sequence form aggregate all the residues of the β-region are buried in the hydrophobic interior. It predicts the aggregation propensity in a sequence specific manner and presents the data in the form of beta-aggregation score and its value range from 1 to 100. It is reported that the TANGO score of 5 per residue gives a Matthews correlation coefficient between prediction and experiment of 0.92 (Fernandez-Escamillaet al. 2004). Further, it has been shown that the false-positive rate of TANGO is below 5% for a TANGO score of more than 15 (Bednarskaet al. 2016). Most of the classical amyloidogenic peptides possess the aggregation score above 50 and hence we gathered all the sequence stretches displaying the score above it. The overall score above 90 suggest high aggregation propensity with less probability of getting false positive. The data obtained from Tango were further analyzed by using other analogous algorithms such as Aggrescan, AmylPred and FoldAmyoid. In all the predictions we used amyloid beta (Aβ1-42) peptide as a reference due to its ability to form classical aggregates rich in β-structures. The AGGRESCAN program predicts the aggregation prone regions in a protein as “hot spot” sequences of 5 to 11 residues that can nucleate aggregation in peptides and proteins. The aggregation propensity of the hot spots is determined largely by amino acid composition, which is based on the experimentally determined aggregation propensity scale for individual amino acids. The FoldAmyloid program predicts short amino acid sequences (≥ 5 residues) based on the contacts, packing density, backbone H-bonds of acceptors or donors for prediction of aggregation prone regions. AmylPred combines the data from SecStr, a secondary structure prediction tool, to predict the amino acid sequence in protein that can act as potential conformational switch. As shown in Fig. 2, the APRs identified in different viral proteins by all the four algorithms are found to be unanimous.

Fig. 2

Consensus among different methods for the prediction of aggregation prone regions in the different proteins of SARS-CoV-2

Mechanistic outlook of APRs-induced disruption SARS-CoV-2 protein homeostasis

For the first time the mechanism of APR-induced disruption of protein homeostasis action was proposed by Balch et al. (2008). They showed that the disruption of bacterial protein homeostasis can be induced by small aggregating peptides resulting into formation of toxic protein aggregates in the bacterial cell. Generally, the ordered protein aggregation is facilitated through the formation of intermolecular β-structures by short polypeptide sequence with high aggregation propensity. Presence of such sequences define the basis of amyloid formation in various disease conditions, particularly the most debilitating Alzheimer’s and Parkinson’s diseases. Similar sequences are commonly present in various globular proteins that constitute their hydrophobic core and confer structural stability. They also assist oligomeric proteins by forming protein–protein interfaces. Despite the fact that these sequences participate in providing stability to the native proteins, they can self-assemble with identical sequences to form β-structured aggregates in unfolded state. While forming the β-structured aggregates, it is often observed that their interactions with identical sequences in denatured proteins use to be more efficient than the partially similar sequences. Therefore, we hypothesize that the amino acid sequence stretches with high aggregation propensity derived from SARS-CoV-2 proteins could be able to induce specific protein aggregation leading to virucidal activity against the virus. Although, these APRs self-assemble to form β-structured aggregates and initiate seeding of identical peptides or the denatured proteins who’s APRs are exposed. Therefore, to target a specific protein using APRs, it is a prerequisite that the protein must remain in unfolded state so that the interaction between homologous APRs becomes feasible. Following the viral entry, the direct translation of the pp1ab is one of the most essential steps for initiation of virus replication cycle. The APRs present in the polypeptide remain transiently exposed during translation. It is that time point when the synthetic analogs of APRs can be effectively be used to target specific proteins by interfering the protein folding reactions of polypeptide chains into functional proteins. As shown in Fig. 3, the repeated interruption of protein folding, aggregation and degradation will lead to deprivation of key proteins leading to suppression of the viral replication and multiplication. On the other hand the proteome of SARS-CoV-2 is highly specific and hence these APRs are not likely to interfere with the protein homeostasis of the host cells.

Fig. 3

Schematic representation of APR peptide-based inhibition of viral replication. The events in the region one represents usual cycle of infection, release of viral + ssRNA AND its direct translation to form pp1ab which subsequently forms all the nonstructural proteins (NSPS). The NSPS, are used in amplification of viral genomic + ssRNA, formation of structural and other accessory proteins. At the end genomic + ssRNA assemble with structural proteins to form new viral particles. The events depicted in the region 2 (left side) depict the events leading to APR peptide-based targeting of proteins formed from ORF1a/ORF1ab (pp1ab). Addition of APR peptides will interfere the protein folding reaction of viral proteins and subject them for proteasomal degradation in the host cell. Depletion of essential viral proteins will lead to complete halt of the viral replication and formation of new viral particles

Conclusion

The maintenance of viral protein homeostasis remains as one of the most crucial steps for continuation of viral life cycle. The presence of APRs in the SARS-CoV-2 proteins constitutes susceptible proteomic segments that might act as hot spots for the commencement of the viral protein homeostasis failure. Taking the advantage of distinctive viral protein expression, folding and assembly of viral proteins, we propose a hypothesis that the disruption of protein homeostasis during viral replication will be able to prevent formation of new viral particles. Maintenance of integrated protein homeostasis is essential and remains at the highest risk during translation. Targeted aggregation of viral proteins, specifically during translation, would be able deplete the functional proteins and imposes an explicit inhibitory effect on viral replication and multiplication. The primary structures of SARS-CoV-2 proteins are marked by the presence of small unique sequences that would play vital role in inhibiting the formation of functional proteins and hence prevent the viral replication and multiplication. A recent study has shown that viral translation, splicing and nucleic acid metabolism constitute viable therapeutic targets for COVID-19 (Bojkova et al. 2020). Hence, the development of an exclusive and multi-target strategy to disrupt the protein homeostasis will represent an attractive and potential anti-SARS-CoV-2 strategy. At present, we are actively engaged in synthesizing all peptides analogous to the identified APRs and characterizing their biophysical characteristics and we hope that the APR-induced proteostatic disruptions will provide an innovative approach to fight with COVID-19.

46 in total

Review 1. The molecular biology of coronaviruses.

Authors: Paul S Masters
Journal: Adv Virus Res Date: 2006 Impact factor: 9.937

Review 2. COVID-19: An Update on the Epidemiological, Clinical, Preventive and Therapeutic Evidence and Guidelines of Integrative Chinese-Western Medicine for the Management of 2019 Novel Coronavirus Disease.

Authors: Kam Wa Chan; Vivian Taam Wong; Sydney Chi Wai Tang
Journal: Am J Chin Med Date: 2020-03-13 Impact factor: 4.667

3. Novel Drugs Targeting the SARS-CoV-2/COVID-19 Machinery.

Authors: Ariane Sternberg; Dwight L McKee; Cord Naujokat
Journal: Curr Top Med Chem Date: 2020-05-16 Impact factor: 3.295

Review 4. Modulating protein-protein interaction networks in protein homeostasis.

Authors: Mengqi Zhong; Gregory M Lee; Eline Sijbesma; Christian Ottmann; Michelle R Arkin
Journal: Curr Opin Chem Biol Date: 2019-03-23 Impact factor: 8.822

5. De novo design of a biologically active amyloid.

Authors: Rodrigo Gallardo; Meine Ramakers; Frederik De Smet; Filip Claes; Ladan Khodaparast; Laleh Khodaparast; José R Couceiro; Tobias Langenberg; Maxime Siemons; Sofie Nyström; Laurence J Young; Romain F Laine; Lydia Young; Enrico Radaelli; Iryna Benilova; Manoj Kumar; An Staes; Matyas Desager; Manu Beerens; Petra Vandervoort; Aernout Luttun; Kris Gevaert; Guy Bormans; Mieke Dewerchin; Johan Van Eldere; Peter Carmeliet; Greetje Vande Velde; Catherine Verfaillie; Clemens F Kaminski; Bart De Strooper; Per Hammarström; K Peter R Nilsson; Louise Serpell; Joost Schymkowitz; Frederic Rousseau
Journal: Science Date: 2016-11-11 Impact factor: 47.728

Review 6. Coronaviruses: an overview of their replication and pathogenesis.

Authors: Anthony R Fehr; Stanley Perlman
Journal: Methods Mol Biol Date: 2015

7. Structural hot spots for the solubility of globular proteins.

Authors: Ashok Ganesan; Aleksandra Siekierska; Jacinte Beerten; Marijke Brams; Joost Van Durme; Greet De Baets; Rob Van der Kant; Rodrigo Gallardo; Meine Ramakers; Tobias Langenberg; Hannah Wilkinson; Frederik De Smet; Chris Ulens; Frederic Rousseau; Joost Schymkowitz
Journal: Nat Commun Date: 2016-02-24 Impact factor: 14.919

8. Triple combination of interferon beta-1b, lopinavir-ritonavir, and ribavirin in the treatment of patients admitted to hospital with COVID-19: an open-label, randomised, phase 2 trial.

Authors: Ivan Fan-Ngai Hung; Kwok-Cheung Lung; Eugene Yuk-Keung Tso; Raymond Liu; Tom Wai-Hin Chung; Man-Yee Chu; Yuk-Yung Ng; Jenny Lo; Jacky Chan; Anthony Raymond Tam; Hoi-Ping Shum; Veronica Chan; Alan Ka-Lun Wu; Kit-Man Sin; Wai-Shing Leung; Wai-Lam Law; David Christopher Lung; Simon Sin; Pauline Yeung; Cyril Chik-Yan Yip; Ricky Ruiqi Zhang; Agnes Yim-Fong Fung; Erica Yuen-Wing Yan; Kit-Hang Leung; Jonathan Daniel Ip; Allen Wing-Ho Chu; Wan-Mui Chan; Anthony Chin-Ki Ng; Rodney Lee; Kitty Fung; Alwin Yeung; Tak-Chiu Wu; Johnny Wai-Man Chan; Wing-Wah Yan; Wai-Ming Chan; Jasper Fuk-Woo Chan; Albert Kwok-Wai Lie; Owen Tak-Yin Tsang; Vincent Chi-Chung Cheng; Tak-Lun Que; Chak-Sing Lau; Kwok-Hung Chan; Kelvin Kai-Wang To; Kwok-Yung Yuen
Journal: Lancet Date: 2020-05-10 Impact factor: 79.321

Review 9. Emerging coronaviruses: Genome structure, replication, and pathogenesis.

Authors: Yu Chen; Qianyun Liu; Deyin Guo
Journal: J Med Virol Date: 2020-02-07 Impact factor: 2.327

10. WHO Declares COVID-19 a Pandemic.

Authors: Domenico Cucinotta; Maurizio Vanelli
Journal: Acta Biomed Date: 2020-03-19