Literature DB >> 34117310

The major genetic risk factor for severe COVID-19 does not show any association among South Asian populations.

Prajjval Pratap Singh1, Anshika Srivastava1, Gazi Nurun Nahar Sultana2, Nargis Khanam1, Abhishek Pathak3, Prashanth Suravajhala4, Royana Singh5, Pankaj Shrivastava6, George van Driem7, Kumarasamy Thangaraj8,9, Gyaneshwer Chaubey10.   

Abstract

With the growing evidence on the variable human susceptibility against COVID-19, it is evident that some genetic loci modulate the severity of the infection. Recent studies have identified several loci associated with greater severity. More recently, a study has identified a 50 kb genomic segment introgressed from Neanderthal adding a risk for COVID-19, and this genomic segment is present among 16% and 50% people of European and South Asian descent, respectively. Our studies on ACE2 identified a haplotype present among 20% and 60% of European and South Asian populations, respectively, which appears to be responsible for the low case fatality rate among South Asian populations. This result was also consistent with the real-time infection rate and case fatality rate among various states of India. We readdressed this issue using both of the contrasting datasets and compared them with the real-time infection rates and case fatality rate in India. We found that the polymorphism present in the 50 kb introgressed genomic segment (rs10490770) did not show any significant correlation with the infection and case fatality rate in India.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34117310      PMCID: PMC8196069          DOI: 10.1038/s41598-021-91711-4

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Since the beginning of COVID-19 pandemic, it has been observed that people with a different ethnic background and country or continent of origin have variable degrees of susceptibility[1,2]. Though there are a few well known factors for higher susceptibility, e.g. age and comorbidity[3,4], the hospitalisation of younger healthy people has also been reported[5]. Recent genome wide association study has identified a gene cluster at chromosome 3 as well as the ABO gene at chromosome 9 associated with the severe risk factor for COVID-19 among Europeans[6]. Subsequently, the COVID-19 Host Genetics Initiative has corroborated this result[7]. The worldwide meta-analysis of the COVID-19 Host Genetics Initiative has identified 13 genetic loci associated with higher susceptibility or higher severity[8]. Zeberg and Pääbo[9] have identified a risk haplotype of 50 kb size introgressed from Neanderthals, which they called the ‘Neanderthal core haplotype’. This risk haplotype was found to be present with an allele frequency of 30% among South Asians, 8% in Europeans and 4% among admixed Americans. The peak carrier frequency was estimated among the Bangladeshi population, where 63% carried at least one copy of this haplotype. The study also cited twice the risk of mortality among people of Bangladeshi extraction living in the UK as opposed to the native population of Brittanic pedigree[10]. Conversely, three of our studies on ACE2, the gateway of SARS-CoV-2, identified a haplotype, shared among South Asians and East Eurasians, likely protecting them from severe risk[11-13]. Additionally, the spatial distribution of this haplotype showed strong association with the low infection as well as low case fatality rate (CFR)[13]. To resolve this discrepancy between the two sets of findings and the associated claims, we have extracted a SNP (rs10490770) reported to be associated with the high risk for COVID-19[9], from our published and unpublished genome wide datasets (Supplementary Table S1), and looked for existing association with the state-wise COVID-19 data of India.

Materials and methods

Zeberg and Pääbo[9] have mainly discussed about the SNP rs35044562. However, they reported 12 other SNPs present in the ‘Neanderthal core haplotype’ that are in high linkage disequilibrium (r2 > 0.98) (Supplementary Table S2). SNP rs10490770 showed high LD (r2 = 0.99) with the SNP rs35044562. The genome-wide genotype data by Illumina tagged rs2285666 and rs10490770 SNPs in their panel. Therefore, we searched the genotype datasets generated by this platform. The frequency data for both of the SNPs from various Indian populations were extracted using Plink 1.9[14], from 1000 genome project data phase 3[15], data published by the Estonian Biocentre[16-19] and our newly genotyped samples for various Indian states and Bangladesh (Supplementary Table S1). In addition to our previous study[13], more samples were added for the SNP rs2285666. The state-wise COVID-19 infection and CFR datasets were extracted from https://www.covid19india.org/. The regression estimations and plots were built by https://www.graphpad.com/quickcalcs/linear1/ and reverified by the Microsoft Excel regression calculations. We have also used Pearson’s correlation coefficient test[20] to evaluate the effect of both the SNPs. The spatial distribution of both SNPs were drawn using the web tool available at https://www.datawrapper.de/.

Results and discussion

In contrast to the conclusions drawn by Zeberg and Pääbo[9], our study on ACE2 identified a haplotype that is frequent among South Asians and East Eurasians[11-13]. This haplotype is derived by a polymorphism rs2285666 responsible for elevated expression of ACE2. We have found high inverse correlation of this haplotype with the state-wise cases as well as the case-fatality rate (CFR) among Indian populations[13]. This correlation was significant at various timelines of the pandemic in India (Table 1). We verified the statistical tests with the updated data up to December 2020 and found these data to be consistent with previous observations (Fig. 1 and Supplementary Fig. S1). Thus, it is likely that the ACE2 SNP rs2285666 has played a significant role in modulating the susceptibility to the disease among Indian populations.
Table 1

Estimates of Pearson correlation coefficient for the rs2285666 and rs10490770 with the real-time COVID-19 cases as well as case fatality rate among Indian populations. The significant values are shown in bold letters.

rs2285666 (T)rs10490770 (C)FreqMay2020FreqAug2020FreqDec2020CFRAug2020CFRAug2020
rs2285666

Pearson correlation

Sig. (2-tailed)

1

− 0.428

0.098

− 0.588*

0.017

− 0.558*

0.025

− 0.580*

0.018

− 0.649**

0.007

− 0.568

0.022

rs10490770

Pearson correlation

Sig. (2-tailed)

− 0.428

0.098

1

0.204

0.450

0.164

0.543

0.266

0.319

0.244

0.362

− 0.026

0.924

*Correlation is significant at the 0.05 level (2-tailed).

**Correlation is significant at the 0.01 level (2-tailed).

Figure 1

The regression analyses of rs2285666 (ACE2) and rs10490770 (LZTFL1) with the COVID-19 cases as well as case fatality rate in various states of India (Supplementary Table S1).

Estimates of Pearson correlation coefficient for the rs2285666 and rs10490770 with the real-time COVID-19 cases as well as case fatality rate among Indian populations. The significant values are shown in bold letters. Pearson correlation Sig. (2-tailed) − 0.428 0.098 − 0.588* 0.017 − 0.558* 0.025 − 0.580* 0.018 − 0.649** 0.007 − 0.568 0.022 Pearson correlation Sig. (2-tailed) − 0.428 0.098 0.204 0.450 0.164 0.543 0.266 0.319 0.244 0.362 − 0.026 0.924 *Correlation is significant at the 0.05 level (2-tailed). **Correlation is significant at the 0.01 level (2-tailed). The regression analyses of rs2285666 (ACE2) and rs10490770 (LZTFL1) with the COVID-19 cases as well as case fatality rate in various states of India (Supplementary Table S1). In our search of the SNPs reported to be associated with high risk by Zeberg and Pääbo[9], we found rs10490770 from genome-wide datasets[17,18,21-23]. We applied the same tests done for the ACE2 SNPs (Fig. 1). The state-wise frequency variation of this SNP did not show any association either with the number of cases or the CFR (Table 1 and Supplementary Fig. S1). We repeated these regression tests for the number of cases as well as the CFR data, obtained during all the three months. However, none of them showed any association with the rs10490770 (p > 0.3) (Table 1). It is interesting to note that this SNP (rs10490770) has been found to be associated with disease severity in the global data[8]. However, the lack of association for rs10490770 with COVID-19 cases or CFR in India is striking and suggests instead a complex susceptibility response among Indian populations. Along with the complex genetic structure[11,24], socio-economic status[25] and hygiene[26] may have contributed to such a complex scenario. Furthermore, a detailed clinical and genome-wide association study on Indian COVID-19 patients would be useful to resolve this complexity. Zeberg and Pääbo[9] used the data of higher susceptibility to the disease among the Bangladeshi population living in UK[10] to support their findings. By considering the effect of sex, age, socio-economic deprivation and region, this report found that people of Bangladeshi origin had double the risk of mortality as compared to people of British origin. However, the higher mortality rate for Bangladeshi population in the UK needs more detailed investigation on comorbidity, genetic admixture as well as local environment and socio-economic circumstances in their particular British context. More importantly, a similar trend had also been observed among admixed Americans, where some of the same qualifications may apply mutatis mutandis[27-29]. Furthermore, it is notable that among the Bangladeshi samples analysed by us, the tribal populations of Bangladesh showed almost three times less frequency of rs10490770 (Supplementary Table S1). This is likely due to the different population histories of the caste and tribal populations of Bangladesh[30,31]. The Tibeto-Burman speakers of Bangladesh show a closer genetic affinity with the East and Southeast Asian populations, whereas the Indo-European speaking caste populations incline with the Indian populations. Therefore, it is advised explicitly to differentiate between the caste and tribal populations while making any statement about Bangladeshi populations. Significantly, our data also show that the incidence of the allele rs2285666 has been found to occur in the highest frequency of 100% in Indian populations such as the Nishi and Kokborok (Tripuri), who represent Trans-Himalayan language communities (Supplementary Fig. S1 and Supplementary Table S1). As a linguistic phylum, the Trans-Himalayan language family is widespread in parts of eastern Eurasia and includes languages such as Tibetan, Burmese, Mandarin, Cantonese and Hokkien. Thus, our extensive analyses on real-time data did not show any association of rs10490770 with the state-wise infection rates as well as CFRs, suggesting that the risk allele for COVID-19 in Europe does not play a significant role in COVID-19 severity in South Asia. Supplementary Information.
  20 in total

1.  Shared and unique components of human population structure and genome-wide signals of positive selection in South Asia.

Authors:  Mait Metspalu; Irene Gallego Romero; Bayazit Yunusbayev; Gyaneshwer Chaubey; Chandana Basu Mallick; Georgi Hudjashov; Mari Nelis; Reedik Mägi; Ene Metspalu; Maido Remm; Ramasamy Pitchappan; Lalji Singh; Kumarasamy Thangaraj; Richard Villems; Toomas Kivisild
Journal:  Am J Hum Genet       Date:  2011-12-09       Impact factor: 11.025

2.  The Genetic Ancestry of Modern Indus Valley Populations from Northwest India.

Authors:  Ajai K Pathak; Anurag Kadian; Alena Kushniarevich; Francesco Montinaro; Mayukh Mondal; Linda Ongaro; Manvendra Singh; Pramod Kumar; Niraj Rai; Jüri Parik; Ene Metspalu; Siiri Rootsi; Luca Pagani; Toomas Kivisild; Mait Metspalu; Gyaneshwer Chaubey; Richard Villems
Journal:  Am J Hum Genet       Date:  2018-12-06       Impact factor: 11.025

3.  Surge of respiratory illnesses in children due to fires in Brazil's Amazon region.

Authors:  Lise Alves
Journal:  Lancet Respir Med       Date:  2019-10-31       Impact factor: 30.700

4.  COVID-19 and African Americans.

Authors:  Clyde W Yancy
Journal:  JAMA       Date:  2020-05-19       Impact factor: 56.272

5.  A map of human genome variation from population-scale sequencing.

Authors:  Gonçalo R Abecasis; David Altshuler; Adam Auton; Lisa D Brooks; Richard M Durbin; Richard A Gibbs; Matt E Hurles; Gil A McVean
Journal:  Nature       Date:  2010-10-28       Impact factor: 49.962

6.  The major genetic risk factor for severe COVID-19 is inherited from Neanderthals.

Authors:  Hugo Zeberg; Svante Pääbo
Journal:  Nature       Date:  2020-09-30       Impact factor: 49.962

7.  COVID-19 and Racial/Ethnic Disparities.

Authors:  Monica Webb Hooper; Anna María Nápoles; Eliseo J Pérez-Stable
Journal:  JAMA       Date:  2020-06-23       Impact factor: 157.335

8.  Racial Disparity of Coronavirus Disease 2019 in African American Communities.

Authors:  Ravina Kullar; Jasmine R Marcelin; Talia H Swartz; Damani A Piggott; Raul Macias Gil; Trini A Mathew; Tina Tan
Journal:  J Infect Dis       Date:  2020-08-17       Impact factor: 5.226

9.  Racial and Ethnic Disparities in COVID-19-Related Infections, Hospitalizations, and Deaths : A Systematic Review.

Authors:  Katherine Mackey; Chelsea K Ayers; Karli K Kondo; Somnath Saha; Shailesh M Advani; Sarah Young; Hunter Spencer; Max Rusek; Johanna Anderson; Stephanie Veazie; Mia Smith; Devan Kansagara
Journal:  Ann Intern Med       Date:  2020-12-01       Impact factor: 25.391

10.  Genetic structure of Tibeto-Burman populations of Bangladesh: evaluating the gene flow along the sides of Bay-of-Bengal.

Authors:  Nurun Nahar Gazi; Rakesh Tamang; Vipin Kumar Singh; Ahmed Ferdous; Ajai Kumar Pathak; Mugdha Singh; Sharath Anugula; Pandichelvam Veeraiah; Subburaj Kadarkaraisamy; Brijesh Kumar Yadav; Alla G Reddy; Deepa Selvi Rani; Syed Saleheen Qadri; Lalji Singh; Gyaneshwer Chaubey; Kumarasamy Thangaraj
Journal:  PLoS One       Date:  2013-10-09       Impact factor: 3.240

View more
  4 in total

1.  Will GWAS eventually allow the identification of genomic biomarkers for COVID-19 severity and mortality?

Authors:  Vito Luigi Colona; Michela Biancolella; Antonio Novelli; Giuseppe Novelli
Journal:  J Clin Invest       Date:  2021-12-01       Impact factor: 14.808

Review 2.  The COVID-19 pandemic, an environmental neurology perspective.

Authors:  J Reis; A Buguet; G C Román; P S Spencer
Journal:  Rev Neurol (Paris)       Date:  2022-03-22       Impact factor: 4.313

3.  Beyond GWAS-Could Genetic Differentiation within the Allograft Rejection Pathway Shape Natural Immunity to COVID-19?

Authors:  Joanna Szyda; Paula Dobosz; Joanna Stojak; Mateusz Sypniewski; Tomasz Suchocki; Krzysztof Kotlarz; Magdalena Mroczek; Maria Stępień; Dawid Słomian; Sławomir Butkiewicz; Paweł Sztromwasser; Jakub Liu; Zbigniew J Król
Journal:  Int J Mol Sci       Date:  2022-06-03       Impact factor: 6.208

4.  Coronavirus Host Genomics Study: South Africa (COVIGen-SA).

Authors:  Andrew K May; Heather Seymour; Harriet Etheredge; Heather Maher; Marta C Nunes; Shabir A Madhi; Simiso M Sokhela; W D Francois Venter; Neil Martinson; Firdaus Nabeemeeah; Cheryl Cohen; Jocelyn Moyes; Sibongile Walaza; Stefano Tempia; Jackie Kleynhans; Anne von Gottberg; Jeremy Nel; Halima Dawood; Ebrahim Variava; Stephen Tollman; Kathleen Kahn; Kobus Herbst; Emily B Wong; Caroline T Tiemessen; Alex van Blydenstein; Lyle Murray; Michelle Venter; June Fabian; Michéle Ramsay
Journal:  Glob Health Epidemiol Genom       Date:  2022-10-06
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.