Hossein Afshar1, Fatemeh Adelirad2, Ali Kowsari3, Naser Kalhor3, Ahmad Delbari1, Reza Najafipour4, Mahshid Foroughan1, Ali Bozorgmehr5, Safoura Khamse1, Neda Nazaripanah2, Mina Ohadi6.
1. Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
2. Department of Health Education and Promotion, Faculty of Health Sciences, Tabriz University of Medical Sciences, Tabriz, Iran.
3. Department of Mesenchymal Stem Cell, The Academic Center for Education, Culture and Research, Qom, Iran.
4. Cellular and Molecular Research Centre, Research Institute for Prevention of Non Communicable Disease, Qazvin University of Medical Sciences, Qazvin, Iran.
5. Department of Neuroscience, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran, Iran.
6. Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran, ohadi.mina@yahoo.com.
Abstract
BACKGROUND: Approximately 2% of the human core promoter short tandem repeats (STRs) reach lengths of ≥6 repeats, which may in part be a result of adaptive evolutionary processes and natural selection. A single-exon transcript of the human nescient helix-loop-helix 2 (NHLH2) gene is flanked by the longest CA-repeat detected in a human protein-coding gene core promoter (Ensembl transcript ID: ENST00000369506.1). NHLH2 is involved in several biological and pathological pathways, such as motivated exercise, obesity, and diabetes.
METHODS: The allele and genotype distributions of the NHLH2 CA-repeat were investigated by sequencing in 655 Iranian subjects, comprising patients with late-onset neurocognitive disorder (NCD) as a clinical entity (n = 290) and matched controls (n = 365). The evolutionary trend of the CA-repeat was also studied across vertebrates.
RESULTS: The alleles ranged from 9 to 25 repeats in the NCD cases and from 12 to 24 repeats in the controls. At a frequency of 0.56, the 21-repeat allele was the predominant allele in the controls. While the 21-repeat allele was also predominant in the NCD patients, we detected a significant decline in the frequency (p < 0.0001) and homozygosity (p < 0.006) of this allele in this group. Furthermore, 12 genotypes were detected across 16 patients (5.5% of the entire NCD sample) and not in the controls (disease-only genotypes; p < 0.0003), each containing at least one extreme allele. The extreme alleles were at 9, 12, 13, 18, and 19 repeats (extreme short end) and at 23, 24, and 25 repeats (extreme long end), and their frequencies ranged between 0.001 and 0.04. The frequency of the 21-repeat allele dropped significantly to 0.09 in the disease-only genotype compartment (p < 0.0001). Evolutionarily, while the maximum length of the NHLH2 CA-repeat was 11 repeats in non-primates, this CA-repeat was ≥14 repeats in primates and reached its maximum length in humans.
CONCLUSION: We propose the exceptionally long CA-STR at the NHLH2 core promoter as a novel locus for late-onset NCD, and we propose natural selection at this locus. Furthermore, there was indication of genotypes at this locus that were unambiguously linked to late-onset NCD. This is the first instance of natural selection in favor of a predominantly abundant STR allele in humans and of its differential distribution in late-onset NCD.
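The abstract does not say which statistical test produced the reported p-values. As an illustration only, the sketch below runs a two-sided Fisher exact test on the disease-only genotype counts given above (16 of 290 NCD patients versus 0 of 365 controls). The choice of test, the 2x2 layout, and the function name `fisher_exact_two_sided` are assumptions made for this example and are not taken from the study.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(k):
        # Hypergeometric probability of k carriers in row 1 with fixed margins.
        return comb(row1, k) * comb(row2, col1 - k) / denom

    p_obs = prob(a)
    k_min = max(0, col1 - row2)
    k_max = min(row1, col1)
    # Sum every table that is no more likely than the observed one
    # (small relative tolerance guards against floating-point ties).
    return sum(
        p for p in (prob(k) for k in range(k_min, k_max + 1))
        if p <= p_obs * (1 + 1e-9)
    )


# Counts from the abstract; the table layout is an assumption of this sketch.
cases_with, cases_without = 16, 290 - 16      # NCD patients
controls_with, controls_without = 0, 365 - 0  # matched controls

p_value = fisher_exact_two_sided(
    cases_with, cases_without, controls_with, controls_without
)
print(f"two-sided Fisher exact p = {p_value:.3g}")
```

Under these assumptions the p-value falls below the reported threshold of 0.0003. A different test, or a correction for multiple comparisons, would give a different value.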
Authors: H Afshar; S Khamse; F Alizadeh; A Delbari; R Najafipour; A Bozorgmehr; M Khazaei; F Adelirad; A Alizadeh; A Kowsari; M Ohadi Journal: Sci Rep Date: 2020-11-10 Impact factor: 4.379