Literature DB >> 33091040

Comprehensive analysis of mitochondrial and nuclear DNA variations in patients affected by hemoglobinopathies: A pilot study.

Ylenia Barbanera1, Francesco Arcioni2, Hovirag Lancioni3, Roberta La Starza1, Irene Cardinali3, Caterina Matteucci1, Valeria Nofrini1, Antonella Roetto4, Antonio Piga4, Paola Grammatico5, Maurizio Caniglia2, Cristina Mecucci1, Paolo Gorello1.   

Abstract

The hemoglobin disorders are the most common single gene disorders in the world. Previous studies have suggested that they are deeply geographically structured and a variety of genetic determinants influences different clinical phenotypes between patients inheriting identical β-globin gene mutations. In order to get new insights into the heterogeneity of hemoglobin disorders, we investigated the molecular variations on nuclear genes (i.e. HBB, HBG2, BCL11A, HBS1L and MYB) and mitochondrial DNA control region. This pilot study was carried out on 53 patients belonging to different continents and molecularly classified in 4 subgroup: β-thalassemia (β+/β+, β0/β0 and β+/β0)(15), sickle cell disease (HbS/HbS)(20), sickle cell/β-thalassemia (HbS/β+ or HBS/β0)(10), and non-thalassemic compound heterozygous (HbS/HbC, HbO-Arab/HbC)(8). This comprehensive phylogenetic analysis provided a clear separation between African and European patients either in nuclear or mitochondrial variations. Notably, informing on the phylogeographic structure of affected individuals, this accurate genetic stratification, could help to optimize the diagnostic algorithm for patients with uncertain or unknown origin.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 33091040      PMCID: PMC7581000          DOI: 10.1371/journal.pone.0240632

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Worldwide, the most frequent genetic diseases are represented by a very heterogeneous subgroup of congenital hemolytic anaemias, which includes β-thalassemia and sickle cell anaemia [1]. Individuals with β-thalassemia have a diverse range of clinical manifestations, from an asymptomatic state to a transfusion-dependent form, depending mainly on the β-globin gene mutations [2,3]. However, individuals with identical β-thalassemia genotypes can exhibit variable clinical severities [4], because of the high complexity of the genetic background associated with the disease [5,6]. The term sickle cell disease (SCD) encompasses a group of disorders characterized by the presence of at least one hemoglobin S allele (HbS; p.Glu6Val in HBB) and a second HBB allele carrying pathogenic variant resulting in abnormal hemoglobin polymerization. SCD (HbS/HbS) caused by the homozygous HBB variant p.Glu6Val is the most common cause of SCD in the US. SCD caused by compound heterozygous HBB pathogenic variants includes sickle-hemoglobin C disease (HbS/HbC) and two types of sickle β-thalassemia (HbS/β+-thalassemia and HbS/β°-thalassemia). Other β-globin chain variants such as D-Punjab, O-Arab, and E also result in SCD when inherited with HbS. The marked variability in phenotype severity is due to the different genotypes of the disease and the co-presence of other independent genetic factors that may worsen or alleviate the phenotype [7,8]. Single nucleotide polymorphism (SNPs) in several genetic modulators and cis-regulatory elements involved in the regulation of human HbF, i.e. HBG2, B cell CLL/lymphoma 11A (BCL11A), Kruppel-like factor 1 (KLF1) and MYB, can delay fetal-to-adult hemoglobin (Hb) switch, ameliorating the severity of β-thalassemia and sickle cell disease [9-20]. Other studies focusing on different human populations reported that the SNPs are deeply geographically structured and further loci i.e KCNK10, GPR65, RNASE2, RNASE3, and C/EBPE are involved in HbF expression regulation [21-25]. In the last decades, mitochondrial DNA (mtDNA) has been proven to be an effective tool to reconstruct the maternal origins of individuals and geographically map samples, but it also plays a key role in various human diseases. Due to its maternal inheritance and the lack of recombination [26,27], its differentiation is generated only by the sequential accumulation of new mutations from a female ancestor that makes the mtDNA a molecular register along maternal lines. Over millennia, the molecular differentiation gave rise to groups of mtDNAs that share a set of variants (called haplotypes) acquired from the same common female ancestor. Even though whole genomic approaches are now answering to questions related to the origin and diversification of species, the mtDNA continues to be an important source of information to reconstruct the genetic history of specific populations. Studies based on sequence variation especially in the noncoding control region have allowed the classification of distinct geographic and ethnic affinities [28-34]. Nowadays, a detailed worldwide mtDNA phylogeny is available, where different haplogroups are defined by specific mutational motifs and restricted to specific areas and population groups [35]. The development of phylogeography is closely related to mtDNA analysis, which was described as one of the best tools to study population variability [36]. The consequences of mitochondrial mutations have been also evaluated in human ageing as well as inherited and acquired diseases [37,38], however the only study focusing on the association of mtDNA variations with β-thalassemia claimed that mitochondrial mutations may increase disease susceptibility or the development of phenotype-associated disease in affected individuals, although concluded that there was no strong proof for the association of these polymorphisms with β-thalassemia [39]. Doubtless, as mentioned above, there should be a correlation between hemoglobinopathies and geographic origins of patients [40,41], that could be deepened also by considering the mtDNA variability. The aim of this pilot study was to provide a first comprehensive analysis of hemoglobin and uniparental marker variants in 53 patients with hemoglobinopathies coming from different countries, in order to depict the phylogeographic structure of affected individuals.

Results and discussion

The study was carried out on 53 patients (M/F: 38/15) affected by hemoglobinophaties who referred to our laboratory at the University of Perugia/Hospital Santa Maria della Misericordia, in Umbria a region of Central Italy. Except for the first patient for whom we do not know the origin, 16 patients (30.2%) were European, 28 (52.8%) African, four (7.6%) Asian and four (7.6%) American. Complete information about geographical characteristics and genotypes were reported in Table 1, while clinical phenotypes were detailed in S1 Text and S1–S4 Tables.
Table 1

Geographic information and α/β genotypes of the 53 patients analysed in this study.

Sample IDSex-AgeaEthnic originβ genotypeα genotype
ContinentCountryAllele 1Allele 2
Classical nomenclatureHGVS nomenclatureClassical nomenclatureHGVS nomenclature
β-Thalassemia (β++, β00 and β+0)1MUnknownUnkonwnIVS-I-1(G>A)(β0)HBB:c.92+1G>AIVS-I-1(G>A)(β0)HBB:c.92+1G>Aαα/αα
2M-18EuropeAlbaniacodon39 (C>T)(β0)HBB:c.118C>Tcodon 44(-C)(β0)HBB:c.135delCαα/αα
3*M-3AfricaAlgeria- 29(A>G)(β+)HBB:c.-79A>Gcodon 8(-AA)(β0)HBB:c.25_26delAAαα/αα
4*F-4AfricaAlgeria- 29(A>G)(β+)HBB:c.-79A>Gcodon 8(-AA)(β0)HBB:c.25_26delAAαα/αα
5MEuropeItalycodon39 (C>T)(β0)HBB:c.118C>TIVS-I-6(T>C)(β++)HBB:c.92+6T>Cαα/αα
6M-1EuropeAlbaniaIVS-I-110(G>A)(β+)HBB:c.93-21G>AIVS-I-110(G>A)(β+)HBB:c.93-21G>Aαα/αα
7*M-15AmericaVenezuelacodon17(A>T)(β0)HBB:c.52A>T- 29(A>G)(β+)HBB:c.-79A>Gαα/αα
8M-18EuropeAlbaniaIVS-I-5(G>C)/(β+)HBB:c.92+5G>CIVS-I-6(T>C)(β++)HBB:c.92+6T>Cαα/αα
9F-62EuropeItalyIVS-I-6(T>C)/(β++)HBB:c.92+6T>CIVS-I-6(T>C)/(β++)HBB:c.92+6T>Cαα/αα
10F-31EuropeItalyCodon 39(C>T)(β0)HBB:c.118C>TIVS-I-6(T>C)/(β++)HBB:c.92+6T>Cαα/αα
11*M-3AsiaSyriacodon 5 (-CT)(β0)HBB:c.17_18delCTIVS-I-6(T>C)/(β++)HBB:c.92+6T>Cαα/αα
12*M-4AsiaSyriacodon 5 (-CT)(β0)HBB:c.17_18delCTIVS-I-6(T>C)/(β++)HBB:c.92+6T>Cαα/αα
13F-19AsiaChina- 28(A>G)(β+)HBB:c.-78A>Gcd17(A>T); AAG(Lys)>TAG(Stop codon)(β0)HBB:c.52A>Tαα/αα
14M-29EuropeItalyIVS-II-745 (C>G) (β+)HBB:c.316-106C>GHb Lepore Boston-WashingtonNG_000007.3:g.63632_71046delαα/αα
15M-10AsiaCambodiaHb EHBB:c.79G>AHb EHBB:c.79G>Aαα/αα
Sickle Cell/Thalassemia (HbS/β+ or β0)16*F-18EuropeAlbaniaHbSHBB:c.20A>Tcodon 39(C>T)(β0)HBB:c.118C>Tαα/αα
17*M-11EuropeAlbaniaHbSHBB:c.20A>Tcodon 39(C>T)(β0)HBB:c.118C>Tααα anti3.7/αα
18F-5EuropeAlbaniaHbSHBB:c.20A>Tcodon 39(C>T)(β0)HBB:c.118C>Tαα/αα
19M-6AfricaIvory CoastHbSHBB:c.20A>T-29(A>G)(β+)HBB:c.-79A>Gαα/αα
20M-46EuropeAlbaniaHbSHBB:c.20A>TIVS-I-6(T>C)(β++)HBB:c.92+6T>Cαα/αα
21*F-52AmericaVenezuelaHbSHBB:c.20A>T-29(A>G)(β+)HBB:c.-79A>Gαα/αα
22M-51EuropeItaly (Sicily)HbSHBB:c.20A>TIVS-I-110(G>A)(β+)HBB:c.93-21G>Aαα/αα
23M-2EuropeAlbaniaHbSHBB:c.20A>TIVS-I-110(G>A)(β+)HBB:c.93-21G>Aαα/αα
24M-19AfricaGuineaHbSHBB:c.20A>T-29(A>G)(β+)HBB:c.-79A>Gαα/αα
25M-24EuropeAlbaniaHbSHBB:c.20A>Tcodon 39(C>T)(β0)HBB:c.118C>Tαααanti3.7/αα
Sickle cell disease (HbS/HbS)26M-7EuropeAlbaniaHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
27M-32AmericaDominican RepublicHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
28M-4AfricaNigeriaHbSHBB:c.20A>THbSHBB:c.20A>T(-) α3.7/αα
29M-1AfricaNigeriaHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
30M-4AfricaNigeria-Ivory CoastHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
31M-2AfricaNigeria-Ivory CoastHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
32M-5AfricaNigeriaHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
33M-2AfricaNigeriaHbSHBB:c.20A>THbSHBB:c.20A>T(-) α3.7/αα
34M-1AfricaNigeriaHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
35M-49EuropeItalyHbSHBB:c.20A>THbSHBB:c.20A>T(-) α3.7/αα
36F-5AfricaNigeriaHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
37F-1AfricaNigeriaHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
38M-14AfricaCongoHbSHBB:c.20A>THbSHBB:c.20A>T(-) α3.7/αα
39M-1AfricaBeninHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
40M-12AfricaSenegalHbSHBB:c.20A>THbSHBB:c.20A>T(-) α3.7/αα
41M-11AmericaVenezuelaHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
42M-60AfricaNigeriaHbSHBB:c.20A>THbSHBB:c.20A>T(-) α3.7/- α3.7
43F-1AfricaNigeriaHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
44M-10 daysAfricaCongoHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
45M-2AfricaNigeriaHbSHBB:c.20A>THbSHBB:c.20A>Tαα/αα
Compound heterozygotes (HbS/HbC and HbO-Arab/HbC)46M-38AfricaIvory CoastHbSHBB:c.20A>THbCHBB:c.19G>A(-) α3.7/αα
47*M-2AfricaIvory CoastHbSHBB:c.20A>THbCHBB:c.19G>Aαα/αα
48F-1AfricaGhanaHbSHBB:c.20A>THbCHBB:c.19G>Aαα/αα
49*F-14AfricaIvory CoastHbSHBB:c.20A>THbCHBB:c.19G>A(-) α3.7/αα
50*F-16AfricaIvory CoastHbSHBB:c.20A>THbCHBB:c.19G>Aαα/αα
51F-24AfricaNigeriaHbSHBB:c.20A>THbCHBB:c.19G>Aαα/αα
52F-59AfricaMoroccoHbO-ArabHBB:c.19G>AHbCHBB:c.364G>Aαα/αα
53M-17AfricaMoroccoHbSHBB:c.20A>THbCHBB:c.19G>Aαα/αα

aAge at time of genetic testing.

*Related patients.

aAge at time of genetic testing. *Related patients. Homozygotes and compound heterozygotes included: 15 β-thalassemia patients, 20 sickle cell disease (HbS/HbS) patients, ten sickle cell/β-thalassemia (HbS/β+ or β0) patients, eight compound heterozygotes for Hb variants of which seven HbS/HbC and one HbO-Arab/HbC. Ten patients carried α mutations: eight were heterozygous for α -3.7 and two for ααα anti 3.7 (Table 1). Seventeen different mutations of the HBB gene were found: 12 generated β-thalassemic defects, two produced Hb thalassemic variants (i.e. Hb-Lepore and HbE) and three were associated with the production of non-thalassemic Hb variants (HbS, HbC and HbO-Arab) (Table 1). Overall, β-thalassemia patients carried 17 different gene mutations and 12 different genotypes. Seven out of 15 patients (46.6%) were European, two (13.3%) were African, four (26.7%) were Asian and one (6.7%) was American. No precise information was available for one patient (6.7%) (Table 1). Sickle cell/β-thalassemia patients were characterized by four different β genotypes while α mutations (ααα anti 3.7) were found in two patients from Albania. Seven out of eleven patients were Europeans (63.6%), two (18.2%) Africans and one (9%) American (Table 1). All sickle cell disease patients carried the same β genotype, while the presence of the -α3.7/αα genotype was demonstrated in five patients and the -α3.7/-α3.7 genotype was highlighted in one patient. Sixteen out of 20 (80%) were African patients, two were Americans (10%) and two were Europeans (10%) (Table 1). Moreover, seven out of eight patients included in the cohort of compound heterozygotes who carried HbS/HbC genotype (patients 46–51, 53), while the other patient bore HbC/HbO-Arab genotype (patient 52). -α3.7/αα genotype was reported in two HbS/HbC cases. All eight patients were African (Table 1).

Hemoglobin phenotype modulating SNPs: Geographical distribution

The analysis of the five polymorphic loci related to HbF expression (i.e. HBG2 rs7482144 [C>T], BCL11A rs1427407 [G>T], BCL11A rs10189857 [A>G], HBS1L rs28384513 [A>C], HBS1L-MYB rs9399137 [T>C]), showed BCL11A rs10189857 [A>G] as the most frequent polymorphic site, since the allele G was detected in 33/53 patients (21 heterozygotes, 12 homozygotes). Conversely, HBS1L-MYB rs9399137 [T>C] was the less frequent polymorphic variation, as the allele C was present only in 6/53 patients (one homozygote and five heterozygotes). As referring to the other SNPs: HBG2 rs7482144 [C>T], BCL11A rs1427407 [G>T] and HBS1L-MYB [A>C] were detected in pts 9, 19 and 21 respectively, showing an intermediate frequency. No patients carrying C/EBPE rs45496295 [C>T] variation were found (Table 2, S5 Table).
Table 2

Molecular analysis of phenotype modulating SNPs, mtDNA sequence variation and geographical distribution of the 53 patients analysed in this study.

Sample IDSex-AgeaEthnic originHBG2BCL11AHBS1L-MYBHBS1L-MYBC/EBPEmtDNA
ContinentCountryrs7482144 [C>T]rs1427407 [G>T]rs10189857 [A>G]rs28384513 [A>C]brs9399137 [T>C]rs45496295 [C>T]Haplotype IDHaplogroupAccession Number
β Thalassemia (β++, β00 and β+0)1MUnkonwnUnkonwnCCGGGGAATTCCHT22R0aMT176183
2M-18EuropeAlbaniaCCGGAAACTCCCHT38H7i1MT176184
3*M-3AfricaAlgeriaCTGGAGAATTCCHT02H1q3MT176185
4*F-4AfricaAlgeriaCTGGAGAATTCCHT02H1q3MT176186
5MEuropeItalyCCGGAAAATTCCHT05HV1MT176187
6M-1EuropeAlbaniaCCGGAGACTCCCHT23T2c1d+152MT176188
7*M-15AmericaVenezuelaCTGGGGAATTCCHT28D1f1MT176189
8M-18EuropeAlbaniaCCGGAGACTTCCHT14H2a5a1MT176190
9F-62EuropeItalyCCGGGGACTTCCHT37H7hMT176191
10F-31EuropeItalyCCGGAGACTCCCHT42H7i1MT176192
11*M-3W. AsiaSyriaCCGTAGAATTCCHT17L2b1MT176193
12*M-4W. AsiaSyriaCCGGGGAATTCCHT17L2b1MT176194
13F-19E. AsiaChinaCCGGGGACTTCCHT12N9a10aMT176195
14M-29EuropeItalyCCGGAAAATTCCHT33H33cMT176196
15M-10E. AsiaCambodiaTTGGGGCCTCCCHT16D4b2dMT176197
Sickle Cell/Thalassemia (HbS/β+ or β0)16*F-18EuropeAlbaniaCCGGGGACTTCCHT15H76aMT176198
17*M-11EuropeAlbaniaCCGGGGAATTCCHT30X2c1bMT176199
18F-5EuropeAlbaniaCCGGAGACCCCCHT07J1c2eMT176200
19M-6AfricaIvory CoastCCGGAGACTTCCHT04L2a1a1MT176201
20M-46EuropeAlbaniaCCGGAGACTTCCHT24T2bMT176202
21*F-52AmericaVenezuelaCTGTAGAATTCCHT28D1f1MT176203
22M-51EuropeItaly (Sicily)CCGTAAAATTCCHT01H2a2a1MT176204
23M-2EuropeAlbaniaCCGGAGAATTCCHT43H12aMT176205
24M-19AfricaGuineaTTGTAAAATTCCHT36L3f1b+16292+150MT176206
25M-24EuropeAlbaniaCCGTAAACTCCCHT06J1c+16261MT176207
Sickle cell disease (HbS/HbS)26M-7EuropeAlbaniaCCGTAGAATTCCHT20T1a1+@152MT176208
27M-32AmericaDominican RepublicCCGGGGAATTCCHT29B4MT176209
28M-4AfricaNigeriaCCGTAAAATTCCHT40G1aMT176210
29M-1AfricaNigeriaCCGGAAAATTCCHT40G1aMT176211
30M-4AfricaNigeria-Ivory CoastCCGTAAAATTCCHT32L2a1MT176212
31M-2AfricaNigeria-Ivory CoastCCGGAGAATTCCHT32L2a1MT176213
32M-5AfricaNigeriaCCGTAAAATTCCHT09L1b2aMT176214
33M-2AfricaNigeriaCCGTAAAATTCCHT21L1b1a+189MT176215
34M-1AfricaNigeriaCCGTAGACTTCCHT41L2a1+143MT176216
35M-49EuropeItalyCCGTAAAATTCCHT39L3e1b2MT176217
36F-5AfricaNigeriaCCGGAGACTTCCHT10L2a1MT176218
37F-1AfricaNigeriaCCTTAAACTTCCHT03L1c1dMT176219
38M-14AfricaCongoCCGGAAACTTCCHT08L1c2a1aMT176220
39M-1AfricaBeninCTGGAAAATTCCHT04L2a1a1MT176221
40M-12AfricaSenegalCTGGAAAATTCCHT35M30cMT176222
41M-11AmericaVenezuelaCTGGGGAATTCCHT44U7aMT176223
42M-60AfricaNigeriaCCGGGGACTTCCHT34L2a1+16189+(16192)MT176224
43F-1AfricaNigeriaCCGGAAACTTCCHT13L3k1MT176225
44M-10 daysAfricaCongoCCGTAGACTTCCHT26L0a1a2MT176226
45M-2AfricaNigeriaCCGGAAAATTCCHT25L0a1aMT176227
Compound heterozygotes (HbS/HbC and HbO-Arab/HbC)46M-38AfricaIvory CoastCCGTAAAATTCCHT11L2a1b+143MT176228
47*M-2AfricaIvory CoastCCGTAGAATTCCHT31L2a1+143MT176229
48F-1AfricaGhanaCCGGGGACTTCCHT18L3bMT176230
49*F-14AfricaIvory CoastCCGTAGAATTCCHT31L2a1+143MT176231
50*F-16AfricaIvory CoastCCTTAAAATTCCHT31L2a1+143MT176232
51F-24AfricaNigeriaCCGTAGAATTCCHT19L3d1b3aMT176233
52F-59AfricaMoroccoCCGGAGACTTCCHT02H1q3MT176234
53M-17AfricaMoroccoCCGGAGAATTCCHT27M1a2aMT176235

aAge at time of genetic testing.

bOnly SNPs that reduce HbF.

*Related patients.

aAge at time of genetic testing. bOnly SNPs that reduce HbF. *Related patients. Interestingly, we observed that frequencies of the five analysed SNPs were related to the geographical origin of patients (Fig 1). Although in our cohort there were only four Asian cases, all investigated SNPs were detected. Conversely, in the other patients (i.e. Europeans, Africans, Americans), the frequency of each SNP was considerably different. HBS1L-MYB rs9399137 T>C was not detected in both our African or American populations, while HBG2 rs7482144 C>T was not detected in European patients (Fig 1, Table 2, S1A and S1B Fig).
Fig 1

Nuclear SNP frequencies and geographic distribution of five nuclear SNPs in the 53 patients.

Black rings represent the number of patients in different geographic areas. Pie charts represent the frequencies of each SNP. The central number indicates the total alleles carrying SNPs.

Nuclear SNP frequencies and geographic distribution of five nuclear SNPs in the 53 patients.

Black rings represent the number of patients in different geographic areas. Pie charts represent the frequencies of each SNP. The central number indicates the total alleles carrying SNPs. In particular, HBG2 rs7482144 [C>T] was found in nine patients, five African, three Americans and one Asian (S1A Fig, Table 2). The absence of this variation in European subjects did not completely fit with the allele frequencies retrieved from GnomAD database, which reported T alleles in the European population, although in a lower frequency (3839 T alleles and 10893 C alleles) (S1B and S1C Fig). BCL11A rs1427407 [G>T] spread over all analysed ethnic group: 13 African patients, four Europeans, one American and one Asian. Likewise, BCL11A, rs10189857 [A>G] was present in 33/53 patients: 14 Africans, ten Europeans, four Asians, four Americans and no precise ethnic information were available for one patient (S1A Fig, Table 2). The allele frequencies of both BCL11A rs1427407 and BCL11A rs10189857 were in agreement with those reported in GnomAD database (Fig 1B and 1C). HBS1L-MYB rs28384513 [A>C] was identified in ten African patients, nine Europeans and two Asians, while it was not detected in American patients (Fig 1A, Table 2). Notably GnomAD database reported the G allele also in this population, although in a lower frequency (9164 T alleles and 5824 G alleles) (S1B and S1C Fig). The low number of American patients included in our cohort (i.e. three pts) could explain the discrepancy between our results and the database. Finally, the HBS1L-MYB rs9399137 [T>C], was detected only in 6/53 analysed patients, five European patients and one Asian (S1A and S1B Fig, Table 2). No African patients with HBS1L-MYB rs9399137 [T>C] were found in our cohort although this represented the largest group. Allele frequencies obtained by GnomAD database reported C alleles in American and African populations, although at lower frequencies (S1B and S1C Fig). In conclusion, for three out of five SNPs our results were in agreement with GnomAD database. The only discrepancy regards HBG2 rs7482144 [C>T] which was not detected in European patients and HBS1L-MYB rs9399137 [T>C] which was not detected in Africans although these patients represented 30.2% and 52.8% of the cohort, respectively. These discrepancies may be due to a selection bias as European patients represented only Italian and Albania geographical areas, while 24/28 African patients came from seven Sub-Saharan countries and only 4/28 came from two North Africa countries. (Fig 1)

Analysis of mtDNA sequence variation and integration with hemoglobin marker variants

Mitochondrial control region sequences ranged from nucleotide position (np) 16024 to np 210, thus encompassing the HVSI and most of the HVSII (Table 2). The overall sequence alignment of samples revealed 99 polymorphic sites (S), represented by ten transversions and two indel (one deletion and one insertion). Nucleotide diversity (π) across all samples was estimated at 0.0173. The haplotype diversity was very high (Hd = 0.993) as well as the average number of nucleotide differences (k = 13,052). When aligned to the revised Cambridge Reference Sequence, a total of 44 different haplotypes were obtained and classified through their mutational motifs into 13 different macro-haplogroups: B, D, G, JT, L0, L1, L2, L3*, M*, N9, R0, UK and X (Fig 2A). Seven haplotypes were shared between two or three samples, but five couples of those individuals declared to be maternally related. For the remaining two haplotypes, one (HT02) was found in two brothers from Algeria and one sample from Morocco the other (HT04) was detected in one patient from Ivory Coast and one from Benin.
Fig 2

Haplogroup frequency.

(A) Haplogroup frequency pie chart. (B) Geographic distribution of patients based on their declared origin and reported for different haplogroups. (C) Distribution of hemoglobinopathies subgroups in all patients for different mtDNA haplogroups.

Haplogroup frequency.

(A) Haplogroup frequency pie chart. (B) Geographic distribution of patients based on their declared origin and reported for different haplogroups. (C) Distribution of hemoglobinopathies subgroups in all patients for different mtDNA haplogroups. In order to better define the phylogenetic relationships between samples, we have constructed the network illustrated in Fig 3. Haplogroup labels are reported according to the Phylotree nomenclature [42]. The network topology showed two major mtDNA clusters, corresponding to super-haplogroups R and not-R. Within R, JT and H haplogroups prevailed. The latter formed a star-like cluster mainly represented by sub-haplogroups H1, H2 and H7. R0a, HV, B and UK were also represented. The not-R cluster included mostly L1, L2 and L3* sub-lineages, followed by L0 and D; also N9, M, X and G were revealed.
Fig 3

Network of mitochondrial variation.

Tree of the 53 mtDNA haplotypes observed among the patients, highlighted with different colors according to their geographic origin. The circles are proportional to the number of samples carrying the same haplotype. Haplogroups and sub-haplogroups are indicated with capital letters. Super-haplogroups R and not-R are defined in the right and left box, respectively.

Network of mitochondrial variation.

Tree of the 53 mtDNA haplotypes observed among the patients, highlighted with different colors according to their geographic origin. The circles are proportional to the number of samples carrying the same haplotype. Haplogroups and sub-haplogroups are indicated with capital letters. Super-haplogroups R and not-R are defined in the right and left box, respectively. Among samples, the most represented lineage belonged to the super-haplogroup L (45%), consisting in L0 (8%), L1 (17%), L2 (54%) and L3* (21%), followed by R0 (25%) and M* (13%) (Fig 2B). The sibling haplogroup JT was detected with 9% of frequency, while B, N9, UK and X lineages were represented by only one sample each. It is known that macro-haplogroup L represents the most ancestral mitochondrial lineage of all living modern humans and, if excluding the derived L3 branches M and N, it represents the majority of the typical sub-Saharan mtDNA variability. Within L, L2 is the most common haplogroup in Africa and it has been previously observed throughout the continent [43]. Our findings agree with the haplogroup frequency and origin of our samples specifically from Nigeria and Ivory Coast (but also Congo and Benin), except for Syria, from which two samples (brothers) derived. L3* was found in two samples from Nigeria, one from Guinea, one from Ghana and one from Italy. Even if L haplotypes account for <1% of mtDNAs in Europe, with L1b being the most common haplogroup, frequencies widely change, ranging from 3% in Southern Europe to 0.7% in Central Europe and 0.5% in Northern Europe. Within R0, only one sample R0a and one HV1 were found, while the remaining patients were H, thus corresponding to a total of 11/53 (21%). Haplogroup H is the most frequent mitochondrial lineage in Europe (~40%) with a declining pattern from Western Europe towards the Near East and Caucasus (~10–20%). It was also found in Africa, especially the North region, with a peak of H1 in Maghreb and this is in accordance with the origin of our H samples (Italy, Albania, Morocco and Algeria). As for the macro-haplogroup M, samples originated from Venezuela and Cambodia and belonged to the D4 sub-branches (D1f1 and D4b2d), which exactly reflect the geographic maximum distribution of these lineages (South America and East Asia, respectively) [44]. Even patient 53 from Morocco, belonging to M1a2a mitochondrial haplogroup, was congruent with the frequency distribution of the lineage, unlike African patients 28 and 29 (brothers) and 40, which belonged to G1a and M30c respectively, both sub-lineages typical of Asia. The East Asian B4 haplogroup revealed in patient 27 from Dominican Republic could result odd, but we should keep in mind that a subclade of B4b is one of the five haplogroups found among indigenous people of the Americas, thus explaining a lower distribution also in Central America. Patient 13 from China belonging to N9a10a perfectly reflects the East Asian origin and the distribution peak of the lineage [45]. After evaluating relationships between mitochondrial phylogeny and the geographic origin of individuals, we moved to a different level of resolution, including also the nuclear SNP data in a new network (S2 Fig, Table 2). We considered the mitochondrial control-region haplotypes, whose phylogenetic relationships are illustrated in the Fig 3, together with zygosity of the five variable genes, trying to detail the patients’ genotype. Patients 5, 14, 29 and 45 were the only homozygous wildtype for all variables: two from Italy (both belonging to the R0 haplogroup) and two from Nigeria (carrying L0 and G mitogenomes). A total of 41/53 patients showed the prevalence of wildtype homozygosity, while only six individuals (patients 6, 10, 21, 25, 34 and 44) were heterozygous for three polymorphisms. Fourteen patients were homozygous mutants for one SNP, while only patient 15 was homozygous mutant for three variables. It is to be highlighted that all individuals within L mitochondrial macro-haplogroup were homozygous wildtype for HBS1L-MYB rs9399137 [T>C], as patients harbouring the JT mitochondrial lineage were homozygous wildtype for HBG2 rs7482144 [C>T]. Conversely, C/EBPE rs45496295 [C>T] was always found as homozygous wildtype, thus it was excluded from the representation. A further analysis was performed, trying to obtain a graphical overview of the possible integration between mtDNA control region variability and nuclear mutations associated to hemoglobinopathies subgroups (Table 2, S2 Text and S5 Table). In S3 Fig, different phenotypical subgroups were highlighted in the mitochondrial phylogenetic tree: β-thalassemia (β-thal) patients, sickle cell/β-thalassemia (HbS/β+ or β0), sickle cell disease (HbS/HbS) and compound heterozygotes (HbS/HbC and HbO-Arab/HbC). The overall network revealed a predominant distribution of the hemoglobinopathies subgroups within clades (S3 Fig). In particular, except for three haplotypes belonging to samples from Albania (patient 26) and South America (patiens 27 and 41) that lied in the R clade, sickle cell disease seemed to be common among not-R haplogroups, especially African lineages (L0, L1, L2 and L3*). It is to be noticed that no sickle cell disease patients belonged to macro-haplogroup R0 (inclusive of R0a, H and HV haplogroups) (Fig 2C). A similar distribution was observed for compound heterozygotes patients, all Africans; they lied in not-R clade (M1, L2 and L3* haplogroups), except for patient 52, from Morocco, that belonged to H1 lineage and it was the only patient characterized by HbC/HbO-Arab genotype. An opposite distribution could be observed for thalassemic patients, which spread among R haplogroups, except for four samples from South America (patient 7), Syria (patients 11 and 12, brothers) and Cambodia (patient 15). In particular, the R0 clade seemed to be overall characterized by β-thalassemic individuals. On the other hand, sickle cell/β-thalassemia patients (HbS/β+ or β0) seemed to be widespread in the network. When we separated data in four different networks, one for each hemoglobinopathies subgroup coloured with the geographic origin of the patient, it was clear that Asian samples were all thalassemic, but clustered far from European thalassemic patients (S3E Fig). No African patient was affected by thalassemia (with the exception of a couple of brothers from Algeria). If excluding two samples with sickle cell/β-thalassemia patients (HbS/β+ or β0) (S3F Fig) Africans prevail in sickle cell disease grouping (S3G Fig) and are the only source of compound heterozygotes (S3H Fig). No star-like pattern of molecular relatedness emerged that would represent any event of local expansion. To resume, we observed that European patients fall prevalently into the thalassemic (β+/β+, β0/β0 and β+/β0) or sickle cell/β-thalassemia (HbS/β+ or β0) subgroups, while Africans into the remaining two (i.e. HbS/HbS and HbS/HbC) (S4 Fig). The clear structuring and sub-division between African and European clades could be detected also in the principal component analysis (PCA) performed by considering all variable sites (both nuclear and mtDNA SNPs) for the 53 patients (Fig 4). Moreover, American samples lied in two quarters, close to Europeans. The graph showed also three European samples close to the African quarters (left quarters): Albanian patients 17 (X haplogroup) and 26 (JT), and patient 35 (L3*) from Italy.
Fig 4

Two-dimensional Principal Component Analysis plot obtained by including all mitochondrial and nuclear DNA variations, considered separately in each sample and coloured depending on their geographic origin.

On the other hand, five African samples stand close to the vertical line that separates left and right quarters: patient 24 (L3*) from Guinea; patients 29 (G), 43 (L3*), and 45 (L0) from Nigeria; and patient 44 (L0) from Congo. Interestingly, all these patients together with three Europeans (patients 17, 26, 35) carried at least HbS allele, which is very frequent in the Sub Saharan Africa. The prevalently European quarter (the first quarter on the right) shows four African patients (patients 3 and 4 siblings, and patients 48 and 52). The two most distant patients from African samples are from North Africa (Morocco and Algeria) confirming the closeness to European ones. Intriguingly, only two thalassemic samples lie on the left half of the PCA: they are patients 11 and 12, the only samples from West Asia, while the other two Asian samples were from China and Cambodia, and fall into the right half of the graph. Even if the past and recent migrations could partially alter the picture, a high level of population and geographic structuring still persist.

Conclusions

This study represents the first comprehensive analysis of nuclear and mitochondrial DNA marker variations in patients with hemoglobinopathies and showed a clear separation between African and European patients allowing for depicting the phylogeographic structure of affected individuals. Further investigations on a larger cohort of patients, as well as on additional mitochondrial and nuclear variants using standard and Next Generation Sequencing methodologies, will allow to establish possible correlations between genetic data and clinical behaviour. Taking all together, this pilot study based on hemoglobin and uniparental marker variants could represent an integrated and powerful tool for enliven the multifaced pictures of hemoglobinophaties.

Materials and methods

Patients and DNA extraction

The study was carried out on 53 patients (M/F: 38/15) affected by hemoglobinophaties who referred to our laboratory at the University of Perugia/Hospital Santa Maria della Misericordia, in Umbria a region of Central Italy. (Table 1). α and β genotype of patients 1–12, 16–23, 26–39 and 46–52 were previously described [46]. Moreover patients 3–4, 11–12, 16–17 and 47, 49–50 were brothers. Patient 21 was mother of patient 7. Peripheral blood samples were collected during routine medical examinations and genomic DNA was isolated through salting-out procedure [47]. At diagnosis and/or during follow-up all patients or their parents/guardians provided informed consent for sample collection, storage and for research collaborative studies. Protocols were approved by the Ethics Committee for Clinical Experimentation of the University of Perugia (protocol no. 2017–01) and analyses were performed in agreement with the Helsinki declaration.

Genomic DNA analysis

Five SNP associated with phenotype severity modulating respectively HBG2 rs7482144 [C>T], BCL11A rs1427407 [G>T], BCL11A rs10189857 [A>G], HBS1L rs28384513 [A>C] and HBS1L-MYB rs9399137 [T>C] were studied in all patients, using polymerase chain reaction (PCR)-based reverse dot-blot technique (β-Thal Modifier StripAssay; ViennaLab Diagnostics GmbH, Vienna, Austria). C/EBPE, rs45496295 [C>T] were studied using primers: CEBP_F: 5’-GACAGGAGCTGGCTCTGAGT-3’ and CEBP_R: 5’-TCCGCAGAGTTAGGCCGT-3’ Nucleotide Sequencing was performed using BigDyeTM Terminator Cycler Sequencing Kit and ABI PRISMTM 310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA, USA) [48]. The SNP frequency in the different populations were obtained by GnomAD database (Genome Aggregation Database, https://gnomad.broadinstitute.org/).

Mitochondrial DNA analysis

Mitochondrial control region between sites 15877 and 259 was amplified using two primers designed on the revised Cambridge Reference Sequence (rCRS) with accession number NC_012920 [49]: forward 5’-CAAATGGGCCTGTCCTTGTA-3’ and reverse 5’-TGTGCAGACATTCAATTGTT-3’. The PCR fragments were first enzymatically purified (ExoSAP-IT®, USB Corporation, Cleveland, OH, USA), then cycle sequenced by application of ABI Prism™ BigDye Terminator chemistry on the ABI PRISMTM 310 Genetic Analyzer (PE Applied Biosystems, Foster City, CA, USA), with the forward primer 5’-CCATTAGCACCCAAAGCTA-3’. Sequences were assembled and aligned to rCRS using Sequencher™ 5.10 (Gene Codes Corporation). Whenever electropherograms showed ambiguities, new PCR amplifications and sequencing reactions were performed. Final assembling encompassed at least the entire Hyper Variable Sequence HVSI and part of HVSII (from np 16024 to np 210). All mitochondrial control-region sequences were deposited in GenBank with accession numbers MT176183-MT176235. For each sample, haplotypes were registered and classified in haplogroups according to the respective mutational motifs by referring to HaploGrep2 software based on PhyloTree database build 17, www.phylotree.org [42]. Genetic variation indices were estimated by using DnaSP 5.1 software [50]. The evolutionary relationships among haplotypes were visualized through the construction of a median-joining network using Network 10 software (www.fluxus-engineering.com). By default, all polymorphisms relative to rCRS were weighted by a value of ten, except for the highly variable site 16519 and the variations in the poly-C stretch around np 16189 that were down-weighted to one. Conversely, triple weights were assigned to the mutational motifs 16223 and 73 in order to strengthen the topology of the network. Principal component analyses (PCA) were performed using Excel software implemented by XLSTAT, as described elsewhere [51] in order to explain the variance of multivariate data (mitochondrial and nuclear DNA variation, considered separately in each sample).

Nuclear SNP frequencies.

(A) Frequency per SNP of patients reported according to the geographic origin. (B) Frequency per continent of the ancestral allele and the variant one in the cohort of this study. (C) Frequency per continent of the ancestral allele and the variant one reported by GnomAD database. Please note that panels A and B are referred to coding DNA reference sequence while panel C is referred to the genomic reference sequence. (DOCX) Click here for additional data file.

Network of mitochondrial and nuclear variation.

Tree of the 53 mtDNA haplotypes observed among the patients in which zygosity of SNP was manually highlighted (rs45496295 was excluded because samples were all homozygous for it). (DOCX) Click here for additional data file.

Nuclear SNP frequency, geographic distribution and network mtDNA, in the subgroups of patients.

(A) Nuclear SNP frequency and Geographic distribution in β-thalassemia (β+/β+, β0/β0 and β+/β0) patients, (B) sickle cell/β-thalassemia (HbS/β+ or β0) patients, (C) sickle cell disease patients (HbS/HbS) (D) compound heterozygotes (HbS/HbC and HbO-Arab/HbC). Panel E-H: network plots obtained by including all mitochondrial and nuclear DNA variation, considered separately in each pathological group, and colored depending on their geographic origin. (DOCX) Click here for additional data file.

Network of mitochondrial variation and hemoglobinopathies.

Tree of the 53 mtDNA haplotypes observed among the patients in which different subgroups of hemoglobinopathies were highlighted. (DOCX) Click here for additional data file.

Clinical data of β thalassemia patients.

(DOCX) Click here for additional data file.

Clinical data of sickle cell/thalassemia patients.

(DOCX) Click here for additional data file.

Clinical data of sickle cell disease patients.

(DOCX) Click here for additional data file.

Clinical data of compound heterozygotes patient.

(DOCX) Click here for additional data file.

Molecular characteristics of the genetic modulators of clinical phenotypes in homozygotes and compound heterozygotes for β gene defect.

(DOCX) Click here for additional data file. (DOCX) Click here for additional data file. (DOCX) Click here for additional data file.
  50 in total

Review 1.  Beta-globin-gene haplotypes, mitochondrial DNA, the Y-chromosome: their impact on the genetic epidemiology of the major structural hemoglobinopathies.

Authors:  R L Nagel
Journal:  Cell Mol Biol (Noisy-le-grand)       Date:  2004-02       Impact factor: 1.770

Review 2.  Somatic mitochondrial DNA mutations in mammalian aging.

Authors:  Nils-Göran Larsson
Journal:  Annu Rev Biochem       Date:  2010       Impact factor: 23.643

3.  Asian affinities and continental radiation of the four founding Native American mtDNAs.

Authors:  A Torroni; T G Schurr; M F Cabell; M D Brown; J V Neel; M Larsen; D G Smith; C M Vullo; D C Wallace
Journal:  Am J Hum Genet       Date:  1993-09       Impact factor: 11.025

4.  The Molecular Spectrum of β- and α-Thalassemia Mutations in Non-Endemic Umbria, Central Italy.

Authors:  Paolo Gorello; Francesco Arcioni; Antonietta Palmieri; Ylenia Barbanera; Laura Ceccuzzi; Cecilia Adami; Mauro Marchesi; Antonella Angius; Olivia Minelli; Marina Onorato; Antonio Piga; Maurizio Caniglia; Cristina Mecucci; Antonella Roetto
Journal:  Hemoglobin       Date:  2016-11       Impact factor: 0.849

5.  The association between four SNPs (rs7482144, rs4671393, rs28384513 and rs4895441) and fetal hemoglobin levels in Chinese Zhuang β-thalassemia intermedia patients.

Authors:  Yunli Lai; Lin Zhou; Sheng Yi; Yun Chen; Yanqing Tang; Shang Yi; Ze Yang; Hongwei Wei; Chenguang Zheng; Sheng He
Journal:  Blood Cells Mol Dis       Date:  2017-01-25       Impact factor: 3.039

Review 6.  New trend in the epidemiology of thalassaemia.

Authors:  Chi-Kong Li
Journal:  Best Pract Res Clin Obstet Gynaecol       Date:  2016-10-26       Impact factor: 5.237

7.  Structural and Functional Insights on an Uncharacterized Aγ-Globin-Gene Polymorphism Present in Four β0-Thalassemia Families with High Fetal Hemoglobin Levels.

Authors:  Nicoletta Bianchi; Lucia Carmela Cosenza; Ilaria Lampronti; Alessia Finotti; Giulia Breveglieri; Cristina Zuccato; Enrica Fabbri; Giovanni Marzaro; Adriana Chilin; Gioia De Angelis; Monica Borgatti; Cristiano Gallucci; Cecilia Alfieri; Michela Ribersani; Antonella Isgrò; Marco Marziali; Javid Gaziev; Aldo Morrone; Pietro Sodani; Guido Lucarelli; Roberto Gambari; Katia Paciaroni
Journal:  Mol Diagn Ther       Date:  2016-04       Impact factor: 4.074

8.  rs11886868 and rs4671393 of BCL11A associated with HbF level variation and modulate clinical events among sickle cell anemia patients.

Authors:  Leila Chaouch; Imen Moumni; Houyem Ouragini; Imen Darragi; Miniar Kalai; Dorra Chaouachi; Imen Boudrigua; Raouf Hafsia; Salem Abbes
Journal:  Hematology       Date:  2016-01-22       Impact factor: 2.269

9.  The emerging tree of West Eurasian mtDNAs: a synthesis of control-region sequences and RFLPs.

Authors:  V Macaulay; M Richards; E Hickey; E Vega; F Cruciani; V Guida; R Scozzari; B Bonné-Tamir; B Sykes; A Torroni
Journal:  Am J Hum Genet       Date:  1999-01       Impact factor: 11.025

10.  Maternal ancestry and population history from whole mitochondrial genomes.

Authors:  Toomas Kivisild
Journal:  Investig Genet       Date:  2015-03-10
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.