| Literature DB >> 31286593 |
Li Zhang1,2,3, Qianqian Zhang1,2,3, Yaohua Tang4, Peikuan Cong5, Yuhua Ye1,2,3, Shiping Chen6, Xinhua Zhang7, Yan Chen8, Baosheng Zhu9, Wangwei Cai10, Shaoke Chen11, Ren Cai12, Xiaoling Guo13, Chonglin Zhang14, Yuqiu Zhou15, Jie Zou16, Yanhui Liu17, Biyan Chen18, Shanhuo Yan19, Yajun Chen20, Yuehong Zhou21, Hongmei Ding22, Xiarong Li23, Dianyu Chen24, Jianmei Zhong1,2,3, Xuan Shang1,2,3, Xuanzhu Liu23, Ming Qi24,25, Xiangmin Xu1,2,3.
Abstract
Hemoglobinopathies are the most common monogenic disorders worldwide. Substantial effort has been made to establish databases to record complete mutation spectra causing or modifying this group of diseases. We present a variant database which couples an online auxiliary diagnosis and at-risk assessment system for hemoglobinopathies (DASH). The database was integrated into the Leiden Open Variation Database (LOVD), in which we included all reported variants focusing on a Chinese population by literature peer review-curation and existing databases, such as HbVar and IthaGenes. In addition, comprehensive mutation data generated by high-throughput sequencing of 2,087 hemoglobinopathy patients and 20,222 general individuals from southern China were also incorporated into the database. These sequencing data enabled us to observe disease-causing and modifier variants responsible for hemoglobinopathies in bulk. Currently, 371 unique variants have been recorded; 265 of 371 were described as disease-causing variants, whereas 106 were defined as modifier variants, including 34 functional variants identified by a quantitative trait association study of this high-throughput sequencing data. Due to the availability of a comprehensive phenotype-genotype data set, DASH has been established to automatically provide accurate suggestions on diagnosis and genetic counseling of hemoglobinopathies. LOVD-DASH will inspire us to deal with clinical genotyping and molecular screening for other Mendelian disorders.Entities:
Keywords: DASH; LOVD; clinical genotyping; database; hemoglobinopathy; molecular screening
Mesh:
Year: 2019 PMID: 31286593 PMCID: PMC6899610 DOI: 10.1002/humu.23863
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Summary of disease‐causing and modifier variants of hemoglobinopathies in LOVD‐China
| Disease‐causing variants | ||||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| OMIM | 141900 | 141800 | 141850 | 142000 | 142200 | 142250 |
| Location | 11p15.4 | 16p13.3 | 16p13.3 | 11p15.4 | 11p15.4 | 11p15.4 |
| Pathogenicity | ||||||
| Hb variant | 28 | 6 | 13 | 4 | 2 | 2 |
| Thalassemia | 92 | 27 | 33 | 17 | 0 | 0 |
| HPFH | 2 | 0 | 0 | 2 | 1 | 0 |
| Uncertain significance | 2 | 11 | 8 | 2 | 8 | 5 |
| Total | 124 | 44 | 54 | 25 | 11 | 7 |
Note: All the variants are classified according to ACMG recommendations. The pathogenicity was described as Hb variant, thalassemia, and hereditary persistence fetal hemoglobin (HPFH). “Hb variant” includes not only variants result in clinical significance, but all the reported abnormal hemoglobin variants. Modifier variants refer to all the collectible variants reported to be HbF‐related or be significant in influencing the severity of hemoglobinopathies in the Chinese population. Variants in disease‐causing genes without known pathogenicity were defined as the variants with “uncertain significance” (VUS). “Uncertain significance” in modifier variants, especially in HMIP region, refers to the variants reported to be HbF‐related but with unclear modification.
Abbreviations: ACMG, American College of Medical Genetics and Genomics; LOVD, LOVD, Leiden Open Variation Database; OMIM, Online Mendelian Inheritance in Man
Appropriately, HBG1 and HBG2 genes can be classed as modifier genes. “Hb variant” in these genes referred to the variants which lead to abnormal fetal hemoglobin such as Hb F‐Jiangsu (HBG1:c.403G>A).
HPFH involves only the deletion forms.
Although bulks of modifier variants have been detected in non‐globin genes like BCL11A or HMIP region etc., functional variants within KLF1 have the most clinical significant influence on the severity of β‐hemoglobinopathies in Chinese population.
Variants in erythroid transcription factors GATA1 are reported to be related with HbF, HbA2, and severity of hemoglobinopathies.
Functional variants from 22,309 high‐throughput sequencing data
| SNPs | Locus | Location | Nucleotide change | Frequency | Modification |
| HbF level of carriers(g/L) | HbF level of non‐carriers(g/L) |
|---|---|---|---|---|---|---|---|---|
| rs61749494 |
| 2:60689441 | T>C | 0.251 | Elevated HbF | 4.24 × 10−6 | 18.6820 | 11.6439 |
| rs10189857 |
| 2:60713235 | A>G | 0.9235 | Decreased HbF | 1.64 × 10−6 | 12.8591 | 20.0668 |
| rs6545816 |
| 2:60714861 | A>C | 0.9216 | Decreased HbF | 3.24 × 10−6 | 12.8711 | 19.7456 |
| rs1427407 |
| 2:60718043 | T>G | 0.9275 | Decreased HbF | 1.95 × 10−6 | 12.9450 | 19.3588 |
| rs7599488 |
| 2:60718347 | C>T | 0.9235 | Decreased HbF | 2.77 × 10−6 | 12.8542 | 20.1268 |
| rs766432 |
| 2:60719970 | C>A | 0.9255 | Decreased HbF | 1.65 × 10−5 | 13.0060 | 18.4325 |
| rs4671393 |
| 2:60720951 | A>G | 0.9275 | Decreased HbF | 3.43 × 10−6 | 12.9880 | 18.8090 |
| rs375867652 |
| 6:135419038 | delC | 0.3275 | Elevated HbF | 1.59 × 10−4 | 17.3037 | 11.5147 |
| rs11759553 |
| 6:135422296 | A>T | 0.3451 | Elevated HbF | 5.61 × 10−5 | 17.2904 | 11.3657 |
| rs35959442 |
| 6:135424179 | C>G | 0.349 | Elevated HbF | 6.06 × 10−5 | 17.2321 | 11.3613 |
| rs4895440 |
| 6:135426558 | A>T | 0.349 | Elevated HbF | 6.06 × 10−5 | 17.2321 | 11.3613 |
| rs4895441 |
| 6:135426573 | A>G | 0.349 | Elevated HbF | 6.06 × 10−5 | 17.2321 | 11.3613 |
| rs9402686 |
| 6:135427817 | G>A | 0.351 | Elevated HbF | 6.06 × 10−5 | 17.1452 | 11.3906 |
| rs9494142 |
| 6:135431640 | T>C | 0.3627 | Elevated HbF | 5.17 × 10−5 | 16.9296 | 11.4070 |
| rs6934903 |
| 6:135451564 | T>A | 0.3373 | Elevated HbF | 9.34 × 10−5 | 16.6348 | 11.7694 |
| rs78981054 |
| 11:5270347 | delAAAG | 0.9863 | Decreased HbF | 4.66 × 10−8 | 13.1293 | 33.6006 |
| rs34879481 |
| 11:5274452 | insT | 0.1392 | Elevated HbF | 4.19 × 10−12 | 23.3682 | 11.7998 |
| rs28379094 |
| 11:5269806 | C>T | 0.9843 | Decreased HbF | 1.65 × 10–12 | 13.0232 | 37.7005 |
| rs2187608 |
| 11:5269931 | G>C | 0.1373 | Elevated HbF | 1.41 × 10–12 | 23.3314 | 11.832 |
| rs7482933 |
| 11:5270002 | G>A | 0.8588 | Decreased HbF | 1.15 × 10–9 | 11.8743 | 22.7546 |
| rs2855039 |
| 11:5271671 | C>T | 0.1373 | Elevated HbF | 6.93 × 10–12 | 23.3314 | 11.832 |
| rs2855038 |
| 11:5272154 | T>C | 0.9863 | Decreased HbF | 4.46 × 10–11 | 13.1293 | 33.6006 |
| rs2855036 |
| 11:5272682 | C>T | 0.1373 | Elevated HbF | 1.30 × 10–11 | 23.3314 | 11.832 |
| rs2070972 |
| 11:5274717 | A>C | 0.9843 | Decreased HbF | 5.86 × 10–11 | 13.1411 | 30.303 |
| rs11036474 |
| 11:5275178 | T>C | 0.1412 | Elevated HbF | 7.90 × 10–12 | 23.3499 | 11.7764 |
| rs11036475 |
| 11:5275240 | G>A | 0.9863 | Decreased HbF | 2.06 × 10–11 | 13.1293 | 33.6006 |
| rs11036476 |
| 11:5275343 | C>T | 0.9863 | Decreased HbF | 9.69 × 10–11 | 13.1293 | 33.6006 |
| rs2070973 |
| 11:5275407 | T>C | 0.9863 | Decreased HbF | 9.69 × 10–11 | 13.1293 | 33.6006 |
| rs7482144 |
| 11:5276169 | G>A | 0.1412 | Elevated HbF | 1.81 × 10–11 | 23.3499 | 11.7764 |
| rs2855123 |
| 11:5277078 | A>T | 0.9863 | Decreased HbF | 3.92 × 10‐10 | 13.1293 | 33.6006 |
| rs2855122 |
| 11:5277236 | C>T | 0.9863 | Decreased HbF | 1.20 × 10–11 | 13.1293 | 33.6006 |
| rs2855121 |
| 11:5277291 | C>T | 0.1392 | Elevated HbF | 7.90 × 10–12 | 23.3682 | 11.7998 |
| rs34306743 |
| 11:5272553 | insA | 0.1373 | Elevated HbF | 1.30 × 10–11 | 23.3314 | 11.832 |
| rs483352838 |
| 19:12996518 | insGGCGCCG | 0.0137 | Elevated HbF | 1.93 × 10−6 | 39.2343 | 13.0509 |
Note: Among the 510 β0/β0 samples from 22,309 sequencing data, 74 variants were shown to be significant after association analysis in Plink judged by the P‐values after a Bonferroni correction. 34 of the variants were located in our candidate genes or the HMIP region. The 74 variants are available in the supplementary document.
Abbreviation: SNP, single nucleotide polymorphism.
The chromosomal locations are given in GRCh37/hg19 coordinates.
Figure 1Homepage of the HBB gene from our LOVD‐China database. LOVD, Leiden Open Variation Database
Figure 2Workflow of DASH. Data resources including LOVD‐China, 22,309 phenotype‐genotype data set, and HbVar data set were mainly used for the clinical genotyping module and at‐risk assessment module. DASH, diagnosis and at‐risk assessment system for hemoglobinopathies; LOVD, Leiden Open Variation Database
26 complicated cases of compound heterozygotes in combination with modifier variants
| Major classes | Genotypes | Number of patients | |
|---|---|---|---|
| TM | TI | ||
| β‐thal modification samples | 4 | 19 | |
| α‐Globin gene triplication | (β0/β0, ααα/αα) | 3 | 0 |
| (β0/β+, ααα/αα) | 1 | 0 | |
| (β0/βN or β+/βN, ααα/αα or ααα/ααα) | 0 | 9 | |
| Other modifier variants | (β0 / β0) + ( | 0 | 5 |
| (β0 / β0) + 4 significant variants* | 0 | 5 | |
| Atypical thalassemia samples |
|
| |
| Microcytic hypochromic anemia | ( | 0 | 3 |
Note: Four significant variants*: 4 functional variants exerting a significant impact on the clinical severity of β‐thalassemia patients: HBA1 and HBA2 disease‐causing variants, rs7482144 (Xmn1), rs61749494 (BCL11A), and rs11759553 (HMIP). Details of all 26 samples are available in the Supporting Information document (Table S3). The bold values means the number of TM(thalassemia major) or TI(thalassemia intermedia) patients, with no special significance.
Abbreviations: TI, thalassemia intermedia; TM, thalassemia major.