| Literature DB >> 28379995 |
Tânia Carlice-Dos-Reis1, Jaime Viana1,2, Fabiano Cordeiro Moreira1,3, Greice de Lemos Cardoso1, João Guerreiro1, Sidney Santos1,3, Ândrea Ribeiro-Dos-Santos1,3.
Abstract
Mutations in the HBB gene are responsible for several serious hemoglobinopathies, such as sickle cell anemia and β-thalassemia. Sickle cell anemia is one of the most common monogenic diseases worldwide. Due to its prevalence, diverse strategies have been developed for a better understanding of its molecular mechanisms. In silico analysis has been increasingly used to investigate the genotype-phenotype relationship of many diseases, and the sequences of healthy individuals deposited in the 1,000 Genomes database appear to be an excellent tool for such analysis. The objective of this study is to analyze the variations in the HBB gene in the 1,000 Genomes database, to describe the mutation frequencies in the different population groups, and to investigate the pattern of pathogenicity. The computational tool SNPEFF was used to align the data from 2,504 samples of the 1,000 Genomes database with the HG19 genome reference. The pathogenicity of each amino acid change was investigated using the databases CLINVAR, dbSNP and HbVar and five different predictors. Twenty different mutations were found in 209 healthy individuals. The African group had the highest number of individuals with mutations, and the European group had the lowest number. Thus, it is concluded that approximately 8.3% of phenotypically healthy individuals from the 1,000 Genomes database have some mutation in the HBB gene. The frequency of mutated genes was estimated at 0.042, so that the expected frequency of being homozygous or compound heterozygous for these variants in the next generation is approximately 0.002. In total, 193 subjects had a non-synonymous mutation, which 186 (7.4%) have a deleterious mutation. Considering that the 1,000 Genomes database is representative of the world's population, it can be estimated that fourteen out of every 10,000 individuals in the world will have a hemoglobinopathy in the next generation.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28379995 PMCID: PMC5381778 DOI: 10.1371/journal.pone.0174637
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Alignment of the 1000 Genomes and HG19 sequences of HBB using the SNPEFF tool; predictors and BD used for the investigation of pathogenic mutations.
Position and SNP ID of the mutations observed at the nucleotide level, the respective consequences at the amino acid level, the types of mutations, and the number of individuals.
| Position | SNP ID | Nucleotide change | AA alteration | Type of mutation | N° Individuals | Ref. |
|---|---|---|---|---|---|---|
| 5246840 | rs36020563 | G/A | His144His | Synonymous | 1 | [ |
| 5246870 | rs113082294 | C/G | Val134Val | Synonymous | 9 | [ |
| 5246883 | rs111645889 | G/A | Ala130Val | Missense | 1 | [ |
| 5246890 | rs33971634 | G/A | Gln128 | Stop gained | 1 | [ |
| 5246908 | rs33946267 | C/G | Glu122Gln | Missense | 3 | [ |
| 5246947 | rs33958637 | T/G | Asn109His | Missense | 1 | [ |
| 5246948 | rs193922562 | G/A | Gly108Gly | Synonymous | 1 | [ |
| 5247876 | rs145669504 | G/T | Leu82Leu | Synonymous | 5 | [ |
| 5247992–5247996 | rs281864900 | CAAAG/C | Phe42fs | Frameshift | 5 | [ |
| 5248004 | rs11549407 | G/A | Gln40 | Stop gained | 1 | [ |
| 5248029 | rs1135071 | C/A | Arg31Ser | Splice region and missense | 1 | [ |
| 5248030 | rs33943001 | C/G | Splice acceptor and intron variant | 1 | [ | |
| 5248159 | rs33971440 | C/T | Splice donor and intron variant | 1 | [ | |
| 5248162 | rs35578002 | G/T | Glu30Gly | Splice region and synonymous variant | 1 | [ |
| 5248173 | rs33950507 | C/T | Glu27Lys | Missense | 14 | [ |
| 5248200 | rs33986703 | T/A | Lys18 | Stop gained | 6 | [ |
| 5248205 | rs63750783 | C/T | Trp16 | Stop gained | 2 | [ |
| 5248232 | rs334 | T/A | Glu7Val | Missense | 137 | [ |
| 5248233 | rs33930165 | C/T | Glu7Lys | Missense | 17 | [ |
| 5248236 | rs33912272 | G/A | Pro6Ser | Missense | 1 | [ |
#—Intronic variant mutations
SNP ID, nucleotide and Amino Acid changes, number of individuals and population frequency of each mutation.
| SNP ID | Nucleotide change | Amino Acid change | Total individuals | N°/ Freq AFR | N°/ Freq AMR | N°/ Freq EAS | N°/Freq EUR | N°/ Freq SAS | Total Allele Frequency |
|---|---|---|---|---|---|---|---|---|---|
| rs36020563 | G/A | His144His | 1 | 1 (0.0008) | 0 | 0 | 0 | 0 | 0.00019 |
| rs113082294 | C/G | Val134Val | 9 | 0 | 2 (0.0029) | 0 | 7 (0.007) | 0 | 0.00179 |
| rs111645889 | G/A | Ala130Val | 1 | 1 (0.0008) | 0 | 0 | 0 | 0 | 0.00019 |
| rs33971634 | G/A | Gln128 | 1 | 0 | 1 (0.0014) | 0 | 0 | 0 | 0.00019 |
| rs33946267 | C/G | Glu122Gln | 3 | 0 | 0 | 0 | 0 | 3 (0.0031) | 0.00059 |
| rs33958637 | T/G | Asn109His | 1 | 0 | 0 | 1 (0.001) | 0 | 0 | 0.00019 |
| rs193922562 | G/A | Gly108Gly | 1 | 1 (0.0008) | 0 | 0 | 0 | 0 | 0.00019 |
| rs145669504 | G/T | Leu82Leu | 5 | 0 | 0 | 5 (0.005) | 0 | 0 | 0.00099 |
| rs281864900 | CAAAG/C | Phe42fs | 5 | 0 | 0 | 5 (0.005) | 0 | 0 | 0.00099 |
| rs11549407 | G/A | Gln40 | 1 | 0 | 1 (0.0014) | 0 | 0 | 0 | 0.00019 |
| rs1135071 | C/A | Arg31Ser | 1 | 0 | 0 | 0 | 1 (0.001) | 0 | 0.00019 |
| rs33943001 | C/G | # | 1 | 0 | 0 | 0 | 0 | 1 (0.001) | 0.00019 |
| rs33971440 | C/T | # | 1 | 0 | 1 (0.0014) | 0 | 0 | 0 | 0.00019 |
| rs35578002 | G/T | Glu30Gly | 1 | 1 (0.0008) | 0 | 0 | 0 | 0 | 0.00019 |
| rs33950507 | C/T | Glu27Lys | 14 | 0 | 0 | 8 (0.0079) | 0 | 6 (0.0061) | 0.00279 |
| rs33986703 | T/A | Lys18 | 6 | 0 | 0 | 6 (0.006) | 0 | 0 | 0.00119 |
| rs63750783 | C/T | Trp16 | 2 | 0 | 0 | 0 | 0 | 2 (0.002) | 0.00039 |
| rs334 | T/A | Glu7Val | 137 | 132 (0.0072) | 5 (0.0998) | 0 | 0 | 0 | 0.02735 |
| rs33930165 | C/T | Glu7Lys | 17 | 17 (0.0129) | 0 | 0 | 0 | 0 | 0.00339 |
| rs33912272 | G/A | Pro6Ser | 1 | 0 | 0 | 0 | 1 (0.001) | 0 | 0.00019 |
AFR: African.; AMR: American; EAS: Eastern Asian; EUR: European; SAS: Southern Asian.
SNP ID; nucleotide alteration; amino acid alteration; total number of individuals; list of the results from CLINVAR, dbSNP, HbVar, POLYPHEN, PROVEAN, SIFT, PANTHER, and MUTPRED; and final analysis of pathogenicity.
| SNP ID | Nucleotide change | Amino acid change | Total individuals | CLINVAR | dbSNP (NCBI) | HbVar | POLYPHEN | PROVEAN | SIFT | PANTHER | MUTPRED | Conclusion pathogenicity |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| rs11164588 | G/A | Ala130Val | 1 | Other | Other | Benign | Benign | Damaging | Damaging | Damaging | Damaging | Conflict |
| rs33971634 | G/A | Gln128 | 1 | Damaging | Other | Damaging | Damaging | Damaging | ||||
| rs33946267 | C/G | Glu122Gln | 3 | Damaging | Damaging | Benign | Benign | Benign | Damaging | Benign | Damaging | Conflict |
| rs33958637 | T/G | Asn109His | 1 | Other | Benign | Probably damaging | Damaging | Damaging | Benign | Damaging | Conflict | |
| rs281864900 | CAAAG/C | Phe42fs | 5 | Damaging | Damaging | Damaging | Damaging | Damaging | ||||
| rs11549407 | G/A | Gln40 | 1 | Damaging | Damaging | Damaging | Damaging | |||||
| rs1135071 | C/A | Arg31Ser | 1 | Damaging | Damaging | Benign | Probably damaging | Damaging | Damaging | Damaging | Damaging | Damaging |
| rs33943001 | C/G | 1 | Damaging | Damaging | Damaging | Damaging | ||||||
| rs33971440 | C/T | 1 | Damaging | Damaging | Damaging | Damaging | ||||||
| rs35578002 | G/T | Glu30Gly | 1 | Damaging | Benign | Benign | Benign | Benign | Benign | Conflict | ||
| rs33950507 | C/T | Glu27Lys | 14 | Damaging | Damaging | Damaging | Benign | Damaging | Damaging | Damaging | Damaging | Damaging |
| rs33986703 | T/A | Lys18 | 6 | Damaging | Damaging | Damaging | Damaging | Damaging | ||||
| rs63750783 | C/T | Trp16 | 2 | Damaging | Damaging | Damaging | Damaging | Damaging | ||||
| rs334 | T/A | Glu7Val | 137 | Damaging | Damaging | Damaging | Benign | Damaging | Damaging | Damaging | Damaging | |
| rs33930165 | C/T | Glu7Lys | 17 | Damaging | Damaging | Damaging | Benign | Damaging | Damaging | Damaging | Damaging | |
| rs33912272 | G/A | Pro6Ser | 1 | Other | Other | Benign | Benign | Benign | Benign | Damaging | Conflict |
* Could not be evaluated
# Intronic variant mutations