| Literature DB >> 26223264 |
Mingming Liu1, Layne T Watson2,3,4, Liqing Zhang5.
Abstract
BACKGROUND: Many genetic variants have been identified in the human genome. The functional effects of a single variant have been intensively studied. However, the joint effects of multiple variants in the same genes have been largely ignored due to their complexity or lack of data. This paper uses HMMvar, a hidden Markov model based approach, to investigate the combined effect of multiple variants from the 1000 Genomes Project. Two tumor suppressor genes, TP53 and phosphatase and tensin homolog (PTEN), are also studied for the joint effect of compensatory indel variants.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26223264 PMCID: PMC4520001 DOI: 10.1186/s40246-015-0040-4
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Fig. 1An example of variant classification in terms of genotypes. The colored sticks on the gene represent variants at different locations. Colors represent different classes of variants. The format v1:T;A|A means variant v1’s ancestral allele is T and the genotype is A|A, the same as other variants. The boxes on the transcripts represent exon regions. The gene and the transcripts share the same coordinate system
Variants sets related to gene ABCB5
| Individual ID | Transcript ID | Set ID |
|---|---|---|
| NA20805 | NM_001163941.1 | 7619 |
| NA20805 | NM_001163942.1 | 3062 |
| NA20805 | NM_001163993.2 | 3062 |
| NA20805 | NM_178559.5 | 7619 |
| NA20806 | NM_178559.5 | 2807 |
| NA20806 | NM_001163993.2 | 3062 |
| NA20806 | NM_001163942.1 | 3062 |
| NA20806 | NM_001163941.1 | 2807 |
| ... | ... | ... |
Data description
| Type | Database | |
|---|---|---|
| IARC (TP53) | COSMIC (PTEN) | |
| Insertion (in-frame) | 90 | 7 |
| Insertion (frameshift) | 419 | 116 |
| Deletion (in-frame) | 364 | 43 |
| Deletion (frameshift) | 1016 | 283 |
| Complex (in-frame) | 94 | 8 |
| Complex (frameshift) | 53 | 19 |
| Total | 2036 | 476 |
Fig. 2Comparison between variant set score (black) and single variant score (red)
Fig. 3Allele frequency distribution of SNP variants in CM sets and nonCM sets
Scoring multiple mutations in β MHC and MyBP-C genes
| Gene | Mutation1 | Score1 | Mutation2 | Score2 | Set score |
|---|---|---|---|---|---|
|
| Val39Met | 1.7 | Arg723Cys | 3.4 | 5.0 |
|
| Pro211Leu | 2.4 | Arg663HIs | 2.2 | 4.5 |
|
| Met349Thr | 2.2 | Arg719Trp | 3.1 | 5.3 |
|
| Arg663His | 2.2 | Val763Met | 2.3 | 4.4 |
|
| Arg719Gln | 1.7 | Thr1513Ser | 0.0 | 1.6 |
|
| Asp906Gly | 2.7 | Leu908Val | 2.0 | 4.6 |
| MyBP-C | Gly5Arg | 1.7 | Arg502Trp | 4.9 | 5.7 |
| MyBP-C | Arg502Trp | 3.9 | Ser858Asn | 2.4 | 6.4 |
| MyBP-C | Alu542Gln | 2.2 | Ala851Val | 2.2 | 4.4 |
| MyBP-C | Asp745Aly | 3.9 | Pro873His | 4.0 | 7.9 |
| MyBP-C | Arg810His | 2.9 | Arg820Gln | 2.6 | 5.5 |
Fig. 4Scatter plot of HMMvar score of a single variant versus the median HMMvar score of the corresponding compensatory indel sets for the TP53 gene and the PTEN gene. The red line is y=x. a TP53 compensatory indels. b PTEN compensatory indels. The red solid circle marks the COSMIC variant with ID 428080
Fig. 5Zygosity of disease-causing single (a–b) or multiple mutations (c–e). The cross and circle indicate different mutations; the different thicknesses indicate different alleles of a mutation, and the thicker one indicates a disease-causing allele; the boxes represent a gene