| Literature DB >> 23650587 |
Yoshihiro Kawamura1, Shigeru Takasaki, Masashi Mizokami.
Abstract
The recommended treatment for patients with chronic hepatitis C, pegylated interferon α (PEG-IFN-α) plus rebavirin (RBV), does not provide a sustained virologic response in all patients, especially those with hepatitis C virus (HCV) genotype 1. It is therefore important to predict whether or not a new patient with HCV genotype 1 will be cured by the recommended treatment. We propose a prediction method for a new patient using a decision tree learning model based on SNPs evaluated in a genome-wide association study. By the decision tree learning for 142 Japanese patients with HCV genotype 1 (78 with null virologic response and 64 with virologic response), we can predict with high probability (93%) whether or not a new patient with HCV will be helped by the recommended treatment.Entities:
Keywords: Chronic hepatitis C genotype 1; Decision tree learning; GDI, Gini diversity index; GWAS, genome-wide association study; Genome-wide association study; HCV, hepatitis C virus; Het, one major and one minor genotype; MM, both major genotypes; NVR, null virologic response; Null virologic response; OR, Odds ratio; PEG-IFN-α, pegylated interferon α; Pegylated interferon α; RBV, ribavirin; Rebavirin; SNPs, single nucleotide polymorphisms; SVR, sustained virologic response; Single nucleotide polymorphism; Sustained virologic response; mm, both minor genotypes
Year: 2012 PMID: 23650587 PMCID: PMC3645974 DOI: 10.1016/j.fob.2012.04.007
Source DB: PubMed Journal: FEBS Open Bio ISSN: 2211-5463 Impact factor: 2.693
The three types of branches in the decision tree.
| Type | Combination 1 | Combination 2 |
|---|---|---|
| 1 | MM | Het + mm |
| 2 | Het | MM + mm |
| 3 | mm | MM + Het |
MM: both nucleotides are major genotypes (e.g., CC, C: major genotype).
Het: one nucleotide is a major genotype and the other is a minor genotype (e.g., TC, T: minor genotype).
mm: both nucleotides are minor genotypes (e.g., TT).
Fig. 1Decision tree diagram for the treatment of Japanese HCV patients (78 NVR and 64 VR samples). The box at the top of a branch node indicates the SNP ID (e.g., rs8099917). The bottom of the branch node shows the branch condition: left for “yes” and right for “no”. In the case of rs8099917, for example, “mm + Het” is “yes” and “MM” is “no”. B1, B2, and B3 are branch nodes and B1 is the root node. The numbers in the green parts of the nodes are the numbers of NVR samples, and those in the blue parts are the numbers of VR samples. The percentage next to each node shows the percentage of the greater number of samples. In B1, for example, 78 is greater than 64 and 78/(78 + 64) is 54.9%. The two circles in B2 and B3 indicate two ratios; the external one is the ratio of the upper branch node and the internal one is the ratio of the present branch node. In B2, for example, the external ratio shows that of B1 and the internal ratio indicates that of node B2, i.e., 75.3% (58/(19 + 58)). L1, L2, L3, and L4 are leaf nodes.
Attribute SNPs in the three decision trees.
| Model | SNP used | Probability predicted (%) | ||||
|---|---|---|---|---|---|---|
| SNP | OR | Chromosome | Allele ratio (%) | |||
| 1 | rs8099917 | 3.11 × 10−15 | 30 | 19 | 23.9 | 90.1 |
| rs4906195 | 4.52 × 10−4 | 3.9 | 14 | 18 | ||
| rs3816768 | 2.85 × 10−4 | 4 | 15 | 16.4 | ||
| 2 | rs6586361 | 7.81 × 10−5 | 5 | 1 | 17.3 | 92.3 |
| rs6793110 | 2.82 × 10−4 | 10.7 | 3 | 5.7 | ||
| rs12713624 | 1.56 × 10−4 | 3.8 | 2 | 33.1 | ||
| rs10079121 | 1.11 × 10−3 | 3.1 | 5 | 25.7 | ||
| rs1455474 | 2.96 × 10−4 | 4.6 | 8 | 47.9 | ||
| rs315023 | 2.67 × 10−3 | 2.8 | 1 | 29.6 | ||
| rs194507 | 3.19 × 10−4 | 3.5 | 7 | 34.4 | ||
| 3 | rs4876271 | 2.3 × 10−5 | 4.5 | 8 | 27.8 | 92.9 |
| rs11163027 | 1.51 × 10−4 | 4.7 | 1 | 12.7 | ||
| rs1995122 | 1.98 × 10−4 | 4.1 | 6 | 14.8 | ||
| rs7655238 | 3.21 × 10−3 | 3.4 | 4 | 46.1 | ||
| rs12180485 | 5.16 × 10−3 | 2.8 | 6 | 17.6 | ||
| rs1353348 | 1.36 × 10−4 | 3.9 | 15 | 35.1 | ||
| rs1903998 | 2.58 × 10−3 | 3.1 | 10 | 40.5 | ||
P-value: χ2 test for allele frequencies, OR: Odds ratio.