| Literature DB >> 23773794 |
Sevtap Savas1, Geoffrey Liu, Wei Xu.
Abstract
Analysis of genetic polymorphisms may help identify putative prognostic markers and determine the biological basis of variable prognosis in patients. However, in contrast to other variables commonly used in the prognostic studies, there are special considerations when studying genetic polymorphisms. For example, variable inheritance patterns (recessive, dominant, codominant, and additive genetic models) need to be explored to identify the specific genotypes associated with the outcome. In addition, several characteristics of genetic polymorphisms, such as their minor allele frequency and linkage disequilibrium among multiple polymorphisms, and the population substructure of the cohort investigated need to be accounted for in the analyses. In addition, in cancer research due to the genomic differences between the tumor and non-tumor DNA, differences in the genetic information obtained using these tissues need to be carefully assessed in prognostic studies. In this article, we review these and other considerations specific to genetic polymorphism by focusing on genetic prognostic studies in cancer.Entities:
Mesh:
Year: 2013 PMID: 23773794 PMCID: PMC3729672 DOI: 10.1186/1741-7015-11-149
Source DB: PubMed Journal: BMC Med ISSN: 1741-7015 Impact factor: 8.775
A summary of special considerations in genetic prognostic studies
| Correlation among genetic polymorphisms | (i) Utilization of the linkage disequilibrium (LD) information and investigating the tagging single nucleotide polymorphisms (tagSNPs) instead can prevent this issue [ | (i) reduces the redundancy among variables and simplify the analysis while also reducing the genotyping cost and efforts [ |
| (ii) Once an association is found with a genetic polymorphism, this genomic region (usually within the same LD block) may be investigated in detail to identify the nearby ‘true’ prognostic factor that modifies the prognosis in patients | (ii) may identify the prognostic factor biologically linked to variable prognosis in patients | |
| Genetic polymorphisms as confounders | Some of the genetic polymorphisms confounding the relationship between the prognostic factor and the outcome are likely to be in close vicinity and can be identified by investigating the genomic region in detail | Genetic confounders can be identified |
| Hardy-Weinberg equilibrium (HWE) testing in case-only cohorts | Whether appropriate or not remains to be established | |
| Estimating the correct genetic model | Visual inspection of Kaplan-Meier curves for the codominant genetic model may reveal the best suitable genetic model for investigation of each polymorphism in multivariable models | Provides a logical and comprehensive solution while also reduces the number of tests to be performed |
| Minor allele frequency (MAF) of genetic polymorphisms | Excluding the rare polymorphisms (for example, MAF <5%) from the analysis is a common practice | Prevents unstable model construction and by reducing the multiple testing burden and increasing the events/variables ratio also improves the study power |
| Population stratification due to variable frequencies of genetic polymorphisms in different ethnicities | Detecting and controlling for the population substructure in the cohort eliminates this problem (for example, outlier samples may be eliminated from the analysis or ethnicity can be used as a covariate in the analysis) | Prevents biased estimations and increases the study power |
| Multiple testing issue due to the investigation of large numbers of polymorphisms | Correction for multiple testing using a variety of methods such as Bonferroni or false discovery rate (FDR) methods [ | Reduces the false-positive rate (however, ironically may also increase the false-negative rate) |
| Use of genomic material extracted from archived specimen | Use of new technologies with high rates of successful genotyping [ | Reduces bias and increases study power by allowing the construction of models with a higher number of patients |
| Use of tumor versus non-tumor DNA in the same study | Using one type (either tumor or non-tumor) depending on the objectives of the study in the cohort or checking the correlation of genotype data obtained from both tumor and non-tumor DNA samples in a set of patients to see whether they are comparable with each other (for example, the tumor DNA may not be a good surrogate for non-tumor DNA all the time) | Prevents bias in study results created by alterations in tumor tissue DNA (that is, different genotypes in tumor DNA compared to non-tumor DNA) |
The main characteristics of genetic polymorphisms that require additional considerations in genetic prognostic research are summarized. The majority of the solutions are already applied in susceptibility studies, which can be or have been extended to the prognostic studies.
Figure 1A partial linkage disequilibrium (LD) map of the human (calcium-sensing receptor) gene. Rs numbers for polymorphisms in this gene are shown at the top. The triangle points to the predicted LD block. The rectangles indicate the correlation coefficient (r2) values between the different polymorphisms; the darker the color, the higher the r2 values. This figure was obtained using Haploview [22] with the genotype data for Caucasian samples posted at the HapMap database [20,21].
Figure 2Kaplan-Meier curves may identify the best fitting genetic model for a polymorphism. For simplicity, survival curves are shown as straight lines. AA = major allele homozygous genotype, AB = heterozygous genotype, BB = minor allele homozygous genotype, assuming allele ‘A’ is the common allele. (a) The effect of the AB genotype on survival is approximately half between the AA and BB genotypes, thus the additive model is appropriate for this polymorphism in the multivariate analysis. (b) The curves of AB and BB genotypes cluster closer to each other when compared to the AA genotype’s curve, thus, the effect of the polymorphism is likely to be dominant. (c) AA and AB genotype survival curves cluster together and clearly separate from the BB genotype curve. Thus, the inheritance pattern is likely to be recessive. (d) In this case, the effect of AB genotype is somewhat in between the effects of AA and BB genotypes, thus, analyzing this polymorphism assuming the codominant model is suitable. (e) This is an interesting polymorphism where the heterozygotes are associated with worse survival compared to either homozygous genotypes (AA and BB). The codominant genetic model is the appropriate model to investigate such polymorphisms in multivariate analyses. Exact biological and genetic reasons for such associations are not clear, but it may be due to heterozygote disadvantage where the heterozygotes display phenotype but not the either homozygotes. (f) The heterozygotes have better survival than AA and BB homozygotes. This case may represent a ‘heterozygote advantage’ situation, where the heterozygotes have favorable survival characteristics. Similar examples are observed in Mendelian diseases, such as sickle cell anemia [56]. In both (e) and (f), presence of another genetic variation in close proximity acting as a prognostic factor (which is not highly correlated with this polymorphism) may be an alternative explanation.