| Literature DB >> 15904492 |
Claire Bardel1, Vincent Danjean, Jean-Pierre Hugot, Pierre Darlu, Emmanuelle Génin.
Abstract
BACKGROUND: The cladistic approach proposed by Templeton has been presented as promising for the study of the genetic factors involved in common diseases. This approach allows the joint study of multiple markers within a gene by considering haplotypes and grouping them in nested clades. The idea is to search for clades with an excess of cases as compared to the whole sample and to identify the mutations defining these clades as potential candidate disease susceptibility sites. However, the performance of this approach for the study of the genetic factors involved in complex diseases has never been studied.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15904492 PMCID: PMC1173100 DOI: 10.1186/1471-2156-6-24
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Power to detect an association, one susceptibility site simulated and included in the analysis
| Tree | CT | CLADHC | HT | SbST |
| 1 | 96.4 | 88.9 | 91.1 | |
| 2 | 99.0 | 95.7 | 91.3 | |
| 3 | 86.0 | 91.1 | 91.2 | |
| 4 | 96.3 | 95.6 | 91.5 | |
| 5 | 88.8 | 95.3 | 90.4 | |
| 6 | 84.5 | 88.8 | 90.7 | |
| 7 | 97.4 | 96.3 | 91.9 | |
| 8 | 98.5 | 91.2 | 99.4 | |
| 9 | 88.1 | 94.0 | 91.3 | |
| 10 | 92.5 | 93.0 | 91.3 | |
| Average | 92.9 | 93.7 | 91.2 | |
| Std dev.a | 5.63 | 3.24 | 0.41 | 0.13 |
Sample size: 100 cases and 100 controls, 1000 replicates, penetrance vector: [0.03;0.06;0.3], frequency of the susceptibility allele: 0.205 ± 0.005. The corresponding type I errors are indicated in italics within brackets. For each tree, the method giving the best result is underlined. CT: Cladistic Test, CLADHC: test developed by Durrant et al. [29], HT: Haplotypic Test, SbST: Site by Site Test.
a Standard deviation
Power to detect an association, one susceptibility site simulated and removed from the analysis
| Tree | CT | CLADHC | HT | SbST |
| 1 | 89.1 | 91.2 | 89.5 | |
| 2 | 94.0 | 92.7 | 87.6 | |
| 3 | 79.3 | 88.9 | 86.2 | |
| 4 | 88.7 | 85.7 | 87.8 | |
| 5 | 85.6 | 90.5 | 94.0 | |
| 6 | 78.6 | 81.6 | 67.0 | |
| 7 | 92.4 | 90.5 | 87.8 | |
| 8 | 98.6 | 91.2 | 99.6 | |
| 9 | 89.2 | 91.2 | 89.1 | |
| 10 | 82.8 | 81.3 | 79.1 | |
| Average | 89,38 | 89.35 | 88,28 | |
| Std dev.a | 6.71 | 5.2 | 3.06 | 9.81 |
Sample size: 100 cases and 100 controls, 1000 replicates, penetrance vector: [0.03;0.06;0.3], frequency of the susceptibility allele: 0.205 ± 0.005. The corresponding type I errors are indicated in italics within brackets. For each tree, the method giving the best result is underlined. CT: Cladistic Test, CLADHC: test developed by Durrant et al. [29], HT: Haplotypic Test, SbST: Site by Site Test.
a Standard deviation
Power to detect an association for two susceptibility sites simulated under model 2
| Model 2 | ||||
| Tree | CT | CLADHC | HT | SbST |
| 1 | 97.8 | 97.7 | 99.1 | |
| 2 | 95.1 | 97.7 | 95.8 | |
| 3 | 94.2 | 97.9 | 98.6 | |
| 4 | 97.6 | 97.8 | 97.8 | |
| 5 | 98.5 | 98.9 | 98.2 | |
| 6 | 97.9 | 98.5 | 96.9 | |
| 7 | 93.0 | 93.5 | 95.5 | |
| 8 | 95.2 | 98.2 | 95.4 | |
| 9 | 96.7 | 97.3 | 97.6 | |
| 10 | 99.5 | 98.2 | 99.8 | |
| Average | 96.7 | 97.96 | 97.56 | |
| Std dev.a | 2.24 | 1.83 | 0.54 | 1.58 |
Sample size: 200 cases and 200 controls, 1000 replicates, penetrance: 0.9 for the homozygotes carrying the two susceptibility alleles, 0.3 for the other genotypes. The corresponding type I errors are indicated in italics within brackets. For each tree, the method giving the best result is underlined. CT: Cladistic Test, CLADHC: test developed by Durrant et al. [29], HT: Haplotypic Test, SbST: Site by Site Test
a Standard deviation
Power to detect an association for two susceptibility sites simulated under model 3
| Model 3 | ||||
| Tree | CT | CLADHC | HT | SbST |
| 1 | 40.7 | 42.0 | 47.5 | |
| 2 | 31.7 | 32.7 | 43.0 | |
| 3 | 26.8 | 40.7 | 45.2 | |
| 4 | 34.8 | 39.2 | 38.9 | |
| 5 | 41.5 | 44.2 | 37.2 | |
| 6 | 42.0 | 34.8 | 41.3 | |
| 7 | 28.6 | 32.1 | 39.9 | |
| 8 | 33.2 | 42.9 | 37.7 | |
| 9 | 33.5 | 39.1 | 41.2 | |
| 10 | 48.1 | 42.1 | 56.9 | |
| Average | 37.84 | 42.48 | 41.9 | |
| Std dev.a | 11.07 | 7.84 | 1.45 | 5.94 |
Sample size: 200 cases and 200 controls, 1000 replicates, penetrance: 0.6 for the homozygotes carrying the two susceptibility alleles, 0.3 for the other genotypes. The corresponding type I errors are indicated in italics within brackets. For each tree, the method giving the best result is underlined. CT: Cladistic Test, CLADHC: test developed by Durrant et al. [29], HT: Haplotypic Test, SbST: Site by Site Test
a Standard deviation
Figure 1Comparison of the efficiency to localize the susceptibility site between CT, and SbST or CLADHC. The efficiency to localize the susceptibility site is measured by the percentage of replicates in which the simulated susceptibility site is either uniquely detected or detected among other putative sites. black circles: comparison of CT and SbST, crosses: comparison of CT and CLADHC. Each symbol corresponds to one of the ten tree topologies.
Figure 2Comparison of the efficiency to localize the two susceptibility sites between CT and SbST. The efficiency to localize the two susceptibility sites is measured by the percentage of replicates in which the two sites are detected (black circles) or at least one of the two sites (white circles) Each symbol corresponds to one of the ten tree topologies. (A): penetrance 0.9 for homozygotes carrying the two susceptibility alleles and 0.3 for all other genotypes; (B): penetrance 0.6 for homozygotes carrying the two susceptibility alleles and 0.3 for all other genotypes.
Cladistic analysis of Crohn's data 1000 equiparsimonious trees are studied. SNPs are ranked from the least to the most significant. In bold, the presumed susceptibility sites previously identified by Hugot et al. [30].
| least significant | --------> | most significant | ||||||
| mfHa | SNP 5 | SNP 9 | SNP 6 | SNP 7 | ||||
| consHb | SNP 9 | SNP 5 | SNP 6 | SNP 7 | ||||
a Rooting on the most frequent haplotype
b Rooting on a consensus sequence
Figure 3Description of the nested clade analysis. (A) shows the homogeneity test performed at level k (between clades C1 and C2). If it is not significant (B), a test will be performed at the following level (k+1), between all the sub-clades descending from clades C1 and C2, i.e between clades C1.1, C1.2, C2.1 and C2.2 (3 degree of freedom). If it is significant the analysis ends because an association is detected.