| Literature DB >> 19812787 |
Ao Yuan1, Guanjie Chen, Zhong-Cheng Zhou, George Bonney, Charles Rotimi.
Abstract
Gene copy number changes are common characteristics of many genetic disorders. A new technology, array comparative genomic hybridization (a-CGH), is widely used today to screen for gains and losses in cancers and other genetic diseases with high resolution at the genome level or for specific chromosomal region. Statistical methods for analyzing such a-CGH data have been developed. However, most of the existing methods are for unrelated individual data and the results from them provide explanation for horizontal variations in copy number changes. It is potentially meaningful to develop a statistical method that will allow for the analysis of family data to investigate the vertical kinship effects as well. Here we consider a semiparametric model based on clustering method in which the marginal distributions are estimated nonparametrically, and the familial dependence structure is modeled by copula. The model is illustrated and evaluated using simulated data. Our results show that the proposed method is more robust than the commonly used multivariate normal model. Finally, we demonstrated the utility of our method using a real dataset.Entities:
Keywords: cluster; copula; family data; gene copy number; semiparametric model
Year: 2008 PMID: 19812787 PMCID: PMC2735963 DOI: 10.4137/bbi.s839
Source DB: PubMed Journal: Bioinform Biol Insights ISSN: 1177-9322
Kinship coeffcient for selected relative pairs.
| Relationship | Δ7 | Δ8 | Δ9 | γ | τ |
|---|---|---|---|---|---|
| Grand parent-offspring | 0 | 1/2 | 1/2 | 1/8 | 1/4 |
| Parent-Offspring | 0 | 1 | 0 | 1/4 | 1/2 |
| Half Siblings | 0 | 1/2 | 1/2 | 1/8 | 1/4 |
| Full Siblings | 1/4 | 1/2 | 1/4 | 1/4 | 1/2 |
| First Cousins | 0 | 1/4 | 3/4 | 1/16 | 1/8 |
| Double First Cousins | 1/16 | 6/16 | 9/16 | 1/8 | 1/4 |
| Second Cousins | 0 | 1/16 | 15/16 | 1/64 | 1/32 |
| Uncle-Nephew | 0 | 1/2 | 1/2 | 1/8 | 1/4 |
Cluster results for normal copula and multi-normal models for 10 pedigrees and 200 loci.
| Data | λ | Cluster 1 | Cluster 2 | Cluster 3 | Log-likelihood |
|---|---|---|---|---|---|
| Normal Copula | 0.25 | 1–80 | 81–150 | 151–200 | −11438825.76 |
| 0.50 | 1–80 | 81–150 | 151–200 | −22088970.51 | |
| 0.75 | 1–80 | 81–150 | 151–200 | −32749199.84 | |
| 1.00 | 1–80 | 81–150 | 151–200 | −43412062.82 | |
| Multi-normal Model | 0.50 | 1–80 | 81–150(+38) | 151–200(−38) | −3117275.81 |
| 0.75 | 1–80 | 81–150 | 151–200 | −3644388.69 | |
| 1.00 | 1–80 | 81–150 | 151–200 | −4652713.94 |
Summary cluster results from normal copula and multi-normal data sets for 100 pedigrees with 4 or 5 Family Members.
| Pedigree size | Data | λ | Cluster 1 | Cluster 2 | Cluster 3 | Log-likelihood |
|---|---|---|---|---|---|---|
| 4 | Normal Copula | 0.25 | 1–80 | 81–150 | 151–200 | −5503651.81 |
| 0.50 | 1–80 | 81–150 | 151–200 | −10207757.93 | ||
| 0.75 | 1–80 | 81–150 | 151–200 | −14924762.04 | ||
| 1.00 | 1–80 | 81–150 | 151–200 | −19645406.52 | ||
| 4 | Multi-normal Model | 0.50 | 1–80 | 81–150(+9) | 151–200(−9) | −1716690.87 |
| 0.75 | 1–80 | 81–150 | 151–200 | −2213287.79 | ||
| 1.00 | 1–80 | 81–150 | 151–200 | −2726810.52 | ||
| 5 | Normal Copula | 0.25 | 1–80(−7) | 81–150(+7) | 151–200 | −7841636.12 |
| 0.50 | 1–80(−7) | 81–150(+7) | 151–200 | −15364258.65 | ||
| 0.75 | 1–80(−7) | 81–150(+7) | 151–200 | −22940970.23 | ||
| 1.00 | 1–80(−6) | 81–150(+6) | 151–200 | −28643031.41 | ||
| 5 | Multi-normal Model | 0.25 | 1–80 | 81–150 | 151–200 | −1639830.86 |
| 0.50 | 1–80 | 81–150 | 151–200 | −2250676.24 | ||
| 0.75 | 1–80 | 81–150 | 151–200 | −2874104.71 | ||
| 1.00 | 1–80 | 81–150 | 151–200 | −3503763.43 |