| Literature DB >> 20701809 |
John M Henshall1, Vicki A Whan, Belinda J Norris.
Abstract
BACKGROUND: Repeated blocks of genome sequence have been shown to be associated with genetic diversity and disease risk in humans, and with phenotypic diversity in model organisms and domestic animals. Reliable tests are desirable to determine whether individuals are carriers of copy number variants associated with disease risk in humans and livestock, or associated with economically important traits in livestock. In some cases, copy number variants affect the phenotype through a dosage effect but in other cases, allele combinations have non-additive effects. In the latter cases, it has been difficult to develop tests because assays typically return an estimate of the sum of the copy number counts on the maternally and paternally inherited chromosome segments, and this sum does not uniquely determine the allele configuration. In this study, we show that there is an old solution to this new problem: segregation analysis, which has been used for many years to infer alleles in pedigreed populations.Entities:
Mesh:
Year: 2010 PMID: 20701809 PMCID: PMC2928190 DOI: 10.1186/1297-9686-42-34
Source DB: PubMed Journal: Genet Sel Evol ISSN: 0999-193X Impact factor: 4.297
Expected ratios and standard deviations of the assay
| Standard Deviation of Assay | ||||
|---|---|---|---|---|
| Copies | Expected Ratio | CV = 3% | CV = 6% | CV = 9% |
| 2 | 0.000 | NA | NA | NA |
| 3 | 0.333 | 0.010 | 0.020 | 0.030 |
| 4 | 0.500 | 0.015 | 0.030 | 0.045 |
| 5 | 0.600 | 0.018 | 0.036 | 0.054 |
| 6 | 0.667 | 0.020 | 0.040 | 0.060 |
| 7 | 0.714 | 0.021 | 0.043 | 0.064 |
| 8 | 0.750 | 0.023 | 0.045 | 0.068 |
| 9 | 0.778 | 0.023 | 0.047 | 0.070 |
| 10 | 0.800 | 0.024 | 0.048 | 0.072 |
| 11 | 0.818 | 0.025 | 0.049 | 0.074 |
| 12 | 0.833 | 0.025 | 0.050 | 0.075 |
The assay is not required for sheep with two copies as they are phenotypically black and easily identified
Estimated copy number frequencies (A) and allele frequencies (B) derived from assay results
| A. Estimated Copy Number Frequencies | B. Estimated Allele Frequencies | ||
|---|---|---|---|
| 2 | 0.00 | 0.10 | |
| 3 | 0.08 | 0.43 | |
| 4 | 0.29 | 0.41 | |
| 5 | 0.34 | 0.04 | |
| 6 | 0.23 | 0.00 | |
| 7 | 0.04 | 0.02 | |
| 8 | 0.02 | 0.00 | |
Data are from 87 white Merino ewes, and a CV of 3.0% for the assay scores was assumed
Copy number probabilities and penetrance values for an assay value of 0.675
| Copies | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
|---|---|---|---|---|---|---|---|
| Probability | 0.000 | 0.000 | 0.000 | 0.838 | 0.159 | 0.003 | 0.000 |
| Copy number allele from sire | 0.000 | 0.000 | 0.000 | 0.000 | 0.838 | 0.159 | |
| 0.000 | 0.000 | 0.000 | 0.838 | 0.159 | 0.003 | ||
| 0.000 | 0.000 | 0.838 | 0.159 | 0.003 | 0.000 | ||
| 0.000 | 0.838 | 0.159 | 0.003 | 0.000 | 0.000 | ||
| 0.838 | 0.159 | 0.003 | 0.000 | 0.000 | 0.000 | ||
| 0.159 | 0.003 | 0.000 | 0.000 | 0.000 | 0.000 | ||
The CV of the assay was assumed to be 3%; the penetrance values are proportional to the probability of an assay value of 0.675 given the genotype
Frequencies of alleles in the final generation of the simulated populations
| Frequency Allele | 0.0:0.1 | 0.1:0.2 | 0.2:0.3 | 0.3:0.4 | 0.4:0.5 | 0.5:0.6 | 0.6:0.7 | 0.7:0.8 | 0.8:0.9 |
|---|---|---|---|---|---|---|---|---|---|
| Ac1 | 0.79 | 0.05 | 0.08 | 0.08 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 |
| Ac2 | 0.07 | 0.14 | 0.14 | 0.11 | 0.18 | 0.16 | 0.07 | 0.08 | 0.04 |
| Ac3 | 0.04 | 0.13 | 0.14 | 0.10 | 0.19 | 0.16 | 0.07 | 0.11 | 0.06 |
| Ac4 | 0.89 | 0.04 | 0.03 | 0.03 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Ac5 | 1.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| Ac6 | 0.96 | 0.01 | 0.02 | 0.01 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
Frequencies of estimated probabilities of being a carrier
| Status | P < 0.001 | P < 0.01 | P < 0.05 | P > 0.95 | P > 0.99 | P > 0.999 |
|---|---|---|---|---|---|---|
| carrier | 0.00 | 0.00 | 0.07 | 0.48 | 0.48 | 0.48 |
| non-carrier | 0.07 | 0.25 | 0.71 | 0.00 | 0.00 | 0.00 |
For a lamb assayed and analysed without including pedigree data, given actual status (carrier or non-carrier); the assay CV simulated and CV used in the analysis were both 3%
Frequencies of estimated probabilities of being a carrier
| Status | Half-sibs | P < 0.001 | P < 0.01 | P < 0.05 | P > 0.95 | P > 0.99 | P > 0.999 |
|---|---|---|---|---|---|---|---|
| carrier | 1 | 0.00 | 0.01 | 0.04 | 0.48 | 0.48 | 0.48 |
| carrier | 2 | 0.00 | 0.01 | 0.04 | 0.48 | 0.48 | 0.48 |
| carrier | 4 | 0.00 | 0.01 | 0.04 | 0.50 | 0.50 | 0.50 |
| carrier | 8 | 0.00 | 0.00 | 0.02 | 0.58 | 0.54 | 0.53 |
| carrier | 16 | 0.00 | 0.00 | 0.01 | 0.66 | 0.62 | 0.57 |
| carrier | 32 | 0.00 | 0.00 | 0.01 | 0.72 | 0.70 | 0.66 |
| non-carrier | 1 | 0.20 | 0.46 | 0.69 | 0.00 | 0.00 | 0.00 |
| non-carrier | 2 | 0.28 | 0.47 | 0.69 | 0.00 | 0.00 | 0.00 |
| non-carrier | 4 | 0.35 | 0.57 | 0.73 | 0.00 | 0.00 | 0.00 |
| non-carrier | 8 | 0.46 | 0.68 | 0.79 | 0.00 | 0.00 | 0.00 |
| non-carrier | 16 | 0.63 | 0.75 | 0.81 | 0.00 | 0.00 | 0.00 |
| non-carrier | 32 | 0.72 | 0.78 | 0.82 | 0.00 | 0.00 | 0.00 |
For progeny, given actual status (carrier or non-carrier) and number of half-sibs in family; the assay CV simulated and CV used in the analysis were both 3% and the progeny and sire in each family were assayed
Frequencies of estimated probabilities of being a carrier
| Status | Pedigree | Assay | P < 0.001 | P < 0.01 | P < 0.05 | P > 0.95 | P > 0.99 | P > 0.999 |
|---|---|---|---|---|---|---|---|---|
| carrier | sire | p | 0.00 | 0.01 | 0.03 | 0.71 | 0.62 | 0.58 |
| carrier | sire | ps | 0.00 | 0.00 | 0.01 | 0.88 | 0.85 | 0.80 |
| carrier | sire | psd | 0.00 | 0.00 | 0.00 | 0.98 | 0.98 | 0.96 |
| carrier | progeny | p | 0.00 | 0.01 | 0.03 | 0.56 | 0.52 | 0.50 |
| carrier | progeny | ps | 0.00 | 0.00 | 0.01 | 0.66 | 0.62 | 0.57 |
| carrier | progeny | psd | 0.00 | 0.00 | 0.02 | 0.73 | 0.71 | 0.68 |
| carrier | dam | p | 0.00 | 0.00 | 0.00 | 0.15 | 0.09 | 0.06 |
| carrier | dam | ps | 0.00 | 0.00 | 0.00 | 0.26 | 0.22 | 0.15 |
| carrier | dam | psd | 0.00 | 0.00 | 0.03 | 0.78 | 0.74 | 0.69 |
| non-carrier | sire | p | 0.37 | 0.67 | 0.84 | 0.00 | 0.00 | 0.00 |
| non-carrier | sire | ps | 0.84 | 0.93 | 0.96 | 0.00 | 0.00 | 0.00 |
| non-carrier | sire | psd | 0.98 | 0.99 | 0.99 | 0.00 | 0.00 | 0.00 |
| non-carrier | progeny | p | 0.40 | 0.61 | 0.77 | 0.00 | 0.00 | 0.00 |
| non-carrier | progeny | ps | 0.63 | 0.75 | 0.81 | 0.00 | 0.00 | 0.00 |
| non-carrier | progeny | psd | 0.80 | 0.83 | 0.90 | 0.00 | 0.00 | 0.00 |
| non-carrier | dam | p | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| non-carrier | dam | ps | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
| non-carrier | dam | psd | 0.58 | 0.69 | 0.85 | 0.00 | 0.00 | 0.00 |
Given actual status (carrier or non-carrier), position in pedigree (sire, dam or progeny), and assay strategy (p = progeny only, ps = progeny and sires, psd = progeny, sires and dams); the assay CV simulated and CV used in the analysis were both 3% and 16 half-sibs in each family were assayed
Frequencies of estimated probabilities of being a carrier
| Status | CVsim | CVan | P < 0.001 | P < 0.01 | P < 0.05 | P > 0.95 | P > 0.99 | P > 0.999 |
|---|---|---|---|---|---|---|---|---|
| carrier | 0.030 | 0.015 | 0.00 | 0.01 | 0.02 | 0.68 | 0.65 | 0.61 |
| carrier | 0.030 | 0.030 | 0.00 | 0.00 | 0.01 | 0.66 | 0.62 | 0.57 |
| carrier | 0.060 | 0.030 | 0.01 | 0.02 | 0.04 | 0.61 | 0.58 | 0.55 |
| carrier | 0.060 | 0.060 | 0.00 | 0.01 | 0.03 | 0.55 | 0.51 | 0.48 |
| carrier | 0.090 | 0.045 | 0.01 | 0.02 | 0.06 | 0.56 | 0.53 | 0.51 |
| carrier | 0.090 | 0.090 | 0.00 | 0.01 | 0.04 | 0.46 | 0.38 | 0.25 |
| non-carrier | 0.030 | 0.015 | 0.68 | 0.76 | 0.80 | 0.00 | 0.00 | 0.00 |
| non-carrier | 0.030 | 0.030 | 0.63 | 0.75 | 0.81 | 0.00 | 0.00 | 0.00 |
| non-carrier | 0.060 | 0.030 | 0.48 | 0.64 | 0.74 | 0.00 | 0.00 | 0.00 |
| non-carrier | 0.060 | 0.060 | 0.33 | 0.59 | 0.76 | 0.00 | 0.00 | 0.00 |
| non-carrier | 0.090 | 0.045 | 0.32 | 0.52 | 0.69 | 0.01 | 0.00 | 0.00 |
| non-carrier | 0.090 | 0.090 | 0.14 | 0.44 | 0.69 | 0.00 | 0.00 | 0.00 |
For progeny, given actual status (carrier or non-carrier), CV used in the simulation (CVsim) and CV used in the analysis (CVan); the sire and 16 progeny in each family were assayed