| Literature DB >> 31703552 |
Audrey Mauguen1, Venkatraman E Seshan2, Irina Ostrovnaya2, Colin B Begg2.
Abstract
BACKGROUND: We previously introduced a random-effects model to analyze a set of patients, each of which has two distinct tumors. The goal is to estimate the proportion of patients for which one of the tumors is a metastasis of the other, i.e. where the tumors are clonally related. Matches of mutations within a tumor pair provide the evidence for clonal relatedness. In this article, using simulations, we compare two estimation approaches that we considered for our model: use of a constrained quasi-Newton algorithm to maximize the likelihood conditional on the random effect, and an Expectation-Maximization algorithm where we further condition the random-effect distribution on the data.Entities:
Keywords: Cancer; Clonality; EM algorithm; Parameter estimation; Random effect model; Tumor mutation
Mesh:
Year: 2019 PMID: 31703552 PMCID: PMC6839069 DOI: 10.1186/s12859-019-3148-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Study of contralateral breast cancers
| Case # | Somatic mutations | Details of matches | |||
|---|---|---|---|---|---|
| Left breast | Right breast | Matches | Mutations | Probabilities | |
| 1 | 9 | 7 | 0 | ||
| 2 | 3 | 3 | 0 | ||
| 3 | 2 | 7 | 0 | ||
| 4 | 8 | 10 | 0 | ||
| 6 | 6 | 5 | 0 | ||
| 8 | 6 | 2 | 1 | < 1/1000 | |
| 9 | 2 | 3 | 0 | ||
| 12 | 14 | 3 | 0 | ||
| 13 | 3 | 3 | 0 | ||
| 15 | 8 | 5 | 0 | ||
| 16 | 10 | 8 | 0 | ||
| 17 | 6 | 8 | 0 | ||
| 18 | 8 | 2 | 0 | ||
| 21 | 4 | 3 | 0 | ||
| 23 | 10 | 4 | 0 | ||
| 24 | 4 | 3 | 0 | ||
| 25 | 4 | 6 | 0 | ||
| 26 | 6 | 5 | 0 | ||
| 27 | 4 | 5 | 0 | ||
| 29 | 3 | 1 | 0 | ||
| 30 | 6 | 5 | 0 | ||
| 31 | 6 | 5 | 0 | ||
| 32 | 5 | 4 | 0 | ||
| 33 | 6 | 4 | 0 | ||
| 35 | 5 | 4 | 0 | ||
| 36 | 3 | 4 | 3 | < 1/1000 | |
| < 1/1000 | |||||
| < 1/1000 | |||||
| 38 | 8 | 2 | 0 | ||
| 40 | 10 | 1 | 0 | ||
| 41 | 0 | 9 | 0 | ||
| 43 | 4 | 4 | 0 | ||
| 44 | 9 | 21 | 0 | ||
| 45 | 3 | 4 | 0 | ||
| 48 | 2 | 3 | 2 | < 1/1000 | |
| < 1/1000 | |||||
| 52 | 5 | 7 | 0 | ||
| 56 | 2 | 5 | 0 | ||
| 58 | 3 | 4 | 0 | ||
| 59 | 2 | 3 | 0 | ||
| 62 | 4 | 4 | 0 | ||
| 63 | 3 | 9 | 1 | 0.137 | |
| 64 | 5 | 4 | 0 | ||
| 66 | 33 | 3 | 0 | ||
| 67 | 4 | 1 | 1 | 0.137 | |
| 70 | 5 | 2 | 0 | ||
| 71 | 3 | 1 | 0 | ||
| 72 | 1 | 3 | 0 | ||
| 74 | 2 | 1 | 0 | ||
| 75 | 4 | 3 | 1 | 0.137 | |
| 76 | 7 | 5 | 0 | ||
| 77 | 3 | 1 | 0 | ||
Fig. 1Log-normal distributions of the clonality signal
Simulation results
| One-step maximization | EM algorithm | EM algorithm - subset | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N cases | True | Scenario | mean | (sd) | range | N 0-1 | mean | (sd) | range | N 0-1 | mean | (sd) | range | N 0-1 |
| 100 | 0.10 | 1: | 0.127 | (0.126) | 0.010-1.000 | 0-7 | 0.086 | (0.036) | 0.010-0.202 | 0-0 | 0.076 | (0.037) | 0.034-0.138 | 0-0 |
| 2: | 0.105 | (0.038) | 0.020-0.234 | 0-0 | 0.099 | (0.033) | 0.020-0.212 | 0-0 | ||||||
| 3: | 0.101 | (0.031) | 0.030-0.220 | 0-0 | 0.101 | (0.031) | 0.030-0.220 | 0-0 | ||||||
| 0.25 | 1: | 0.259 | (0.091) | 0.079-0.729 | 0-0 | 0.214 | (0.051) | 0.077-0.387 | 0-0 | |||||
| 2: | 0.250 | (0.049) | 0.121-0.387 | 0-0 | 0.245 | (0.047) | 0.121-0.377 | 0-0 | ||||||
| 3: | 0.252 | (0.043) | 0.130-0.380 | 0-0 | 0.252 | (0.043) | 0.130-0.380 | 0-0 | ||||||
| 0.50 | 1: | 0.518 | (0.113) | 0.245-0.881 | 0-0 | 0.440 | (0.066) | 0.230-0.621 | 0-0 | |||||
| 2: | 0.498 | (0.055) | 0.325-0.640 | 0-0 | 0.490 | (0.054) | 0.319-0.624 | 0-0 | ||||||
| 3: | 0.498 | (0.049) | 0.350-0.620 | 0-0 | 0.498 | (0.049) | 0.350-0.620 | 0-0 | ||||||
| 0.75 | 1: | 0.756 | (0.116) | 0.495-1.000 | 0-31 | 0.662 | (0.068) | 0.477-0.924 | 0-0 | 0.758 | (0.052) | 0.623-0.924 | 0-0 | |
| 2: | 0.747 | (0.050) | 0.616-0.881 | 0-0 | 0.738 | (0.049) | 0.609-0.875 | 0-0 | ||||||
| 3: | 0.748 | (0.043) | 0.630-0.850 | 0-0 | 0.748 | (0.043) | 0.630-0.850 | 0-0 | ||||||
| 50 | 0.10 | 1: | 0.138 | (0.193) | 0.000-1.000 | 19-18 | 0.083 | (0.049) | 0.000-0.265 | 11-0 | 0.083 | (0.070) | 0.000-0.265 | 11-0 |
| 2: | 0.113 | (0.079) | 0.000-1.000 | 4-1 | 0.101 | (0.048) | 0.000-0.272 | 3-0 | 0.038 | (0.056) | 0.000-0.125 | 3-0 | ||
| 3: | 0.100 | (0.042) | 0.000-0.260 | 2-0 | 0.100 | (0.042) | 0.000-0.260 | 2-0 | 0 | (0.000) | 0.000-0.000 | 2-0 | ||
| 0.25 | 1: | 0.270 | (0.145) | 0.043-1.000 | 0-4 | 0.210 | (0.071) | 0.043-0.456 | 0-0 | 0.194 | (0.049) | 0.122-0.234 | 0-0 | |
| 2: | 0.255 | (0.076) | 0.100-0.714 | 0-0 | 0.245 | (0.064) | 0.101-0.447 | 0-0 | ||||||
| 3: | 0.248 | (0.061) | 0.100-0.440 | 0-0 | 0.248 | (0.061) | 0.100-0.440 | 0-0 | ||||||
| 0.50 | 1: | 0.520 | (0.154) | 0.222-1.000 | 0-7 | 0.441 | (0.091) | 0.212-0.804 | 0-0 | 0.64 | (0.097) | 0.494-0.804 | 0-0 | |
| 2: | 0.501 | (0.075) | 0.296-0.739 | 0-0 | 0.492 | (0.073) | 0.293-0.713 | 0-0 | ||||||
| 3: | 0.498 | (0.069) | 0.320-0.700 | 0-0 | 0.498 | (0.069) | 0.320-0.700 | 0-0 | ||||||
| 0.75 | 1: | 0.747 | (0.143) | 0.480-1.000 | 0-52 | 0.659 | (0.091) | 0.469-0.933 | 0-0 | 0.783 | (0.072) | 0.650-0.933 | 0-0 | |
| 2: | 0.746 | (0.075) | 0.530-1.000 | 0-2 | 0.736 | (0.071) | 0.527-0.938 | 0-0 | 0.926 | (0.018) | 0.913-0.938 | 0-0 | ||
| 3: | 0.746 | (0.060) | 0.600-0.920 | 0-0 | 0.746 | (0.060) | 0.600-0.920 | 0-0 | ||||||
| 25 | 0.10 | 1: | 0.128 | (0.197) | 0.000-1.000 | 101-18 | 0.099 | (0.079) | 0.000-0.443 | 46-0 | 0.088 | (0.112) | 0.000-0.441 | 46-0 |
| 2: | 0.118 | (0.121) | 0.000-1.000 | 46-4 | 0.103 | (0.063) | 0.000-0.365 | 26-0 | 0.056 | (0.084) | 0.000-0.365 | 26-0 | ||
| 3: | 0.103 | (0.061) | 0.000-0.330 | 29-0 | 0.101 | (0.056) | 0.000-0.280 | 22-0 | 0.018 | (0.045) | 0.000-0.228 | 22-0 | ||
| 0.25 | 1: | 0.276 | (0.192) | 0.000-1.000 | 6-9 | 0.216 | (0.103) | 0.039-0.543 | 0-0 | 0.222 | (0.108) | 0.039-0.432 | 0-0 | |
| 2: | 0.262 | (0.110) | 0.040-1.000 | 0-1 | 0.246 | (0.092) | 0.040-0.618 | 0-0 | 0.176 | 0.176-0.176 | 0-0 | |||
| 3: | 0.250 | (0.087) | 0.040-0.520 | 0-0 | 0.249 | (0.088) | 0.040-0.520 | 0-0 | ||||||
| 0.50 | 1: | 0.515 | (0.198) | 0.122-1.000 | 0-19 | 0.433 | (0.124) | 0.109-0.878 | 0-0 | 0.622 | (0.145) | 0.384-0.878 | 0-0 | |
| 2: | 0.505 | (0.110) | 0.201-1.000 | 0-1 | 0.492 | (0.102) | 0.201-0.846 | 0-0 | 0.490 | 0.490-0.490 | 0-0 | |||
| 3: | 0.500 | (0.096) | 0.200-0.760 | 0-0 | 0.500 | (0.096) | 0.200-0.760 | 0-0 | ||||||
| 0.75 | 1: | 0.752 | (0.175) | 0.358-1.000 | 0-82 | 0.665 | (0.136) | 0.332-1.000 | 0-2 | 0.835 | (0.097) | 0.614-1.000 | 0-2 | |
| 2: | 0.752 | (0.098) | 0.489-1.000 | 0-5 | 0.741 | (0.094) | 0.483-1.000 | 0-1 | 0.975 | (0.028) | 0.941-1.000 | 0-1 | ||
| 3: | 0.749 | (0.084) | 0.480-0.960 | 0-0 | 0.749 | (0.084) | 0.480-0.960 | 0-0 | ||||||
The EM algorithm – subset results present the estimates obtained with the EM algorithm for the datasets where the one-step maximization gave results on the boundary. N 0-1 shows the number of times the estimate was exactly 0 - number of times it was exactly 1