| Literature DB >> 23912609 |
N Feller1, V H J van der Velden, R A Brooimans, N Boeckx, F Preijers, A Kelder, I de Greef, G Westra, J G Te Marvelde, P Aerts, H Wind, M Leenders, J W Gratama, G J Schuurhuis.
Abstract
Flow-cytometric detection of minimal residual disease (MRD) has proven in several single-institute studies to have an independent prognostic impact. We studied whether this relatively complex approach could be performed in a multicenter clinical setting. Five centers developed common protocols to accurately define leukemia-associated (immuno)phenotypes (LAPs) at diagnosis required to establish MRD during/after treatment. List mode data files were exchanged, and LAPs were designed by each center. One center, with extensive MRD experience, served as the reference center and coordinator. In quarterly meetings, consensus LAPs were defined, with the performance of centers compared with these. In a learning (29 patients) and a test phase (35 patients), a mean of 2.2 aberrancies/patient was detected, and only 1/63 patients (1.6%) had no consensus LAP(s). For the four centers without (extensive) MRD experience, clear improvement could be shown: in the learning phase, 39-63% of all consensus LAPs were missed, resulting in a median 30% of patients (range 21-33%) for whom no consensus LAP was reported; in the test phase, 27-40% missed consensus LAPs, resulting in a median 16% (range 7-18%) of 'missed' patients. The quality of LAPs was extensively described. Immunophenotypic MRD assessment in its current setting needs extensive experience and should be limited to experienced centers.Entities:
Year: 2013 PMID: 23912609 PMCID: PMC3763381 DOI: 10.1038/bcj.2013.27
Source DB: PubMed Journal: Blood Cancer J ISSN: 2044-5385 Impact factor: 11.037
Standard immunophenotypic panel
| 1 | PBS | PBS | CD45 | PBS |
| 2 | CD34 | CD22 | CD45 | CD117 |
| 3 | CD15 | CD13 | CD45 | CD14 |
| 4 | HLA-DR | CD33 | CD45 | CD11b |
| 5 | CD2 | CD56 | CD45 | CD7 |
| 6 | TdT/CD36 | CD133 | CD45 | CD19 |
Abbreviations: APC, allophycocyanin; ECD, energy coupled dye; FITC, fluorescein isothiocyanate; PBS, phosphate-buffered saline; PE, phycoerythrin; PerCP, peridinyl chlorophyllin.
Before use, monoclonal antibodies were centrally titrated, diluted to working solutions and distributed to the participating institutes. The first tube, containing CD45 in FL3 and PBS in the other channels, was used to discriminate between absence and presence of marker expression. During the test phase of the study, TdT was replaced by CD36.
Below is indicated per fluorochrome the antibody clone from BD/antibody clone from Immunotech (markers are in bold for reasons of clarity). In case of other suppliers, these are indicated in between parentheses.
FITC: CD34: 8G12/581; CD15: MMA/MMA; HLA-DR: L243/im.357; CD2: S5.2/39C1.5; TdT: HT-6 (DAKO)/HT-6 (DAKO); and CD36: FA6-152/CLB 703 (CLB).
PE: CD22: S-HCL-1/SJ10.1H11; CD13: SJ1D1/SJ1D1; CD33: P67.6/D3HL60.251; CD56: NCAM16.2/NCAM16.2; and CD133: CD133/2 (Miltenyi)/CD133/2 (Miltenyi).
PerCP/ECD: CD45: 2D1/J33.
APC/PE-CY5: CD117: 104D2/104D2D1; CD14: MoP9/RM052; CD11b: D12/Bear 1; CD7: M-T701/8H8.1; and CD19: SJ25C1/J4.119.
Fluorochrome used for BD FCM/fluorochrome used for Beckman Coulter FCM.
Figure 1Organization of the study. When any of the five centers entered an AML patient (top of the figure), it ran the monoclonal antibody panel shown in Table 1 and sent the LMD to the other centers. All centers then formulated their putative LAP(s) and sent these to the coordinator, who subsequently designed ‘proposed LAP(s)'. Finally, at the quarterly group meetings, ‘consensus LAP(s)' were agreed upon, and ‘final LAPs' were formulated for subsequent MRD studies.
Consensus LAPs in terms of sensitivity and specificity
| A | |||
|---|---|---|---|
| >50 (high) | High | 28 (51) | 37 (42) |
| Low | 7 (13) | 7 (8) | |
| 20–50 (intermediate) | High | 13 (24) | 29 (33) |
| Low | 2 (4) | 6 (7) | |
| 10–20 (low) | High | 4 (7) | 6 (7) |
| Low | 1 (2) | 3 (3) | |
| Total | 55 (101) | 88 (100) | |
Abbreviation: LAP, leukemia-associated (immuno)phenotype.
LAPs are defined by the following preference: first, LAPs with the highest sensitivity (>50% expression) and best specificity (<0.1% background) are denoted as ‘high/high', and with decreasing preference, this is followed by ‘intermediate/high', ‘high/low', ‘low/high', ‘intermediate/low' and, lastly, ‘low/low'.
LAPs were categorized as a function of sensitivity (‘high', >50% expression; ‘intermediate', 20–50% expression; and ‘low', 10–20% expression by the leukemic blasts) and specifitity (‘high',<0.1% background; ‘low', ⩾0.1% background).
In A, total number of consensus LAPs are denoted, with percentages of total number in learning or test phase; in B, total number total number of best consensus LAPs per patient (which thus equals the number of patients) are denoted, with percentages of total number in learning or test phase.
In learning phase, no LAPs were found in one patient (3.5%), one LAP in 38%, two LAPs in 41%, three LAPs in 0%, four LAPs in 14% and five LAPs in 3.5% of the cases (not shown in the table). In test phase, LAPs were found in all patients: one LAP in 28%, two LAPs in 26%, three LAPs in 17%, four LAPs in 23% and five LAPs in 6% of the cases (not shown in the table).
Summary of consensus LAPs
| 26 | |||
| 11 | |||
| 8 | |||
| 3 | |||
| 3 | |||
| 1 | |||
| 6 | 4.2 | <10 | |
| 4 | |||
| 1 | |||
| 1 | |||
| 5 | 3.5 | <10 | |
| 4 | |||
| 1 | |||
| 3 | 2.1 | <10 | |
| 2 | |||
| 1 | |||
| 2 | 1.4 | <10 | |
| 1 | |||
| 1 | |||
| 14 | 9.8 | ||
| 13 | |||
| 1 | |||
| 19 | |||
| 7 | |||
| 4 | |||
| 3 | |||
| 3 | |||
| 1 | |||
| 1 | |||
| 9 | 6.3 | <10 | |
| 4 | |||
| 2 | |||
| 1 | |||
| 1 | |||
| 1 | |||
| 8 | 5.6 | ||
| 4 | |||
| 2 | |||
| 1 | |||
| 1 | |||
| 7 | 4.9 | ||
| 7 | |||
| 6 | 4.2 | See legends | |
| 3 | |||
| 1 | |||
| 1 | |||
| 1 | |||
| 5 | 3.5 | <10 | |
| 4 | |||
| 1 | |||
| 4 | 2.8 | <10 | |
| 4 | |||
| 1 | 0.7 | ||
| 1 | |||
| <10 | |||
| 4 | 2.8 | <10 | |
| 2 | |||
| 1 | |||
| 1 | |||
| 4 | 2.8 | <10 | |
| 2 | |||
| 2 | |||
| 1 | 0.7 | <10 | |
| 1 | |||
| 1 | 0.7 | <10 | |
| 1 | |||
| 8 | 5.6 | <10 | |
| 5 | |||
| 1 | |||
| 1 | |||
| 1 | |||
| 3 | 2.1 | <10 | |
| 1 | |||
| 1 | |||
| 1 | |||
| 2 | 1.4 | <10 | |
| 1 | |||
| 1 | |||
| 2 | 1.4 | <10 | |
| 1 | |||
| 1 | |||
| 1 | 0.7 | <10 | |
| 1 | |||
| 1 | 0.7 | <10 | |
| 1 | |||
| 1 | 0.7 | <10 | |
| 1 | |||
| Total | 143 | 100 | |
Abbreviations: LAP, leukemia-associated (immuno)phenotype; MM, myeloid marker (usually CD13 or CD33, depending on diagnostic phenotype).
LAPs identified in the 64 BM samples using the standard immunophenotyping panel.
The mean number of LAPs per patients is 2.2 (143 LAPs/64 patients). In only one sample, no LAP could be defined.
A primitive marker (CD34, CD117 or CD133) was included in 125 of 143 (87%) of the LAPs.
‘Best consensus LAP' is defined as the LAP with highest sensitivity and specificity (see Results). More than one LAP may fulfill the same requirement, thereby defining more than one best consensus LAP per patient. As a result in the column used as ‘best consensus LAP (% of patients)', the total percentages exceed 100%.
CD34 alone was used in 42%, CD117 alone in 27% and CD133 alone in 5%, although any of the four possible combinations of at least two primitive markers were used in 18% of the cases (data not shown). In the latter cases, it was often possible to define for the same patient at least two LAPs based on different primitive markers.
LAPs with underexpression (usually absence) of markers were almost exclusively present in the group with asynchronous expression (data not shown) and not separately scored.
As two or more consensus LAPs were identified in many patients, the total number of consensus LAPs (i.e., 143) exceeded the total number of patients (i.e., 64).
Figure 2Aberrant expression of marker/marker combinations on WBC and immature cell fractions in normal BM. Different LAP immunophenotypes tested in normal BM. All LAPs contained CD45 and, if not indicated in the figure, a myeloid marker (CD13 or CD33) or CD117. (a) LAP expression as % of WBC. Note the relatively high expression of CD34+CD36+. (b) Aberrant marker expression as a percentage of primitive marker compartment (CD34 or CD117). Note logarithmic scale in a and b.
Percentages of missed LAPs by the different centers in learning and test phase
| 1 | 6/55 | 11 | 18/46 | 39 | 28/55 | 51 | 34/54 | 63 | 20/33 | 61 |
| 2 | 3/88 | 3 | 24/88 | 27 | 32/88 | 36 | 34/84 | 40 | 18/65 | 28 |
| Δ | −8 | −12 | −15 | −23 | −33 | |||||
Abbreviation: LAP, leukemia-associated (immuno)phenotype.
A LAP was noted as ‘missed' in cases where a LAP had not been defined with the aberrant marker agreed upon in the consensus LAP. Only institutes 1 and 3 evaluated all 64 samples, with a maximally reachable number of 143 LAPs. Institutes 2, 4 and 5 evaluated 59, 62 and 39 of the 64 samples, respectively. In the samples evaluated by these institutes, the maximally reachable number of LAPs was 134, 138 and 98, respectively, and percentages of missed LAPs were calculated using the latter numbers.
‘Δ' indicates difference in % between learning phase (phase 1) and test phase (phase 2).
Differences between the two phases were significant: P=0.015 (all institutes) and P=0.022 for institutes 2–5.
Figure 3Missed consensus LAPs. For 63 of the 64 patients reported here, at least one consensus LAP could be agreed upon. Cases with no LAP defined for a particular institute and a particular patient were scored as ‘missed patient'. Institutes 1–5 analyzed 29, 24, 29, 28 and 12 samples in the learning phase (gray bars) and 35, 35, 35, 34 and 27 samples in the test phase (black bars), respectively. Panel a shows the proportions of patients in whom no LAPS were defined and panel b shows the proportions of patients in whom extra LAPs were missed. (a) For institutes 1–5, percentages of patients who would not have been eligible for MRD assessment are shown (no LAPs were defined, whereas at least one consensus LAP was present). Improvement was 0%, 10%, 17%, 14% and 26% for institutes 1–5, respectively. Differences between the two phases were significant: P=0.035 (all institutes: n=5) and P=0.016 (institutes 2–5). (b) For institutes 1–5, the percentages of patients in whom only one LAP was defined, whereas at least two consensus LAPs were present, are shown. Only institute 2 had made considerable improvement (20%). Note that this figure does not include learning effects for all defined LAPs: these have been shown earlier in Table 4. In that table, all institutes made progression, which apparently is merely due to increased performance in defining LAPs additional to the two defined for Figure 3b.
Figure 4Performance of individual institutes in different sensitivity categories in the learning and test phase. The figure shows the performance of the four centers to define LAPs, now subdivided into the three sensitivity categories (i.e., high (>50%), black; intermediate (20–50%), dark gray; and low (10–20%), light gray). Missed LAPS in a sensitivity category are expressed as percentage of the maximal number of consensus LAPs defined in that specific sensitivity category. In the learning phase, the percentage of missed LAPs was significantly higher in the intermediate-sensitivity (marked *P=0.01) and in the low-sensitivity category (marked #P=0.001) than in the high-sensitivity category (all centers included). These differences were also significant when the coordinating center (1) was excluded from analysis (P-values ranging between 0.01 and 0.05). When comparing the learning and test phases, for centers 2–5 there was an improvement in the intermediate-sensitivity category (marked ‡P=0.016). There was also improvement in the lowest sensitivity category (marked §P=0.024).