Literature DB >> 34885257

An Extensive Quality Control and Quality Assurance (QC/QA) Program Significantly Improves Inter-Laboratory Concordance Rates of Flow-Cytometric Minimal Residual Disease Assessment in Acute Lymphoblastic Leukemia: An I-BFM-FLOW-Network Report.

Margarita Maurer-Granofszky1, Angela Schumich1, Barbara Buldini2, Giuseppe Gaipa3, Janos Kappelmayer4, Ester Mejstrikova5, Leonid Karawajew6, Jorge Rossi7, Adın Çınar Suzan8, Evangelina Agriello9, Theodora Anastasiou-Grenzelia10, Virna Barcala11, Gábor Barna12, Drago Batinić13, Jean-Pierre Bourquin14, Monika Brüggemann15, Karolina Bukowska-Strakova16, Hasan Burnusuzov17, Daniela Carelli18, Günnur Deniz8, Klara Dubravčić13, Tamar Feuerstein19, Marie Isabel Gaillard20, Adriana Galeano21, Hugo Giordano22, Alejandro Gonzalez23, Stefanie Groeneveld-Krentz6, Zsuzsanna Hevessy4, Ondrej Hrusak5, Maria Belen Iarossi24, Pál Jáksó25, Veronika Kloboves Prevodnik26,27, Saskia Kohlscheen15, Elena Kreminska28, Oscar Maglia3, Cecilia Malusardi29, Neda Marinov30, Bibiana Maria Martin31, Claudia Möller14, Sergey Nikulshin32, Jorge Palazzi33, Georgios Paterakis34, Alexander Popov35, Richard Ratei36, Cecilia Rodríguez37, Elisa Olga Sajaroff7, Simona Sala3, Gordana Samardzija38, Mary Sartor39, Pamela Scarparo2, Łukasz Sędek40, Bojana Slavkovic38, Liliana Solari41, Peter Svec42, Tomasz Szczepanski43, Anna Taparkou44, Montserrat Torrebadell45, Marianna Tzanoudaki46, Elena Varotto2, Helly Vernitsky47, Andishe Attarbaschi48, Martin Schrappe49, Valentino Conter50, Andrea Biondi50, Marisa Felice51, Myriam Campbell30, Csongor Kiss52, Giuseppe Basso2, Michael N Dworzak1,48.   

Abstract

Monitoring of minimal residual disease (MRD) by flow cytometry (FCM) is a powerful prognostic tool for predicting outcomes in acute lymphoblastic leukemia (ALL). To apply FCM-MRD in large, collaborative trials, dedicated laboratory staff must be educated to concordantly high levels of expertise and their performance quality should be continuously monitored. We sought to install a unique and comprehensive training and quality control (QC) program involving a large number of reference laboratories within the international Berlin-Frankfurt-Münster (I-BFM) consortium, in order to complement the standardization of the methodology with an educational component and persistent quality control measures. Our QC and quality assurance (QA) program is based on four major cornerstones: (i) a twinning maturation program, (ii) obligatory participation in external QA programs (spiked sample send around, United Kingdom National External Quality Assessment Service (UK NEQAS)), (iii) regular participation in list-mode-data (LMD) file ring trials (FCM data file send arounds), and (iv) surveys of independent data derived from trial results. We demonstrate that the training of laboratories using experienced twinning partners, along with continuous educational feedback significantly improves the performance of laboratories in detecting and quantifying MRD in pediatric ALL patients. Overall, our extensive education and quality control program improved inter-laboratory concordance rates of FCM-MRD assessments and ultimately led to a very high conformity of risk estimates in independent patient cohorts.

Entities:  

Keywords:  acute lymphoblastic leukemia; flow cytometry; minimal residual disease; quality control program

Year:  2021        PMID: 34885257      PMCID: PMC8656726          DOI: 10.3390/cancers13236148

Source DB:  PubMed          Journal:  Cancers (Basel)        ISSN: 2072-6694            Impact factor:   6.639


1. Introduction

The application of flow cytometric minimal residual disease measurement (FCM-MRD) has become a routine clinical practice in most current study protocols worldwide, in order to assign patients to different risk groups and to tailor treatment intensity [1,2,3,4,5,6]. FCM-MRD is applicable in almost all (>98%) patients with lymphoblastic leukemia and can reach sensitivities similar to a quantitative real-time polymerase chain reaction (RQ-PCR) (≥10−5) if it is ensured that the laboratories involved in MRD testing are able to reliably detect and quantify leukemic cells, even in the low MRD range [1]. Several national centers and international consortia previously worked on standardizing the FCM-MRD methodology in multi-centric settings to improve inter-laboratory concordance rates, and to assure sensitive and accurate MRD assessments [5,7,8,9,10,11]. The allied Associazione Italiana di Ematologia Oncologia Pediatrica (AIEOP) and the Berlin-Frankfurt-Münster (BFM) FCM-MRD study groups, which have collaborated since May 2000, initiated a standardization program based on education of personnel, technical alignment, and on continued quality control (QC) [9]. This study, involving four experienced national reference laboratories, proved that FCM-MRD can be standardized for multi-centric application in large trials. Such a multi-facetted QC/quality assurance (QA) program included the standardization of the methodology, continuous staff training, performance reviews through sample and list-mode data (LMD) exchange, as well as the assessment of concordance of “on trial” MRD results from independent patient cohorts [9]. Similarly, an educational component complementing the standardization of the FCM-MRD methodology was necessary to warrant concordant quality of MRD assessment in a study involving eight experienced centers in North America [12]. In the current study, we performed a unique and comprehensive training and QC/QA program involving a total of 41 laboratories collaborating in the international (I)-BFM consortium. Of note, many associated laboratories of the network were inexperienced when entering the program. Our approach consists of four major building blocks: (i) a twinning maturation program, where well-trained expert laboratories work closely with more inexperienced laboratories to improve their performance in individual feedback sessions; (ii) the obligatory participation in external QA programs (spiked sample send around), e.g., United Kingdom National External Quality Assessment Service (UK NEQAS); (iii) regular participation in the analysis of LMD file ring trials (data file send arounds) with an educational component, where difficult and discordant cases are jointly discussed; and (iv) surveys of independent data derived from trial results, where the agreement of different centers, with respect to MRD results from their locally recruited patient cohorts, were tested. We demonstrate that the training of laboratories using experienced twinning partners along with continuous educational feedback, significantly improves the performance of laboratories to detect and quantify MRD in pediatric ALL patients. In addition to a standardized methodology, educational feedback and quality oversight by a coordinating structure is required to ensure the sufficient quality of MRD assessment for clinical routine.

2. Materials and Methods

2.1. Samples

MRD investigations were approved as part of the international trial by the institutional ethical committees and were conducted according to informed consent guidelines. For list-mode file ring trials, we used bone marrow (BM) samples from pediatric patients (0–≤18 years of age) recruited in the AIEOP-BFM-ALL clinical trial in Vienna (AUT). In total, data from 101 BM samples from regenerative post-induction time-points of pediatric patients with B-ALL were used for the LMD file ring trials (“B-cell precursor (BCP) Regeneration”) during 2016–2020. These regenerative bone marrows are usually characterized by the appearance of BCP subsets at different stages of (BCP 1–3) [13]. Of the 101 BM samples, 53 were FCM-MRD negative, and 48 samples were FCM-MRD positive (range 0.004–18.16%). If available, RT-PCR data served as an independent confirmation of the FCM-MRD target votes. The target MRD values were calculated as a median from six experienced and matured AIEOP-BFM reference laboratories (Vienna, Monza, Padova, Prague, Kiel and Berlin).

Sample Preparation

Detailed standard operating procedures (SOP) from the AIEOP-BFM (Supplementary Materials File S1 “SOP FLOW-MRD 1.5 AEIOP-BFM”) as well as from the intercontinental ALL- IC reference centers can be found in the Supplementary Materials (Supplementary Materials File S2 and S3 “SOP ALL-IC 2009” and “Amendment to SOP ALL-IC-BFM_2014”). Samples for the generation of LMD files were prepared at the coordinating center in Vienna briefly as follows: whole BM samples were stained either with a two-tube approach (7- and 8-color panel, 2016–2017, Table S1) or with a single, dried, 10-color format tube (ReALB, DuraCloneTM, Beckman Coulter, Brea, CA, USA), Table S1) (2018–2020) for 15′ and lysed with a fixing lysis reagent (BD Lyse, Becton Dickinson, Franklin Lanes, Evansville, EL, USA) for another 15′. Samples were washed once with 1× phosphate-buffered saline (PBS, Gibco, Thermo Fisher Scientific Inc., Waltham, MA, USA), resuspended in 200 µL 1×PBS and measured immediately. Samples were acquired using an LSRII flow cytometer (Becton Dickinson) and analyzed at the reference center using DIVA software (Becton Dickinson). Syto41 staining was used to exclude non-nucleated events such as erythrocytes, platelets, and debris. For detailed information on the gating strategy, see Supplementary Materials S1–S3. MRD was reported as a percentage of Syto41-positive cells with exclusion of doublets using FSC-H/FSC-A gating. Minimal residual disease positivity was defined as an accumulation of ≥10 cells with a leukemic phenotype [9].

2.2. I-BFM-FLOW Twinning and Maturation Program

A total number of 29 laboratories participated in the twinning and maturation program of which 22 were ALL IC-BFM (Acute Lymphoblastic Leukemia Intercontinental Berlin-Frankfurt-Münster) trainee laboratories. Seven experienced and already matured AIEOP-BFM laboratories served as twinning and training partners. “Maturation” is granted to a center according to specific criteria and is defined as the permission to release MRD results for clinical use without further control by the partner laboratory (“twinning partner”). “Maturation” can be granted for either B-lineage ALL-MRD at day 15/d33 or at regenerative time-points, and for T-ALL-MRD (independent of time-point). “Maturation” is granted after completion of a series of at least 25 different locally and consecutively recruited patient sample pairs (diagnosis and MRD time-points) per lineage, the analysis of which, along with the results, was promptly submitted to the twinning partner for review. Harmonization and technical alignment of the FCM-MRD technology including machine settings, panel composition, sample preparation as well as staining was an essential part of the twinning maturation program and took place before the test series was started. During the initial phase of the program, an intensive interaction and discussion between the trainee laboratory and its twinning partner following each case ensured a rapid and effective training progress. Therefore, the first half of the samples serves, for the most part, an intensive training, the immediate success of which should and must be proven with the second half of the samples. In the second half of the sample series, each gross error then requires a further three MRD assessments of consecutive cases without errors to achieve maturation. Gross errors are defined as inadequacy of staining and analysis, errors in blast population, or gross misinterpretation or misdocumentation on a predefined report sheet. Provided that no more gross errors in identifying and/or quantifying MRD are detected in the most recent half (i.e., 12 patients) of the sample series, maturation is granted. For a detailed description of the guidelines, see Supplementary Materials Files S4 and S5 “I-BFM FLOW Twinning Maturation” and “Maturation Certificate”. Feedback by training laboratories was mainly provided via email and/or phone or during training visits and included recommendations regarding further optimization of technical issues (e.g., compensation, etc.) as well as the gating strategy or explanations of phenotypic properties of normal and aberrant cells, respectively.

2.3. External Quality Assessment (EQA)

UK NEQAS (https://ukneqas.org.uk/ (accessed on 30 November 2021) and [14]) provides external quality assessment/proficiency testing for all major aspects of clinical laboratory testing including “Minimal Residual Disease for ALL by flow cytometry”. UK NEQAS ships stabilized blood samples spiked with B-ALL blasts to participating centers. Individual results are compared to the robust mean of all the submitted values. UK NEQAS sends out two samples with different MRD values per trial and usually issues four wet trials per year. This program was used by the I-BFM network for EQA. The local results of all I-BFM centers submitting results were evaluated in a centralized manner by the I-BFM coordinating center in Vienna.

2.4. List-Mode-Data File Ring Trials (LMD Ring Trial)

Anonymized list-mode files were prepared and sent out by the I-BFM FLOW coordinating center in Vienna. A set of at least 20 files containing data of regenerating bone marrow samples with either negative MRD or different positive levels of MRD was sent to the participating laboratories once a year for blinded LMD file interpretation. The participating laboratories were members of the AIEOP-BFM or ALL-IC BFM consortia or cooperating laboratories. Reported MRD values that were either more than three times or less than one-third of the target MRD value (concordance margin, see [9]) were considered as false assessments. The target MRD value was calculated as a median from six experienced and matured AIEOP-BFM reference laboratories (Vienna, Monza (ITA), Padova (ITA), Prague (CZE), Kiel (GER) and Berlin (GER)). Laboratories, which reported false MRD values in >10%–≤25% of cases were flagged with a warning and reports with >25% false MRD values were considered to be critical.

2.5. Independent Data Survey

We conducted a survey among eight AIEOP-BFM reference laboratories to collect FCM-MRD-based risk classification data within the AIEOP-BFM-ALL 2009 trial. The following reference laboratories participated in this survey: Berlin, Monza, Padova, Prague, Vienna, Zurich (CH), Sydney (AUS) and the two reference laboratories of Israel (ISR). Details of all requested data can be found in Supplementary Materials File S6, “Data query”. According to the AIEOP-BFM ALL 2009 study protocol, the criteria for assignment of patients to either flow low risk (FLR), flow medium risk (FMR) or flow high risk (FHR) were defined as follows: percentage of blasts in the BM at day 15 of induction therapy ≤0.1% FLR, >0.1% <10% FMR, ≥10% FHR.

2.6. Statistical Analysis

The concordance of independent flow-risk estimate data was assessed using GraphPad Prism ver. 8.3.0 (GraphPad Software, San Diego, CA, USA) and Two-Way ANOVA (Tukey test for multiple comparison correction). We used http://www.analyticscalculators.com (accessed on 30 November 2021) for computation of exact test probabilities for 2 × 2 and 2 × 3 Contingency Tables [15,16].

3. Results

3.1. Twinning Maturation Program

In total, 682 samples were processed and analyzed by the 22 trainee laboratories using diagnostic methodology based on central standard operating procedures (SOPs) (see Supplementary Materials File S1–S3), and the data were sent to the training laboratories for review (Table 1).
Table 1

Overview of the I-BFM FLOW twinning program.

Overview
Nº of trainee labs § 22
Nº of expert trainer labs *7
Nº of samples reviewed682
Nº of samples required for maturation/lab25
Median needed [range]28 (25–57)
Mean needed [SD]31 (8.8)

§ Laboratories of the ALL IC-BFM FLOW network; * Laboratories of the i-BFM FLOW network.

According to the guidelines, a minimum of 25 locally and consecutively recruited samples per lineage (B- or T-ALL, Table S1) had to be assessed by the trainees without gross errors in the second half of the samples (i.e., in the last 12 samples). For each gross error, three additional samples had to be processed, analyzed and sent again for review. Overall, out of the 682 analyzed B-ALL samples, MRD assessment was erroneous in 11.6% of samples (Table 2). In total, laboratories required a median number of 28 (range 25–57) and a mean number of 31 ± 8.8 samples to reach maturation status (Table 1). The difference in the number or errors in the first (Series 1: sample 1–13) and second half (Series 2: sample 14–25) of the sample set was marginally not statistically significant (14.7% vs. 9.5%), but reached significance when considering the entire sample set (Series 1: sample 1–13 vs. Series 3: sample 14-pX) (14.7% vs. 9.3%; p < 0.05) (Table 2 and Figure 1). Sixteen laboratories achieved maturation status within thirty-one samples (which is the mean number of samples needed for maturation). In those 16 laboratories, there was a significant performance improvement between sample series 1 and 2 (11.1% vs. 3.1%, p < 0.01; Table S2 and Figure 1). Overall, the initial performance of the training laboratories was variable, and six laboratories required more than 31 samples to achieve maturation status. The continuous educational feedback during the twinning maturation process significantly improved the ability of the trainee laboratories to correctly detect and quantify MRD in B- (Table 2) and T-ALL (data not shown) samples.
Table 2

Twinning maturation program. The twinning maturation program significantly reduces the number of wrong MRD assessments after a first series of training samples.

All 22 Trainee Laboratories
Main Sample Set (Sample 1–25)Series 1 (Sample 1–13)Series 2 (Sample 14–25)Series 3 (Sample 14–pX) §
Total Nº of reported results550286264396
Nº of failures67422537
Failures [%]12.214.79.59.3
All samples (sample 1–57)
Total Nº of reported results682
Nº of failures79
Failures [%]11.6

§ pX = last sample (sample57 latest).

Figure 1

Twinning maturation program. (A) Number of failures by all 22 laboratories in the main sample set (samples 1–25) (B) Number of failures of the 16 laboratories that finished within a range of a maximum of 31 samples (31 samples = mean number of samples needed for maturation).

3.2. External Quality Assessment

The United Kingdom National External Quality Assessment Scheme (UK NEQAS) for leukocyte immunophenotyping provides external quality assessment (EQA)/proficiency testing (PT) programs for minimal residual disease for ALL by FCM. We collected performance data from 29 issued UK NEQAS ring trials (containing 2 samples/ring trial) from up to 38 participating I-BFM-FLOW laboratories, and monitored their performance over time by correlating local results of I-BFM centers with target votes (Figure 2). Target votes were defined as the median of all submitted I-BFM center results. Reported MRD values within the concordance margin were considered as concordant (i.e., either more than three times or less than 1/3 of the target MRD value [9]), whereas MRD values outside the concordance margin were defined as outliers. In total, results of 1679 samples have been submitted and compared, of which 487 samples were submitted by AIEOP-BFM laboratories (n = 9) and 1192 by ALL IC-BFM and other associated laboratories (n = 29). Generally, AIEOP-BFM laboratories had a significantly lower proportion of outliers (2.3%) than ALL IC-BFM/other associated laboratories (5.6%) (p = 0.0043), which is in concordance with the higher number of long-term experienced centers within the AIEOP-BFM group. The overall discordance rate was 4.6% (Table 3 and Figure 2). We also assessed the concordance of local versus target votes regarding MRD positivity or negativity, where positivity was defined as ≥10 MRD events among 3 × 105 nucleated events. Here, the concordance rate was 97%, specificity was 81% (141/175) and sensitivity was 99% (1488/1504) (Figure 2).
Figure 2

Correlation of the local MRD estimates of UK NEQAS samples analyzed by I-BFM-Flow laboratories. The target value represents the median of all submitted results. Red dots indicate outliers (i.e., MRD results outside the concordance margin, n = 78). Blue dots indicate concordant results (n = 1601). Positivity was defined as ≥10 MRD events among 3 × 105 nucleated events, i.e., >0.003% (pos, positive; neg, negative).

Table 3

Table summarizing participation and performance of AIEOP-BFM and ALL IC-BFM laboratories in UK NEQAS “Minimal residual disease for ALL by Flow cytometry” ring trials. Data evaluation was completed with each single ring trial by the I-BFM FLOW coordinating center in Vienna. In case of outliers or missing results, the local/individual center was contacted, and corrective measures were discussed. Note that the № of possible results is higher than the number of submitted results, because not all laboratories submitted all results.

Participation
Nº of issued trials a29
Nº of participating laboratories29–38
Nº of possible results1925
Nº of submitted results1679
Performance
AIEOP-BFMALL IC-BFM/othersTotal
Nº of submitted results48711921679
Nº of Outliers b116778
Outliers b [%]2.35.64.6
p = 0.0043 c

a Evaluation period: starting with trial number 131403 until 202103; b Outliers are definded as reported MRD levels ≥ 3× larger or smaller than the target value; i.e., MRD-level > half a log up or down in a log10-correlation (concordance margin); Target MRD levels: median of all reported values of the sample derived from all I-BFM laboratories submitting a result. c Statistics was performed using a 2 × 2 contingency table and Two sided Fisher’s test.

3.3. List Mode Data File Ring Trials (LMD Ring Trial)

For further continuous quality assessment and improvement, the performance of the I-BFM-FLOW laboratories was monitored regularly via LMD ring trials consisting of data files with either negative or varying levels of MRD from B- or T-ALL cases, or of data files from diagnosis for testing the ability of the reference laboratories for the correct lineage assessment of AL by immunophenotyping (see Table S3). The staining panel setup is shown in Table S1 (B- and T-ALL). Since the detection and quantification of MRD by FCM is particularly challenging in B-ALL in periods of heavy hematopoietic regeneration, while it is relatively straight forward at early timepoints of therapy (i.e., d15), a special focus was on MRD training at regenerative timepoints. A series of list mode data ring trials of B-ALL at regenerative time points (e.g., day 78 or after relapse therapies) is shown in Figure 3. LMD ring trials (RT) were organized once a year by the I-BFM-FLOW coordinating center in Vienna. The total number of participating laboratories (AIEOP-BFM, ALL IC-BFM as well as other associated laboratories) varied from 31 in the first RT 2016 up to 41 laboratories in RT 2020. In the first LMD ring trial, 19 out of 31 laboratories (61%) reported MRD values consistent with the target value, while four laboratories (13%) submitted >10% discordant values and eight laboratories (26%) were even considered critical with >25% discordant values (Figure 3, lower panel).
Figure 3

Regular ring trial tests using the list-mode file sent around reveal a significant performance improvement. LMD Files from 20 B-ALL cases at day 78 or after relapse therapies were sent out to all participating laboratories once a year. Laboratories, which reported false MRD values in >10%–≤25% of cases were flagged with a warning (yellow circle) and reports with >25% false MRD values were considered to be critical (red flag). Total contributing I-BFM labs contained AIEOP-BFM, ALL IC-BFM, and other associated laboratories. Statistical analysis was performed using a 3 × 2 contingency table and two-sided Fisher’s test.

Samples that were incorrectly assessed by several centers were subjected to error analysis (n = 24) (Table S4). Most of the repeatedly misinterpreted samples were regenerative BM with a particularly high number of early B cell progenitors (hematogones) (18 out of 24 discordant samples; 75%) resulting mostly in false positive assessments in 13 out of these 18 samples (72%). Additionally, false negative assessments (5/18) as well as inaccurate gating leading to lower or higher MRD levels (5/18) occurred in this group of samples. Along with these considerations, common reasons for failure were an insufficient theoretical separation of blasts from hematogones by classical leukemia-associated immunophenotype (LAIP) markers or maker combinations (7/24), the absence of mature B-cells that could serve as a normal reference population (6/24), negative or only dim expression of CD34 on early BCP1 in burst regeneration (4/24), the co-existence of BCP1 and blasts with only a few aberrations of the latter (4/24), a rare and/or heterogeneous phenotype of the blasts (2/24), or the presence of only very few MRD cells (2/24) (Table S4). The most common errors, together with solutions and suggestions for improvements, were presented and discussed at regular joint meetings and educational material was disseminated to all participants (Supplementary Materials File S7 “Educational Slides”). During these educational feedback meetings, guidelines, and suggestions for better distinguishing hematogones from aberrant blasts were provided. This educational feedback proved to be efficient and sustainable. In the second LMD ring trial (RT 2017), the overall performance of the laboratories significantly improved (p = 0.005). No laboratory submitted >25% discordant results while 8/32 laboratories (25%) were still assigned a warning flag (Figure 3, lower panel). In the last LMD ring trial (RT 2020), 93% of all laboratories succeeded, while no laboratory was assigned a critical status, and only three laboratories (7%) were assigned a warning flag. Of the ALL IC-BFM laboratories, only 54% passed successfully in RT 2016, whereas this number increased to 80% in RT 2017 and remained at high levels (range 68–84%) in RT 2018–2020 (Figure 3, middle panel and table). There was no significant improvement when considering only the AIEOP-BFM laboratories, although the percentage of laboratories successfully passing the LMD ring trial improved from 70% in 2016 (7 out of 10) to 100% in 2020 (Figure 3, upper panel). When assessing only those laboratories which were active from the beginning, results of the improvement status over time did not differ from the assessment of all participating laboratories (Figure S1). A representative example of an LMD ring trial summary with data files from B-ALL samples of regenerative timepoints is illustrated in Figure 4.
Figure 4

Plot from a representative list-mode file ring trial. RT 2020 comprised 20 regenerative bone marrow samples with either negative or varying levels of MRD. LMD files were sent out to 41 centers, all of which reported results. AIEOP-BFM n = 12, ALL IC-BFM = 19, and other I-BFM cooperating centers = 10. The target MRD value was calculated as a median from six experienced and matured AIEOP-BFM reference laboratories (Vienna, Monza, Padova, Prague, Kiel and Berlin). Two opportunities for educational intervention can be seen in this figure: (i) similar errors by four and six centers, respectively, in samples 14 and sample 4 representing a systematic error; and (ii) an accumulation of errors in single centers (see center 13 and 14) representing lack of experience.

3.4. Independent Data Concordance: A Retrospective Quality Control Measure

We tested the agreement of eight matured and highly experienced I-BFM centers regarding the distribution of patients in different FCM-based risk groups within the AIEOP-BFM-ALL 2009 trial (details on all requested data of the survey can be found in Supplementary Materials File S6 “Data query”). The percentage of ALL patients assigned to either the low-risk (FLR), medium-risk (FMR) or high-risk (FHR) group by FCM were highly concordant and there were no significant differences between the independent patient cohorts from the eight centers (Figure 5 (B-ALL) and Figure S2 (T-ALL)).
Figure 5

Homogeneity of B-ALL risk classification data of well-trained and matured centers within the AIEOP-BFM ALL 2009 trial. In a survey including nine AIEOP-BFM FLOW reference laboratories from eight countries, independent data of FCM-MRD-based risk classification of B-ALL on day 15 were collected and compared. Statistical analysis was performed using GraphPad Prism 8.3.0. FHR, Flow high risk; FLR, Flow low risk; FMR, Flow medium risk; INS, contains combined data from the two reference laboratories in Israel.

4. Discussion

This I-BFM-FLOW network report summarizes the comprehensive QC/QA program on FCM-MRD involving an unprecedented, intercontinental effort of two of the world’s largest pediatric ALL trial groups. As of November 2020, twenty-two trainee and seven expert trainer laboratories participated in a twinning maturation program and all trainee laboratories achieved maturation status, enabling them to report FCM-MRD results for clinical decision making without further control by partner laboratories according to the study group’s guidelines. Forty-one I-BFM-FLOW laboratories submitted results in the most recent LMD file send around (RT2020) and data on 1679 results of 29 issued external QA ring trials (UK NEQAS) of up to 38 laboratories participating per ring trial were collected. As shown in this report, these exercises were successfully developed, and maintained and controlled the proficiency of the many FCM laboratories involved in AIEOP-BFM and ALL IC-BFM trial conduct. The harmonization and alignment of the technical aspects of the FCM-MRD methodology, including machine settings, panel composition, sample preparation, as well as sample staining between different centers, are important prerequisites for concordant multicenter applications [9]. Several previous studies showed that the harmonization of technical and methodological aspects is feasible, even in an international context [5,9,10,12]. This is particularly important for large, internationally collaborative trials, such as the AIEOP-BFM-ALL and the ALL IC-BFM trial, where the results of the different laboratories must be concordant for the correct stratification and risk-adapted treatment of patients. While the technical aspects of the FCM-MRD method can be well harmonized, the human factor, i.e., the interpretation of the electronic files of the acquired sample, is the actual Achilles heel of the methodology and strongly relies on operator skills and expert knowledge [9,12,17]. Hence, discrepancies in MRD values between laboratories can be due to wrong interpretations of the data [9,12]. This becomes especially apparent in the case of discrepancies in the results of FCM data file exchanges, where preanalytical and acquisition issues do not play a role. Hence, we based the continuous quality control on two major cornerstones: firstly, the LMD file ring trials, where we focused on the assessment of the analysis performance of each laboratory; and, secondly, on the external UK NEQAS quality control program that allows the performance of the whole methodology to be assessed. Our LMD file ring trials also contain an educational component, where problematic cases are discussed in joint meetings. As part of such meetings, suggestions and solutions for the proper identification of the aberrant cells are provided. In the LMD ring trials, strong BCP regeneration in the bone marrow was found to be the main cause of incorrect MRD assessment. In most cases, these were false positive assessments because normal B cell progenitors (hematogones) were wrongly mistaken for leukemic cells. Therefore, we specifically selected cases with different stages of regeneration and different MRD levels for the LMD ring trials to train the laboratories, especially on such challenging samples. Although BCP regeneration does normally not occur at day 15 or at day 33, it becomes relevant at day 78 of BFM-type induction or during later follow-up time points [4]. While most previous studies were conducted using four-color techniques, panels with ≥6 colors significantly improved the specificity and sensitivity of MRD analysis [18,19,20] and the use of a single-tube with the addition of markers such as CD38, frequently allowed for a reliable differentiation between hematogones and leukemic cells [19,20]. Our approach has some limitations. There are situations where standard panels do not allow for a correct identification of MRD events, even by the experts themselves, unless additional markers, which are useful for such rare cases, are deployed for a better distinction of aberrant cells [21]. In addition, time and staff constraints form a bottleneck for the continued QC and training necessities in a large consortium such as the I-BFM FLOW network. Therefore, approaches with the aim of supporting the operator by using an objective, automated, FCM-analysis tool based on machine learning techniques are necessary and are already being developed [17,22,23,24]. Such a tool may not only relevantly decrease technical variability inherent in manual gating strategies but may also serve as a virtual training partner in twinning programs, such as those presented here to reduce the need for the laborious and costly face-to-face trainings or remote educational feedback loops of trainees with experts. With respect to the survey of independent on-trial data, it does not allow for short-term quality control and immediate intervention and improvements, however, can be used as retrospective evidence of the reliability of the data.

5. Conclusions

Our extensive training and quality assurance program contribute significantly to improving and continuously ensuring the performance of individual laboratories. Of note, the structure based on a coordinating center responsible for quality oversight and feedback proved particularly effective and relevant. Overall, this led to a very high inter-laboratory concordance rate of FCM-MRD assessments and to a high conformity of risk estimates in independent patient cohorts. This is essential for conducting large ALL treatment trials which use multi-laboratory FCM-MRD assessments for stratification. “The I-BFM FLOW-network is open to further national or trial-based reference laboratories to join the quality program for providing excellent, quality-assured diagnostics to young patients with acute leukemia and for promoting collaborative research.”
  23 in total

1.  Note on an exact treatment of contingency, goodness of fit and other problems of significance.

Authors:  G H FREEMAN; J H HALTON
Journal:  Biometrika       Date:  1951-06       Impact factor: 2.445

2.  Quality control of flow cytometry data analysis for evaluation of minimal residual disease in bone marrow from acute leukemia patients during treatment.

Authors:  Elisabet Björklund; Irma Matinlauri; Anne Tierens; Susanne Axelsson; Erik Forestier; Stefan Jacobsson; Asa Jeppsson Ahlberg; Goran Kauric; Pentti Mäntymaa; Liv Osnes; Tarja-Leena Penttilä; Hanne Marquart; Eeva-Riitta Savolainen; Sanna Siitonen; Kerstin Torikka; Joanna Mazur; Anna Porwit
Journal:  J Pediatr Hematol Oncol       Date:  2009-06       Impact factor: 1.289

3.  Automated Flow Cytometric MRD Assessment in Childhood Acute B- Lymphoblastic Leukemia Using Supervised Machine Learning.

Authors:  Michael Reiter; Markus Diem; Angela Schumich; Margarita Maurer-Granofszky; Leonid Karawajew; Jorge G Rossi; Richard Ratei; Stefanie Groeneveld-Krentz; Elisa O Sajaroff; Susanne Suhendra; Martin Kampel; Michael N Dworzak
Journal:  Cytometry A       Date:  2019-07-07       Impact factor: 4.355

4.  A QA Program for MRD Testing Demonstrates That Systematic Education Can Reduce Discordance Among Experienced Interpreters.

Authors:  Michael Keeney; Brent L Wood; Benjamin D Hedley; Joseph A DiGiuseppe; Maryalice Stetler-Stevenson; Elisabeth Paietta; Gerard Lozanski; Adam C Seegmiller; Bruce W Greig; Aaron C Shaver; Lata Mukundan; Howard R Higley; Caroline C Sigman; Gary Kelloff; J Milburn Jessup; Michael J Borowitz
Journal:  Cytometry B Clin Cytom       Date:  2017-05-05       Impact factor: 3.058

5.  Standardized flow cytometry for highly sensitive MRD measurements in B-cell acute lymphoblastic leukemia.

Authors:  Prisca Theunissen; Ester Mejstrikova; Lukasz Sedek; Alita J van der Sluijs-Gelling; Giuseppe Gaipa; Marius Bartels; Elaine Sobral da Costa; Michaela Kotrová; Michaela Novakova; Edwin Sonneveld; Chiara Buracchi; Paola Bonaccorso; Elen Oliveira; Jeroen G Te Marvelde; Tomasz Szczepanski; Ludovic Lhermitte; Ondrej Hrusak; Quentin Lecrevisse; Georgiana Emilia Grigore; Eva Froňková; Jan Trka; Monika Brüggemann; Alberto Orfao; Jacques J M van Dongen; Vincent H J van der Velden
Journal:  Blood       Date:  2016-11-30       Impact factor: 22.113

6.  Immunological detection of minimal residual disease in children with acute lymphoblastic leukaemia.

Authors:  E Coustan-Smith; F G Behm; J Sanchez; J M Boyett; M L Hancock; S C Raimondi; J E Rubnitz; G K Rivera; J T Sandlund; C H Pui; D Campana
Journal:  Lancet       Date:  1998-02-21       Impact factor: 79.321

7.  Multiparameter phenotype mapping of normal and post-chemotherapy B lymphopoiesis in pediatric bone marrow.

Authors:  M N Dworzak; G Fritsch; C Fleischer; D Printz; G Fröschl; P Buchinger; G Mann; H Gadner
Journal:  Leukemia       Date:  1997-08       Impact factor: 11.528

8.  Standardization of flow cytometric minimal residual disease evaluation in acute lymphoblastic leukemia: Multicentric assessment is feasible.

Authors:  Michael Norbert Dworzak; Giuseppe Gaipa; Richard Ratei; Marinella Veltroni; Angela Schumich; Oscar Maglia; Leonid Karawajew; Allessandra Benetello; Ulrike Pötschger; Zvenyslava Husak; Helmut Gadner; Andrea Biondi; Wolf-Dieter Ludwig; Giuseppe Basso
Journal:  Cytometry B Clin Cytom       Date:  2008-11       Impact factor: 3.058

9.  Risk of relapse of childhood acute lymphoblastic leukemia is predicted by flow cytometric measurement of residual disease on day 15 bone marrow.

Authors:  Giuseppe Basso; Marinella Veltroni; Maria Grazia Valsecchi; Michael N Dworzak; Richard Ratei; Daniela Silvestri; Alessandra Benetello; Barbara Buldini; Oscar Maglia; Giuseppe Masera; Valentino Conter; Maurizio Arico; Andrea Biondi; Giuseppe Gaipa
Journal:  J Clin Oncol       Date:  2009-10-05       Impact factor: 44.544

10.  Automated analysis of acute myeloid leukemia minimal residual disease using a support vector machine.

Authors:  Wanmao Ni; Beili Hu; Cuiping Zheng; Yin Tong; Lei Wang; Qing-Qing Li; Xiangmin Tong; Yong Han
Journal:  Oncotarget       Date:  2016-11-01
View more
  2 in total

1.  One-point flow cytometric MRD measurement to identify children with excellent outcome after intermediate-risk BCP-ALL: results of the ALL-MB 2008 study.

Authors:  Alexander Popov; Guenter Henze; Julia Roumiantseva; Oleg Budanov; Mikhail Belevtsev; Tatiana Verzhbitskaya; Elena Boyakova; Liudmila Movchan; Grigory Tsaur; Maria Fadeeva; Svetlana Lagoyko; Liudmila Zharikova; Natalia Miakova; Dmitry Litvinov; Olga Khlebnikova; Olga Streneva; Elena Stolyarova; Natalia Ponomareva; Galina Novichkova; Larisa Fechina; Olga Aleinikova; Alexander Karachunskiy
Journal:  J Cancer Res Clin Oncol       Date:  2022-09-28       Impact factor: 4.322

Review 2.  MRD Monitoring by Multiparametric Flow Cytometry in AML: Is It Time to Incorporate Immune Parameters?

Authors:  Ilias Pessach; Theodoros Spyropoulos; Eleftheria Lamprianidou; Ioannis Kotsianidis
Journal:  Cancers (Basel)       Date:  2022-09-01       Impact factor: 6.575

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.