Literature DB >> 26046539

Developing Molecular Signatures for Chronic Lymphocytic Leukemia.

Edouard Cornet1, Agathe Debliquis2, Valérie Rimelen2, Natacha Civic3, Mylène Docquier3, Xavier Troussard1, Bernard Drénou2, Thomas Matthes4.   

Abstract

Chronic lymphocytic leukemia (CLL) is a clonal malignancy of mature B cells that displays a great clinical heterogeneity, with many patients having an indolent disease that will not require intervention for many years, while others present an aggressive and symptomatic leukemia requiring immediate treatment. Although there is no cure for CLL, the disease is treatable and current standard chemotherapy regimens have been shown to prolong survival. Recent advances in our understanding of the biology of CLL have led to the identification of numerous cellular and molecular markers with potential diagnostic, prognostic and therapeutic significance. We have used the recently developed digital multiplexed gene-expression technique (DMGE) to analyze a cohort of 30 CLL patients for the presence of specific genes with known diagnostic and prognostic potential. Starting from a set of 290 genes we were able to develop a molecular signature, based on the analysis of 13 genes, which allows distinguishing CLL from normal peripheral blood and from normal B cells, and a second signature based on 24 genes, which distinguishes mutated from unmutated cases (LymphCLL Mut). A third classifier (LymphCLL Diag), based on a 44-gene signature, distinguished CLL cases from a series of other B-cell chronic lymphoproliferative disorders (n = 51). While the methodology presented here has the potential to provide a "ready to use" classification tool in routine diagnostics and clinical trials, application to larger sample numbers are still needed and should provide further insights about its robustness and utility in clinical practice.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26046539      PMCID: PMC4457530          DOI: 10.1371/journal.pone.0128990

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Chronic lymphocytic leukemia (CLL) is the most common leukemia in the Western world. Diagnosis is based on the results of flow cytometric analysis of malignant B cells obtained from peripheral blood, bone marrow, lymph nodes, and other organs. The characteristic phenotype is defined by the combination of several surface markers (CD5, CD19, CD20, and CD23), and the Royal Marsden Hospital (RMH) score is widely used to distinguish CLL from other B-cell chronic lymphoproliferative disorders (B-CLPD) [1]. The clinical course is highly heterogeneous, with some patients dying from their disease within months, while others have a normal life expectancy. Predicting the disease outcome is therefore very helpful in patient management and therapeutic decision-making. Over the past decade, several prognostic markers based on genetic, phenotypic, or molecular characteristics of CLL B cells have thus been added to the original staging systems of Rai and Binet (reviewed by Chiorazzi; 2012) [2]. One of the best-studied markers is the immunoglobulin variable region heavy chain (IgVH) mutation status. In fact, roughly one half of CLL cases exhibit somatically mutated variable heavy chain genes and their presence correlates with a less aggressive clinical course [3],[4]. Specific cytogenetic changes have also been associated with unfavorable outcome (e.g.: deletion 17p) or, alternatively, with improved survival (e.g.: isolated deletion 13q) [5]. At diagnosis, the determination of the correct type of B cell disease associated with a precise outcome prediction currently depends on the interpretation of flow cytometry, cytogenetic and molecular analyses by the corresponding experts, i.e. hematologists, cytogeneticists and pathologists. These methods are costly, and labor- and time-intensive. Cheaper, objective and rapid techniques are therefore warranted. Recently, the company NanoString has developed a new high-throughput RNA expression profiling system (nCounter), which allows the direct digital readout of hundreds of mRNA molecules and their relative abundance using small amounts of total RNA (100 ng), without requiring cDNA synthesis or enzymatic reactions (DMGE; digital multiplexed gene expression) [6]. Several groups have shown high correlation with standard Affymetrix-type profiling and with quantitative RT-PCR, and have also applied this technology to mRNA extracted from Formalin-Fixed Paraffin-Embedded (FFPE) material [7],[8]. DMGE was also applied successfully to the classification of GC and ABC subtypes in diffuse large B cell lymphoma [8]. Our group has observed a high correlation between mRNA measured by DMGE and protein levels in a cohort of acute leukemia patients [9]. Using this technique for the study of a cohort of CLL samples we were able to develop a molecular signature for the diagnosis of CLL (LymphCLL Diag) as well as for the distinction between unmutated and mutated cases (LymphCLL Mut). This analysis is technically simple, can be run in every small hospital, and should be ideal for the use in clinical trials as well as for normal routine diagnosis.

Material and Methods

Patients and patient characteristics

Fresh peripheral blood (PB) samples were obtained from 30 patients of the Hematology Services of the Geneva, Mulhouse and Caen hospitals, The Ethics Committee of the Hospital of Geneva as well as of the Hospitals of Caen and Mulhouse have approved this research. Written informed consent was obtained from all patients. All patient data was analyzed anonymously. From each sample white blood cells were either (a) lysed directly in RNA lysis buffer (Qiagen, Venlo, Netherlands) and stored at -80°C, or (b) resuspended in DMSO, stored in liquid nitrogen, thawed for the present study, and then put into RNA lysis buffer (Mulhouse), or (c) lysed in RNA lysis buffer, followed by RNA extraction and storage at -80°C (Caen). For each patient a hematologic work-up was performed and the diagnosis of CLL established according to standard diagnostic guidelines ([10]). Flow cytometric analysis on CLL B cells was done for CD5, CD19, CD20, CD23, CD43, CD200, as well as for CD38 and ZAP70 expression, karyotype analysis for the detection of del17p, del13q, trisomy 12. The analysis of the IGHV-D-J mutation status was performed on genomic DNA after isolation of leukemic cells on a Ficoll gradient. PCR amplification of IgH rearrangements was performed with either family-specific VH leader primers [11] or FR1 primers, using the BIOMED-2 protocol [12]. PCR amplicons were subjected to direct sequencing on both strands. Sequence data were analyzed using the IMGT database and the IMGT/V-QUEST tool (http://www.imgt.org). Only productive rearrangements were evaluated. VH sequences with a germline homology of 98% or higher were considered as unmutated, and those with a homology less than 98% were considered as mutated [13]. All the relevant patient information is presented in S1 Table. Normal blood samples were obtained from blood donors of the Geneva blood transfusion center. Pure CD19+ B cells were prepared from four Ficoll-enriched normal blood samples using a Selection Kit from Stemcell Technologies, according to manufacturer’s instructions. Purity of the isolated cell populations was verified by flow cytometry with specific anti-CD19 and anti-CD20 antibodies and was >95% in all cases (data not shown). Additional samples for this study were obtained from the three centers from a series of 51 patients with the following diagnoses: 20 mantle cell lymphoma (MCL), 22 marginal zone lymphoma (MZL) including 8 splenic marginal zone lymphoma (SMZL) with villous lymphocytes, 4 follicular lymphoma (FL), 5 hairy cell leukemia (HCL). RNA from these samples was processed in the same way as from the CLL samples described above.

mRNA Analysis

For the analysis with the nCounter system either 250 ng of extracted mRNA or mRNA in lysis buffer, corresponding to the equivalent of 105 cells, was used, according to the manufacturer’s protocol (Nanostring H Technologies, Seattle, WA, USA). In brief, 4 μl of cell lysate or extracted mRNA was hybridized with the Nanostring CodeSet overnight at 65°C. Probes for the analysis of 290 different antigens were synthesized by NanoString technologies, including probes for nine normalization genes (S2 Table). After probe hybridizations and NanoString nCounter digital reading, counts for each mRNA species were extracted, analyzed using a homemade Excel macro, and then expressed as counts (molecules of mRNA/ sample), as described previously [14]. The nCounter CodeSet contained two types of built-in controls: positive controls (spiked mRNA at various concentrations to assess the overall assay performance), and negative controls (alien probes for background calculation). Data handling and analysis was performed as described: background correction consisted of the subtraction of the negative control average plus two SD from the original counts. To select adequate normalization genes from the series of nine candidates included in the CodeSet (ACTB, TBP, RPL19, RPLP0, G6PD, ABCF1, B2M, TPT1, RPS23), their relative stability was evaluated using geNorm-method [15]. For the final normalization of the sample values the geometric mean of the counts obtained for the three selected normalization genes (RPL19, RPLP0 and TPT1) was calculated and used as normalization factor. The technical specificities of the NanoString technology (linearity, reproducibility, sensitivity, etc.) have all been previously described [6],[14],[9]. The data have been submitted to GEO and can be accessed via Access Number GSE66425).

Establishing a gene list for mRNA analysis

We performed an extensive literature search and extracted a set of 290 genes from published articles and public databases, satisfying one of the following criteria: reported to be overexpressed in normal B cells compared to other blood cells; to be over- or under-expressed in CLL samples compared to normal blood samples; to be over- or under-expressed in CLL samples compared to other B-CLPD (S2 Table). In total, a set of 299 genes (290 genes + 9 normalization genes) was used to study the mRNA profile in 5 normal peripheral blood (PB) samples, 4 purified B cell samples, and 30 samples from patients with CLL.

Statistical analysis

Microsoft Excel, GraphPadPrism and Partek Genomics Suite software packages were used for statistical calculations and data presentation; p-value< 0.05. An arbitrary cut-off was chosen to describe a gene as being over- or under-expressed in comparisons between two patient cohorts: ≥50 counts by DMGE, a ≥2-fold change in expression (mean of population 1 divided by mean of population 2 ≥2 or ≤2), with a p-value ≤ 0.05, using the student t-test.

Results

Numerous gene expression profiling studies have been performed during the last two decades on B-CLPD, and CLL in particular, based on microarray technologies. We set out to test a recently developed DMGE method for its potential utility in CLL diagnostics and prognosis determination.

Genes expressed preferentially by normal B-cells

Pure B cell samples were compared to samples from normal PB. In order for a gene to be considered by normal B cells to be preferentially expressed compared to other blood cells, samples with purified B cells were prepared as described above and compared to samples from normal PB. Out of 290 genes 99 genes fulfilled the criteria of an expression level ≥50 counts, a ≥2-fold change in expression (= mean of the four pure B cell samples divided by the mean of the five normal PB samples ≥2), with a p-value ≤0.05. As expected, this list contained genes coding for B-cell specific surface antigens, namely CD19, CD20 and CD79, as well as for B-cell specific transcription factors, like PAX5 and SOX11, or for immunoglobulin heavy and light chains (Table 1, for the complete list of the 99 genes see S3 Table).
Table 1

Genes expressed preferentially by B cells.

GenesMeanMeanRatiop-value
normal PBpure B cellspure B/normal PB
Surface Markers
CD268/BAFF-R940.2539402.5441.90.001
CD833282.52136406.9141.60.033
CD79A3447.71119505.0234.70.003
CD695409.08181679.9633.60.010
CD23/FCER2226.317362.7132.50.000
CD202112.3266914.3331.70.000
CD22824.9023927.7629.00.002
CD19607.5217177.1228.30.003
CD40/ TNFRSF5541.6512031.2522.20.000
CD200142.543101.5421.80.002
CD79B181.903672.5720.20.039
CD267/TACI167.242259.3513.50.004
CD180443.953135.117.10.000
CD70/ CD27 ligand67.95476.847.00.007
CD32/FCGR2B1652.006086.443.70.023
CD814430.1416162.383.60.007
CD124/IL4R4546.8214900.943.30.000
CD24919.002537.832.80.013
CD150288.06788.852.70.009
CD71/TFRC2572.306537.912.50.030
CD7451335.01121141.052.40.000
CD38496.211108.492.20.048
Transcription Factors
SOX111.48128.0986.70.011
PAX5267.7710912.4840.80.000
EBF19.86368.4037.40.021
IRF41106.867649.876.90.024
JUN6269.4028921.654.60.000
BCL21728.647690.484.40.003
MYC1640.935322.693.20.000
Immunglobulin genes
IGHD1071.8823185.9921.60.000
IGHM10313.33198106.2119.20.003
kappa11354.3470720.316.20.000
lambda17487.7364435.643.70.002

Listed are genes with expression values > 50, a pure B cell/normal PB ratio > 2, with a p-value < 0.05.

Listed are genes with expression values > 50, a pure B cell/normal PB ratio > 2, with a p-value < 0.05. Fold changes with values of 20–40 corresponded to the ratio of B cells present in the pure B cell and peripheral blood samples (>95% versus 2–3%, respectively) and were found typically for B cell specific genes, like CD19, CD20 and CD22. Fold changes with values <20 correspond to genes which are expressed not only by B cells, but also by other peripheral blood cells, or to genes, for which the hybridization kinetics were not optimal.

Genes expressed preferentially in CLL samples compared to normal blood cells

A panel of 30 CLLs, with a mean of 80% malignant B cells/sample (range: 58–100% per sample) and characterized for their typical cell surface phenotype, and the presence or absence of IgVH mutations, was used for this study. In order to find among the 290 gene list genes that could differentiate most effectively between CLL and normal samples, we compared the mean of all gene counts from the 30 CLL samples to the mean from the normal PB samples, and to the mean of the pure B cell samples. As previously, we selected only genes that fulfilled the arbitrary criteria of an expression level ≥50 counts, a ≥2-fold change in expression with a p-value ≤0.05. Fig 1A shows the Venn diagram for the genes expressed preferentially in the three different sample groups.
Fig 1

Analysis of the expression of 290 genes in normal PB, pure B cell and CLL samples.

(A) Venn diagram of genes expressed preferentially in the different sample groups (normal PB, n = 5; pure B cells, n = 4; and CLL samples, n = 30). Genes were considered preferentially expressed by one sample group, if they showed an expression level ≥50 counts and a ≥2-fold difference in expression levels between the 2 groups, with a p-value ≤0.05. (B) Samples from PB (n = 5), B cell samples (n = 4), and samples from CLL patients (n = 30) were analyzed by PCA, based on the results of the differential expression of 290 genes. (C) PCA analysis on the same samples as in (b), but using a restricted set of 44 genes relevant for this purpose according to their differential expression in CLL, B cells and normal blood. (D) Heat map of normal PB, pure B cells, and CLL samples, analyzed with thirteen genes overexpressed homogenously (CV<5%) in all CLL samples compared to normal PB and pure B cells. Unsupervised analysis shows a perfect clustering of the CLL samples compared to the normal samples.

Analysis of the expression of 290 genes in normal PB, pure B cell and CLL samples.

(A) Venn diagram of genes expressed preferentially in the different sample groups (normal PB, n = 5; pure B cells, n = 4; and CLL samples, n = 30). Genes were considered preferentially expressed by one sample group, if they showed an expression level ≥50 counts and a ≥2-fold difference in expression levels between the 2 groups, with a p-value ≤0.05. (B) Samples from PB (n = 5), B cell samples (n = 4), and samples from CLL patients (n = 30) were analyzed by PCA, based on the results of the differential expression of 290 genes. (C) PCA analysis on the same samples as in (b), but using a restricted set of 44 genes relevant for this purpose according to their differential expression in CLL, B cells and normal blood. (D) Heat map of normal PB, pure B cells, and CLL samples, analyzed with thirteen genes overexpressed homogenously (CV<5%) in all CLL samples compared to normal PB and pure B cells. Unsupervised analysis shows a perfect clustering of the CLL samples compared to the normal samples. We obtained a list of 111 genes expressed preferentially in CLL samples compared to normal PB, and of 86 genes compared to pure B cells (S4 Table). 44 genes were common to both lists (Table 2A). Interestingly, this set of 44 genes contained genes well known to be overexpressed in CLL, like CD5, LPL and ROR1, but also kappa, lambda and IgG genes, showing that on a per-cell-basis CLL B cells produce more IgG mRNA than normal B cells.
Table 2

List of genes over- or under-expressed specifically by CLL B cells.

A: List of 44 genes expressed specifically by CLL B-cellsB: List of 28 genes specifically underexpressed in CLL compared to pure B cell samples
GenesMeanSDCVGenesFold changep-value
CLL/pure B cell
ABCA63783.31942.90.51
ADAM29482.5857.11.78BAFF-R/ CD2680.4260.0018
AICDA69.9143.72.06BANK10.2240.0042
BIK414.0426.01.03BIRC 30.4840.0070
BMI1607.6202.40.33CD220.3590.0036
BUB1B152.4118.60.78CD40/ TNFRSF50.4810.0000
CD20010131.94443.10.44CD690.1240.0105
CD249424.47446.10.79CD830.0860.0388
CD269/BCMA1857.31201.70.65CHL10.0970.0191
CD27/ TNFRSF78532.63512.40.41CHL1_v40.1440.0000
CD54039.11799.70.45CXCR4/ SDF-1R0.3180.0116
CHIT1243.2162.80.67EBF10.0310.0210
CLLU15605.312595.82.25EBI30.1720.0019
CNR1/ CB1357.5533.31.49EPB41L20.1330.0011
COL9A24955.22398.90.48SERPINA90.0330.0408
CTLA412573.811128.30.89IL60.0010.0032
CXCR31609.2849.20.53ITGA4/ CD49d0.4100.0000
DMD3625.73727.61.03JAM3/ JAM-C0.1290.0001
DNMBP4539.72102.50.46JUN0.4110.0002
FAIM3/ Toso44146.020379.20.46LGMN0.1140.0249
FCER2/ CD2315988.58133.00.51MMP120.0190.0003
FGF2707.3693.60.98MS4A1/ CD200.2930.0001
FGFR1554.3784.81.42MYC0.1980.0001
FILIP1L2284.91636.20.72NREP0.2800.0050
FLT3406.5512.41.26RAGE/ MOK0.4510.0086
FMOD17401.212667.90.73REL0.3350.0005
GNRH1518.1232.40.45SOX110.1890.0105
IGFBP44858.13366.40.69TIMP40.0110.0107
IGHG1 to 413373.711186.20.84ZFP360.4260.0056
IGSF31129.0692.70.61
IL2RA/ IL2R2205.91462.30.66
kappa128933.8120715.30.94
lambda142654.3170967.71.20
LEF17270.02323.70.32
LILRA41799.42540.11.41
LPL641.1681.21.06
RAPGEF3817.9457.30.56
RASGRF12960.01436.70.49
ROR12376.81107.60.47
Selectin P/ CD621067.6708.90.66
SEPT 10667.51305.71.96
SFMBT111065.53547.00.32
TTN7243.93010.20.42
WNT35240.65963.01.14

(A) List of 44 genes expressed specifically by CLL B cells. Expression of mRNA was compared between the mean of 30 CLL samples and the mean of 5 normal PB and 4 pure B cell samples. Listed are those genes that fulfill the following criteria: > 2-fold change in expression between CLL samples and normal PB samples and pure B cell samples; p-value < 0.05; expression level in CLL samples > 50. (B) List of genes underexpressed by CLL samples compared to pure B cell samples. Listed are those genes that fulfill the following criteria: < 0.5-fold change in expression between CLL samples and pure B-cell samples; p-value < 0.05; expression level in pure B cell samples > 50.

(A) List of 44 genes expressed specifically by CLL B cells. Expression of mRNA was compared between the mean of 30 CLL samples and the mean of 5 normal PB and 4 pure B cell samples. Listed are those genes that fulfill the following criteria: > 2-fold change in expression between CLL samples and normal PB samples and pure B cell samples; p-value < 0.05; expression level in CLL samples > 50. (B) List of genes underexpressed by CLL samples compared to pure B cell samples. Listed are those genes that fulfill the following criteria: < 0.5-fold change in expression between CLL samples and pure B-cell samples; p-value < 0.05; expression level in pure B cell samples > 50. As expected, principal component analysis (PCA) performed on all the 290 genes resulted in a clear separation of the samples according to their origin (pure B cells, normal PB or CLL; Fig 1B); restricting the PCA analysis to the 44 genes defined above resulted in a slightly different distribution, with normal PB and pure B cells clustered together, but with all CLL samples clearly separated from them (Fig 1C). A detailed analysis of the expression levels of these 44 genes showed a wide range of coefficients of variation (Table 2A). 13/44 genes were expressed at a highly similar expression level in all CLL samples, with a CV < 0.5 (BMI1, CD200, CD27, CD5, COL9A2, DNMBP, FAIM3, GNRH1, LEF1, RASGRF1, ROR1, SFMBP1, TTN) (S1 Fig). These thirteen genes with the lowest CV can therefore be used as a classifier, which allows unambiguously to separate CLL samples from normal PB samples, being restricted to genes preferentially and homogenously expressed in CLL B cells compared to normal B cells (Fig 1D). Several genes were also found to be specifically underexpressed in all CLL samples as compared to samples with pure B cells (Table 2B). These genes correspond to genes downregulated in CLL B cells compared to normal B cells, like IL-6, TIMP4 and MMP12, whose mRNA is absent in CLL B cells although normal B cells produce them in high amounts (mean: 1758.9; 92.7; 90.5 copies/sample, respectively).

Correlation between protein expression and mRNA expression

Flow cytometry allows the analysis of surface and intracytoplasmatic antigens of CLL B-cells. We studied the correlation between some antigens measured by routine flow cytometry (i.e.: CD19, CD20, CD5, CD23, CD38, CD200, kappa, lambda, ZAP70) and mRNA counts obtained by the nCounter measurements (Table 3). These expression levels corresponded to the protein expression levels detected by flow cytometry. In all the samples the CLL B cells strongly expressed CD5, CD23, CD43 and CD200 on their surface, whereas CD20 expression was found decreased compared to normal B cells.
Table 3

Quantification of mRNA transcripts for surface proteins.

GenesMeanMeanMeanRatio CLL/normal PBRatio CLL/pure B cellsp-value
normal PBpure B cellsCLL
CD431886.1717.201253.70.772.90.000increased expression compared to normal B cells
CD51782.16250.994039.12.316.10.000increased expression compared to normal B cells
CD200142.543101.5410131.971.13.30.000increased expression compared to normal B cells
FCER2/ CD23226.317362.7115988.570.62.20.000increased expression compared to normal B cells
CD79B181.903672.574665.025.61.30.421equal expression compared to normal B cells
CD19607.5217177.1219164.631.51.10.480equal expression compared to normal B cells
CD38496.211108.49860.81.70.80.760equal expression compared to normal B cells
CD79A3447.71119505.0264178.818.60.50.016decreased expression compared to normal B cells
CD22824.9023927.768597.410.40.40.004decreased expression compared to normal B cells
MS4A1/ CD202112.3266914.3319628.99.30.30.000decreased expression compared to normal B cells
Correlating the CD38 mRNA counts with the results from flow cytometry (S1 Table) resulted in a correlation coefficient of 0.53, similar to values of correlation coefficients found in a previous study on surface antigens in acute myeloid leukemia (AML) blasts [9] (Fig 2A). We then compared the mRNA counts in the cohort of CD38pos CLL patients with the CD38neg cohort, and the ZAP70pos cohort to the ZAP70neg one: the expected results were found, with CD38pos and ZAP70pos B-cell samples exhibiting higher mRNA counts then the corresponding negative samples (Fig 2B). Interestingly, the comparison with normal B cells gave different results for CD38 and ZAP70: normal B cells, although expressing low surface CD38 protein, showed higher CD38mRNA counts than the CD38neg CLL samples, but lower ZAP70 mRNA counts than the ZAP70neg CLL samples (Fig 2B).
Fig 2

Correlation between mRNA and protein expression.

(A) Correlation between CD38 protein expression, as measured by flow cytometry (% positive CLL B cells) and CD38 mRNA counts, as measured by the nCounter (arbitrary units). (B) Quantification of CD38 and ZAP70 mRNA counts in normal peripheral blood (PB), pure B cells, and in CLL B cells. CLL B cells were analyzed for ZAP70 and for CD38 expression by flow cytometry and then grouped for the mRNA determination, according to presence or absence of these two antigens. (C) Immunoglobulin light chain ratios in 30 CLL patient samples (rhombi) and in normal B cells (triangles: normal PB samples; circles: pure B cell samples). The mean +/- 2SD interval for ratios from polyclonal normal B cells and normal PB is shown (mean 0.89; SD = 0.22; small dots).

Correlation between mRNA and protein expression.

(A) Correlation between CD38 protein expression, as measured by flow cytometry (% positive CLL B cells) and CD38 mRNA counts, as measured by the nCounter (arbitrary units). (B) Quantification of CD38 and ZAP70 mRNA counts in normal peripheral blood (PB), pure B cells, and in CLL B cells. CLL B cells were analyzed for ZAP70 and for CD38 expression by flow cytometry and then grouped for the mRNA determination, according to presence or absence of these two antigens. (C) Immunoglobulin light chain ratios in 30 CLL patient samples (rhombi) and in normal B cells (triangles: normal PB samples; circles: pure B cell samples). The mean +/- 2SD interval for ratios from polyclonal normal B cells and normal PB is shown (mean 0.89; SD = 0.22; small dots). Analysis of the kappa/lambda ratio at the mRNA and protein level showed a 100% correlation between flow cytometry results and mRNA counts, and all CLL cases, which were monoclonal by cytometry also showed abnormal ratios in their mRNA counts (Fig 2C).

Analysis of mutated and unmutated cases of CLL

One of the most significant prognostic factors identified in CLL that ultimately ties to the biology of disease is the mutational status of the variable region of the immunoglobulin heavy chains. We determined the mutation status in our cohort of patients and found 11 patients with mutated and 17 patients with unmutated IgVH, 2 patients were borderline (S1 Table). In order to determine which genes were correlated with the mutation status, we compared mutated to unmutated samples and listed genes either overexpressed in mutated versus unmutated, or in unmutated compared to mutated samples (Table 4). 24 genes were found to be differentially expressed: nineteen genes were overexpressed in unmutated, and five in mutated samples. Among the differentially expressed genes were nine genes from the 44-gene-list, which we used to distinguish CLL from normal samples, as well as several genes described in the literature: CD38, ZAP-70, LPL, etc. ([4],[16],[17],[18]; see also Table 4).
Table 4

List of genes distinguishing mutated from unmutated CLL samples.

Genes upregulated in unmutated IgVH
GenesmutatedunmutatedRatiop-valueLiterature Reference
IgVHIgVHunmut/mut
MeanSDMeanSD
SEPT 10102011721556121.00.00866[19]
AICDA10121174106.50.01562[20,21]
LDOC171436823355.40.00001[22] [23]
FARP134548318.40.02412
LPL9189107262011.70.00001[24] [25] [18] [23] [26]
CNR1507158261611.60.00413[27]
CD381371191429211910.40.02506[4] [17]
DMD14211718537939313.80.00115[19] [23]
CEACAM11941655905403.00.02355
CRY150157113804502.80.00007[28] [29]
ZAP701341563348212092.60.00001[4] [17] [16]
ITGA4600318152910792.50.00632[30]
TCL1A152851460838680184352.50.00048[19]
CD2660631511052.50.00429[31] [32]
CHIT11351003251572.40.00045
IGHM1452691317683338811384912.30.00494[33]
EPB41L21361163052932.20.03937
VPREB325301420550518732.20.00265
ABCA622951010464318122.00.00351
Genes upregulated in mutated IgVH
mutatedunmutatedRatiop-value
IgVHIgVHmut/unmut
MeanSDMeanSD
CTLA41837013614850366862.20.03908
RARA3398305611855862.90.03850
LGMN1149438383.00.03412
CD150166511174903263.40.00781[34] [35]
ADAM29106110916625916.00.00803[24] [19]

Genes with an expression level > 50, and a ratio mut/unmutated or unmutated/mutated > 2 with a p-value < 0.05 are shown; References from the literature for each cited gene are given, when available.

Genes with an expression level > 50, and a ratio mut/unmutated or unmutated/mutated > 2 with a p-value < 0.05 are shown; References from the literature for each cited gene are given, when available. This 24-gene panel called “LymphCLL Mut” allowed a clear distinction between both types of CLL (Fig 3; S2 Fig).
Fig 3

Characterization of different types of CLL samples using PCA analysis.

Samples from PB (n = 5), B cell samples (n = 4), and samples from CLL patients (n = 30) were analyzed by PCA, based on the results of the differential expression of 44 genes. Mutated CLL cases are shown in blue (n = 11), unmutated in green (n = 17), and borderline cases in red (n = 2).

Characterization of different types of CLL samples using PCA analysis.

Samples from PB (n = 5), B cell samples (n = 4), and samples from CLL patients (n = 30) were analyzed by PCA, based on the results of the differential expression of 44 genes. Mutated CLL cases are shown in blue (n = 11), unmutated in green (n = 17), and borderline cases in red (n = 2). Total light chain production was increased in mutated vs unmutated samples (185207 counts vs 324812 counts; p = 0.008).

Analysis of LDOC1 expression

LDOC1 mRNA has been reported to be highly expressed in aggressive cases of CLL and to correlate with IgVH mutation status and with prognosis [22]. When we analyzed the mRNA expression in our 30 CLL cases, we found indeed a dichotomic distribution, completely different from the homogenous distribution, which we described in the thirteen genes used for the CLL classifier (S1 Fig). Interestingly, when we looked among all the 290 genes analyzed, only eight genes were found to correlate with LDOC1 expression: six genes with a positive correlation (SEPT10, LPL, CD26, EPB41L2, CXCR6, CRY1) and two genes with a negative correlation (ADAM29, CD150; Table 5; S3 Fig). ADAM29 was exclusively expressed in samples with absent/low LDOC1 expression and vice versa (Fig 4A). Superficially, this expression pattern corresponded to the IgVH mutation status of these samples, but a closer inspection yielded a group of five samples with absence of both LDOC1 and ADAM29 mRNAs (two mutated and three unmutated cases). The LDOC1/ADAM29 ratio clearly reflects this separation into three different groups of samples. Interestingly, in a previous report Oppezzo et al have published the LPL/ADAM29 ratio as a surrogate marker for IgVH status [24]. Comparing this ratio in our samples to the IgVH status showed concordance in 27/30 samples; the three discordant samples corresponded to samples with absent LDOC1 or ADAM29 expression (Fig 4B).
Table 5

LDOC1 expression in CLL samples.

GenesMeanMeanRatiop-value
LDOC1 negLDOC1 posLDOC1 pos/neg
LDOC11483483.40.0000
SEPT 1068136520.20.0138
LPL15413318.60.0000
CD26/ DPP4451623.60.0023
EPB41L21323532.70.0280
CXCR633892.70.0374
CRY155114042.50.0001
Ratio
LDOC1 neg/pos
CD15013734623.00.0086
ADAM298322451.30.0096

CLL samples were separated into those highly expressing LDOC1 and those with absent/low expression. Listed are those genes that fulfill the following criteria: >2-fold change in expression between LDOC1 pos samples compared to LDOC1 neg samples, with a p-value < 0.05 or >2-fold change in expression between LDOC1 neg samples compared to LDOC1 pos samples, with a p-value < 0.05.

Fig 4

Analysis of LDOC1 expression.

(A) LDOC1 mRNA expression was measured in 30 CLL samples and samples ordered according to absent/low (No 1–17) or high expression (No 18–30); black columns. ADAM29 mRNA expression shows an inverse pattern (No 1–12); hashed columns. Mutation status is noted below (BL = borderline; M = mutated; UM = unmutated IgVH) (B) Analysis of LDOC1/ADAM29 and LPL/ADAM29 ratios.

Analysis of LDOC1 expression.

(A) LDOC1 mRNA expression was measured in 30 CLL samples and samples ordered according to absent/low (No 1–17) or high expression (No 18–30); black columns. ADAM29 mRNA expression shows an inverse pattern (No 1–12); hashed columns. Mutation status is noted below (BL = borderline; M = mutated; UM = unmutated IgVH) (B) Analysis of LDOC1/ADAM29 and LPL/ADAM29 ratios. CLL samples were separated into those highly expressing LDOC1 and those with absent/low expression. Listed are those genes that fulfill the following criteria: >2-fold change in expression between LDOC1 pos samples compared to LDOC1 neg samples, with a p-value < 0.05 or >2-fold change in expression between LDOC1 neg samples compared to LDOC1 pos samples, with a p-value < 0.05.

Validating the CLL classifier

In order to develop a clinically useful classifier, CLL samples not only have to be distinguished unambiguously from normal PB samples, but also from other lymphoma subtypes. We therefore tested the 44 genes found to distinguish CLL from normal PB samples on a series of 51 patients with different B-CLPD, i.e., MCL, MZL, FL and HCL. The PCA analysis showed a clear separation of the CLL from all the other B-CLPD samples, with the exception of one confirmed CLL case, which was misdiagnosed (Fig 5). None of the B-CLPD samples was misdiagnosed as a CLL.
Fig 5

Comparison of CLL to other B-CLPD using PCA.

Samples from CLL patients (n = 30) and of other B-CLPD (n = 51) were analyzed by PCA, based on the results of the differential expression of the selected 44 genes.

Comparison of CLL to other B-CLPD using PCA.

Samples from CLL patients (n = 30) and of other B-CLPD (n = 51) were analyzed by PCA, based on the results of the differential expression of the selected 44 genes.

Discussion

In the present work we describe the use of DMGE, a recently developed technique for the quantitative and parallel analysis of hundreds of mRNAs, in a cohort of 30 CLL patients. Starting from a set of 290 genes with preferential expression in B cells and CLL cells described in previously published reports, we were able to establish lists of genes, with preferential expression in normal and in CLL B cells, respectively, and which allowed distinguishing unambiguously CLL samples from normal PB samples and CLL B cells from normal B cells. Restricting these gene lists to genes expressed homogenously by all CLL samples, independent of chromosomal abnormalities, yielded a classifier, based on the analysis of only 13 genes. Applying this classifier in an unsupervised analysis of our cohort resulted in a perfect separation of all 30 CLL samples. Adding kappa/lambda ratios to the classifier will certainly increase its discriminative power, since our results show for all cases a clear distinction between polyclonal samples (normal PB and sorted normal B cells) and CLL samples with essentially monoclonal B cell populations. In a previous study, we have already used DMGE in acute myeloid leukemia (AML) to correlate leukemic blast mRNA expression with surface antigens determined by flow cytometry [9]. The present study confirms close correlation between flow cytometry results and DMGE analysis for some surface proteins. The DMGE technique also allows the study of genes with prognostic relevance in parallel with the 13 gene diagnostic classifier, using a single assay. Interestingly, these genes fall broadly into two categories: those expressed with wide variations in different samples (up to 104 difference in mRNA expression; e.g.: LILR4 and CLLU1), and those with a present/absent, dichotomic pattern (e.g.: LDOC1, LPL, ADAM29). Analysis of the IgVH mutation status is widely used to distinguish patients with a good from those with a bad prognosis. Several surrogate markers have been described in the literature and shown to correlate with the IgVH mutation status. We could confirm most of them, such as ZAP70, CD38, LPL and LDOC1 (Table 4). On the contrary, we did not find any differential expression for the following genes FCRL2 (p = 0.08) and HS1 (p = 0.10), also reported to vary between mutated and unmutated CLL samples [36], [37]. In an unsupervised analysis 28/30 (93%) CLL samples were correctly classified using the “Lymph CLL Mut” classifier based on 24 genes with a differential expression between IgVH mutated and unmutated cases. The LPL/ADAM29 ratio has already been described previously to constitute a surrogate marker for the IgVH mutation status [24], and also to be related to prognosis [38]. Whereas this ratio distinguishes two different types of CLL samples, the determination of the LDOC1/ADAM29 ratio allowed distinction of 3 subclasses: IgVH mutated with high expression of ADAM29, unmutated samples with high expression of LDOC1, and a third category (mixed mutated and unmutated samples) without expression of LDOC1 and ADAM29. This third group did not show any common IgVH usage or chromosomal abnormalities. Future studies have to tell us whether there is any clinical significance or any existing correlations between this category and prognosis. In our final analysis we tested the 44-gene signature, which differentiated CLL from normal PB samples, on a set of 51 samples from patients with various common B-CLPD. Similar to the flow cytometric RMH score our “LymphCLL Diag” molecular classifier distinguished CLL from other B-CLPD with high sensitivity and specificity (97% and 100%, respectively). Associating the “LymphCLL Diag” gene panel with the “LymphCLL Mut” panel, the kappa/lambda ratio and the LDOC1/ADAM29 ratio, a complete diagnostic and prognostic procedure could be performed in one single “ready to use” assay, based on a panel of 61 genes. Several of the laboratory analyses described for diagnostic and prognostic purposes in CLL are time- and labor- intensive and not well suited for routine testing in most clinical laboratories. One example is the determination of the IgVH mutational status, which is rather expensive, needs specialized know-how, and is currently only performed in a restricted number of laboratories under the expertise of molecular biologists [39]. Threshold levels are arbitrary (in most reports > 2% are considered mutated) and a grey zone exists [39]. Another example is the ZAP70 expression analysis by flow cytometry, which has been largely abandoned due to difficulties in standardization [40] [41] or the ZAP expression analysis by RT-PCR, which requires purification of B cells prior to the assay [39], rendering this approach unsuitable for routine diagnostics. The new sequencing technologies also hold the promise to give valuable data for prognosis determination of CLL patients, most notably TP53, NOTCH1, ATM, SF3B1 and BIRC3 mutations [42] [43]. Mutations in these genes occur in approx. 2%-17% of CLL patients at diagnosis and the prognostic importance of some of them have already been studied in prospective trials [44] [45]. Whether this information is complementary to established prognostic factors and results from mRNA and gene expression studies like ours have ideally to be investigated in large future prospective and comparative trials. Although already widely used for research purposes deep sequencing techniques are not yet used in routine laboratories and the expensive, labor-intensive technology and bioinformatically complex softwares will make this transfer challenging. With the development of DMGE a new technique has arrived, which allows genetic profiling with the parallel analysis of hundreds of mRNAs by a technically extremely simple method. DMGE has a short turn-around time of < 2 days, needs minimal hands-on-time for technicians and is much less costly than whole gene expression profiling or deep sequencing. Moreover, this technique allows for automated data-analysis, and has a read-out, which is intuitive and does not need complicated bio-informatics tools for the analysis or interpretation. By focusing our approach on the analysis of a highly selected set of genes expressed preferentially by B cells, we could obtain signatures from the analysis of whole blood samples, rendering an additional B cell purification step unnecessary. The parallel quantitative analysis of tens to hundreds of mRNAs allows the integration of several diagnostic and prognostic factors in one assay, contrary to numerous studies from the past, which have only analyzed one or two factors at a time. It should be therefore ideally suited for large trials aiming at the comparison of multiple factors in many different patient samples and for molecular characterization of cases without available living cells for flow cytometry, such as cDNA or FFPE. Smaller labs could also profit from this approach for routine diagnostics based on an automated data analysis. To fully appreciate the clinical usefulness and discriminative power of this approach, prospective studies with a much larger number of CLL samples will have to be performed in the future, including samples with other B-CLPD and reactive/inflammatory conditions. Additional prognostic markers can be easily incorporated to the classifier and then be studied simultaneously in clinical trials, but also in routine diagnosis. Integrating information from gene profiling studies with results from genomic mutation and NGS analyses should ultimately lead to better prognostication schemes for patients.

Comparison of mRNA counts from normal PB, pure B cell and CLL samples.

Shown are 13 selected genes from the 44-gene list with a low CV < 0.5. (TIF) Click here for additional data file.

Heatmap of 30 CLL samples, analyzed with the 24-gene classifier for the recognition of mutated vs unmutated samples.

(TIF) Click here for additional data file.

Heatmap of CLL samples based on analysis of genes correlated with expression of LDOC1 mRNA (unsupervised analysis).

(TIF) Click here for additional data file.

Patient characteristics.

30 patients with a diagnosis of typical CLL were included in the study. Flow cytometry was used to determine the % of malignant CLL B cells/sample, and the % of CD38+ CLL B-cells; values ≥20% were considered positive. Mutation status of the IgVH was determined as described in Material and Methods section. ND = not done. (XLS) Click here for additional data file.

Probes for Nanostring analysis.

290 genes were chosen to constitute the CodeSet for this work, plus nine housekeeping genes. (XLSX) Click here for additional data file.

mRNA expression of 290 genes for all 30 CLL patient samples.

(XLSX) Click here for additional data file.

List of genes expressed preferentially by CLL samples compared to normal PB (A) and pure B cell samples (B).

(XLSX) Click here for additional data file.
  43 in total

1.  Combined PER2 and CRY1 expression predicts outcome in chronic lymphocytic leukemia.

Authors:  Lewin Eisele; Robert Prinz; Ludger Klein-Hitpass; Holger Nückel; Kerstin Lowinski; Jürgen Thomale; Lars C Moeller; Ulrich Dührsen; Jan Dürig
Journal:  Eur J Haematol       Date:  2009-06-02       Impact factor: 2.997

2.  Determining cell-of-origin subtypes of diffuse large B-cell lymphoma using gene expression in formalin-fixed paraffin-embedded tissue.

Authors:  David W Scott; George W Wright; P Mickey Williams; Chih-Jian Lih; William Walsh; Elaine S Jaffe; Andreas Rosenwald; Elias Campo; Wing C Chan; Joseph M Connors; Erlend B Smeland; Anja Mottok; Rita M Braziel; German Ott; Jan Delabie; Raymond R Tubbs; James R Cook; Dennis D Weisenburger; Timothy C Greiner; Betty J Glinsmann-Gibson; Kai Fu; Louis M Staudt; Randy D Gascoyne; Lisa M Rimsza
Journal:  Blood       Date:  2014-01-07       Impact factor: 22.113

3.  Molecular classification of mature aggressive B-cell lymphoma using digital multiplexed gene expression on formalin-fixed paraffin-embedded biopsy specimens.

Authors:  Neus Masqué-Soler; Monika Szczepanowski; Christian W Kohler; Rainer Spang; Wolfram Klapper
Journal:  Blood       Date:  2013-09-12       Impact factor: 22.113

4.  Gene mutations and treatment outcome in chronic lymphocytic leukemia: results from the CLL8 trial.

Authors:  Stephan Stilgenbauer; Andrea Schnaiter; Peter Paschka; Thorsten Zenz; Marianna Rossi; Konstanze Döhner; Andreas Bühler; Sebastian Böttcher; Matthias Ritgen; Michael Kneba; Dirk Winkler; Eugen Tausch; Patrick Hoth; Jennifer Edelmann; Daniel Mertens; Lars Bullinger; Manuela Bergmann; Sabrina Kless; Silja Mack; Ulrich Jäger; Nancy Patten; Lin Wu; Michael K Wenger; Günter Fingerle-Rowson; Peter Lichter; Mario Cazzola; Clemens M Wendtner; Anna M Fink; Kirsten Fischer; Raymonde Busch; Michael Hallek; Hartmut Döhner
Journal:  Blood       Date:  2014-03-20       Impact factor: 22.113

5.  Targeted next-generation sequencing in chronic lymphocytic leukemia: a high-throughput yet tailored approach will facilitate implementation in a clinical setting.

Authors:  Lesley-Ann Sutton; Viktor Ljungström; Larry Mansouri; Emma Young; Diego Cortese; Veronika Navrkalova; Jitka Malcikova; Alice F Muggen; Martin Trbusek; Panagiotis Panagiotidis; Frederic Davi; Chrysoula Belessi; Anton W Langerak; Paolo Ghia; Sarka Pospisilova; Kostas Stamatopoulos; Richard Rosenquist
Journal:  Haematologica       Date:  2014-12-05       Impact factor: 9.941

6.  ZAP70 in B-CLL cells related to the expression in NK cells is a surrogate marker for mutational status.

Authors:  Tom G H Wiggers; Guus Westra; Theresia M Westers; Andre P Abbes; Annuska Strunk; Ellen Kuiper-Kramer; Pino Poddighe; Arjan A van de Loosdrecht; Martine E D Chamuleau
Journal:  Cytometry B Clin Cytom       Date:  2013-10-08       Impact factor: 3.058

7.  Genomic aberrations and survival in chronic lymphocytic leukemia.

Authors:  H Döhner; S Stilgenbauer; A Benner; E Leupolt; A Kröber; L Bullinger; K Döhner; M Bentz; P Lichter
Journal:  N Engl J Med       Date:  2000-12-28       Impact factor: 91.245

8.  ZAP-70 expression as a surrogate for immunoglobulin-variable-region mutations in chronic lymphocytic leukemia.

Authors:  Marta Crespo; Francesc Bosch; Neus Villamor; Beatriz Bellosillo; Dolors Colomer; María Rozman; Silvia Marcé; Armando López-Guillermo; Elies Campo; Emili Montserrat
Journal:  N Engl J Med       Date:  2003-05-01       Impact factor: 91.245

9.  Identification of a global gene expression signature of B-chronic lymphocytic leukemia.

Authors:  Diane F Jelinek; Renee C Tschumper; Gustavo A Stolovitzky; Stephen J Iturria; Yuhai Tu; Jorge Lepre; Nigam Shah; Neil E Kay
Journal:  Mol Cancer Res       Date:  2003-03       Impact factor: 5.852

10.  Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes.

Authors:  Jo Vandesompele; Katleen De Preter; Filip Pattyn; Bruce Poppe; Nadine Van Roy; Anne De Paepe; Frank Speleman
Journal:  Genome Biol       Date:  2002-06-18       Impact factor: 13.583

View more
  5 in total

1.  High-level ROR1 associates with accelerated disease progression in chronic lymphocytic leukemia.

Authors:  Bing Cui; Emanuela M Ghia; Liguang Chen; Laura Z Rassenti; Christopher DeBoever; George F Widhopf; Jian Yu; Donna S Neuberg; William G Wierda; Kanti R Rai; Neil E Kay; Jennifer R Brown; Jeffrey A Jones; John G Gribben; Kelly A Frazer; Thomas J Kipps
Journal:  Blood       Date:  2016-11-04       Impact factor: 22.113

2.  Improved classification of leukemic B-cell lymphoproliferative disorders using a transcriptional and genetic classifier.

Authors:  Alba Navarro; Guillem Clot; Alejandra Martínez-Trillos; Magda Pinyol; Pedro Jares; Blanca González-Farré; Daniel Martínez; Nicola Trim; Verónica Fernández; Neus Villamor; Dolors Colomer; Dolors Costa; Itziar Salaverria; David Martín-Garcia; Wendy Erber; Cristina López; Sandrine Jayne; Reiner Siebert; Martin J S Dyer; Adrian Wiestner; Wyndham H Wilson; Marta Aymerich; Armando López-Guillermo; Àlex Sánchez; Elías Campo; Estella Matutes; Sílvia Beà
Journal:  Haematologica       Date:  2017-05-18       Impact factor: 9.941

3.  Hairy cell leukemia: a specific 17-gene expression signature points to new targets for therapy.

Authors:  Elsa Maitre; Edouard Cornet; Agathe Debliquis; Bernard Drenou; François Gravey; Didier Chollet; Stephane Cheze; Mylène Docquier; Xavier Troussard; Thomas Matthes
Journal:  J Cancer Res Clin Oncol       Date:  2022-04-27       Impact factor: 4.322

4.  Decreased TCF1 and BCL11B expression predicts poor prognosis for patients with chronic lymphocytic leukemia.

Authors:  Taotao Liang; Xiaojiao Wang; Yanyan Liu; Hao Ai; Qian Wang; Xianwei Wang; Xudong Wei; Yongping Song; Qingsong Yin
Journal:  Front Immunol       Date:  2022-09-23       Impact factor: 8.786

5.  Differential expression, localization and activity of MARCKS between mantle cell lymphoma and chronic lymphocytic leukemia.

Authors:  J Vargova; K Vargova; N Dusilkova; V Kulvait; V Pospisil; J Zavadil; M Trneny; P Klener; T Stopka
Journal:  Blood Cancer J       Date:  2016-09-23       Impact factor: 11.037

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.