Literature DB >> 16168099

How many genes are needed for early detection of breast cancer, based on gene expression patterns in peripheral blood cells?

Wuju Li.   

Abstract

Entities:  

Mesh:

Year:  2005        PMID: 16168099      PMCID: PMC1242153          DOI: 10.1186/bcr1295

Source DB:  PubMed          Journal:  Breast Cancer Res        ISSN: 1465-5411            Impact factor:   6.466


× No keyword cloud information.

In their recent report [1], Sharma and coworkers explore the early detection of breast cancer. They analyzed a gene expression data set (1368 genes in 62 normal and 40 tumour samples, including sample duplication in different batches) using the nearest shrunken centroid method. They identified a panel of 37 genes that permitted early detection, with the classification accuracy being about 82%. This is a typical problem with sample classification based on gene expression profiling. The objective is to achieve high prediction accuracy with as few genes as possible, and so feature selection plays an important role; examination of a large number of genes will increase the dimensionality, computational complexity, and clinical cost. According to our previous study of data sets from patients with colon cancer, leukaemia and breast cancer [2], we estimated that five or six genes – rather than 37 -would be sufficient for the early detection of beast cancer [1]. So how many genes are indeed needed? In order to address this question, we evaluated the data presented by Sharma and coworkers using the Tclass system [2]. In the Tclass system, Fisher's linear discriminant analysis and a step-wise optimization procedure for feature selection are used to analyze a batch adjusted data set [1] in two ways. The first is to take the prediction accuracy from the training set as the object function. The second way is to take the classification accuracy from the leave-one-out cross-validation as the object function. For the former, the selected optimal feature sets are evaluated by randomly dividing all tissue samples into a training set (e.g. 50%, 67%, or 85% of samples) and a test set 200 times. The relationship between the prediction accuracy and the number of genes is illustrated in Fig. 1, which shows that the greatest prediction accuracy was achieved using six genes (Fig. 1a); other peaks in accuracy occurred when 10, 13, or 15 genes were used (Fig. 1b). Furthermore, two genes – the 481th (BC009696) and the 801th (BC000514) – permitted classification accuracy as high as 86%, which is greater than the 82% achieved by Sharma and coworkers [1] with the selected 37 genes.
Figure 1

Number of genes examined and classification accuracy. The relationship between the number of genes and classification accuracy is shown for (a) different partition ratios and (b) leave-one-out cross-validation analysis.

In summary, we may draw the following conclusions. First, the number of genes needed for early detection of breast cancer is fewer than 10, based on the data set in the report by Sharma and coworkers [1]. Second, the classification accuracy will gradually decrease when the number of genes exceeds 6 (Fig. 1a) and 10 (Fig. 1b). Related details and information regarding the Tclass system are available upon request or from our website [3].

Competing interests

The authors(s) declare that they have no competing interests.
  3 in total

1.  Tclass: tumor classification system based on gene expression profile.

Authors:  Li Wuju; Xiong Momiao
Journal:  Bioinformatics       Date:  2002-02       Impact factor: 6.937

2.  Early detection of breast cancer based on gene-expression patterns in peripheral blood cells.

Authors:  Praveen Sharma; Narinder S Sahni; Robert Tibshirani; Per Skaane; Petter Urdal; Hege Berghagen; Marianne Jensen; Lena Kristiansen; Cecilie Moen; Pradeep Sharma; Alia Zaka; Jarle Arnes; Torill Sauer; Lars A Akslen; Ellen Schlichting; Anne-Lise Børresen-Dale; Anders Lönneborg
Journal:  Breast Cancer Res       Date:  2005-06-14       Impact factor: 6.466

3.  How many genes are needed for early detection of breast cancer, based on gene expression patterns in peripheral blood cells?

Authors:  Wuju Li
Journal:  Breast Cancer Res       Date:  2005-07-29       Impact factor: 6.466

  3 in total
  8 in total

1.  Gene selection for multiclass prediction by weighted Fisher criterion.

Authors:  Jianhua Xuan; Yue Wang; Yibin Dong; Yuanjian Feng; Bin Wang; Javed Khan; Maria Bakay; Zuyi Wang; Lauren Pachman; Sara Winokur; Yi-Wen Chen; Robert Clarke; Eric Hoffman
Journal:  EURASIP J Bioinform Syst Biol       Date:  2007

2.  Selecting normalization genes for small diagnostic microarrays.

Authors:  Jochen Jaeger; Rainer Spang
Journal:  BMC Bioinformatics       Date:  2006-08-22       Impact factor: 3.169

3.  How many genes are needed for early detection of breast cancer, based on gene expression patterns in peripheral blood cells?

Authors:  Wuju Li
Journal:  Breast Cancer Res       Date:  2005-07-29       Impact factor: 6.466

4.  A neural network approach to multi-biomarker panel discovery by high-throughput plasma proteomics profiling of breast cancer.

Authors:  Fan Zhang; Jake Chen; Mu Wang; Renee Drabier
Journal:  BMC Proc       Date:  2013-12-20

5.  Recursive SVM biomarker selection for early detection of breast cancer in peripheral blood.

Authors:  Fan Zhang; Howard L Kaufman; Youping Deng; Renee Drabier
Journal:  BMC Med Genomics       Date:  2013-01-23       Impact factor: 3.063

6.  Prediction and prognosis: impact of gene expression profiling in personalized treatment of breast cancer patients.

Authors:  Michael R Mallmann; Andrea Staratschek-Jox; Christian Rudlowski; Michael Braun; Andrea Gaarz; Matthias Wolfgarten; Walther Kuhn; Joachim L Schultze
Journal:  EPMA J       Date:  2010-08-20       Impact factor: 6.543

7.  Identification of novel autoantibodies for detection of malignant mesothelioma.

Authors:  Xufei Zhang; Weike Shen; Xiaomin Dong; Jiangping Fan; Lixia Liu; Xu Gao; Kemp H Kernstine; Li Zhong
Journal:  PLoS One       Date:  2013-08-19       Impact factor: 3.240

8.  Multiple biomarker panels for early detection of breast cancer in peripheral blood.

Authors:  Fan Zhang; Youping Deng; Renee Drabier
Journal:  Biomed Res Int       Date:  2013-11-26       Impact factor: 3.411

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.