Literature DB >> 15774070

Artificial neural networks as prediction tools in the critically ill.

Abstract

The past 25 years have witnessed the development of improved tools with which to predict short-term and long-term outcomes after critical illness. The general paradigm for constructing the best known tools has been the logistic regression model. Recently, a variety of alternative tools, such as artificial neural networks, have been proposed, with claims of improved performance over more traditional models in particular settings. However, these newer methods have yet to demonstrate their practicality and usefulness within the context of predicting outcomes in the critically ill.

Entities: Disease Species

Mesh：

Year: 2005 PMID： 15774070 PMCID： PMC1175945 DOI： 10.1186/cc3507

Source DB: PubMed Journal: Crit Care ISSN： 1364-8535 Impact factor: 9.097

Introduction

The science of outcome prediction is particularly useful in the setting of the emergency room – the entry point of many to acute care. In this issue of Critical Care Jaimes and coworkers [1] evaluate the usefulness of artificial neural networks (ANNs) in predicting hospital mortality in patients presenting to the emergency room with suspected sepsis. The construction of a prediction tool is a difficult undertaking; it requires careful methodological consideration and validation before the predictions can be deemed valid and reliable in naïve patients [2,3]. These tools identify the presence of associations between the outcome of interest and empiric risk factors that contribute to this outcome. A well designed tool will typically possess three qualities: discrimination (the ability to identify accurately those patients who will reach the outcome from those who will not), goodness of fit (the ability to match accurately predicted and actual outcomes, such as mortality rate, in subgroups of patients), and the ability to achieve these predictions in cohorts of patients similar to those in which the tool was developed [4,5].

Artificial neural networks as prediction tools

Most predictive tool use logistic regression – a well vetted statistical technique that is applicable to situations in which the outcome is binary (e.g. survival/death), measured at a predetermined time in the future [6]. The technique can precisely quantify the relative contribution of each risk factor to outcome, typically crystallized as the odds ratio (i.e. the relative odds that a patient with the risk factor has of reaching the outcome as compared with a patient without the risk factor). ANNs represent an alternative technique for achieving predictions. The key difference between the two techniques is that the contribution of each risk factor is not as rigidly dictated with ANNs as it is in a logistic regression model. ANNs can improve predictions by extracting information drawn from unforeseen interactions between predictors. Arguably, if a modeler had foresight of the important interactions present, then they could design an accurate standard logistic regression model without the heavy price associated with the use of an ANN. Often considered as 'black boxes', ANNs suffer from several shortcomings. There is no clear prescription to construct an ANN. Off the shelf software is readily available, but there is little guidance as to how ANN parameters, such as number of intermediate (hidden) layers of neurons, number of neurons in those layers, learning rate, activation functions, and several other tuning parameters, should be chosen and tuned for optimal performance by a novice user. ANNs, by virtue of the presence of numerous weights linking the neurons, can accommodate nonlinearity but they also include very large numbers of parameters. There are no clear techniques that provide limits of confidence for those parameters. Consequently, the relative contribution of each input risk factor to the outcome is difficult to quantify. Because of the large number of parameters, it is very easy for an ANN to overfit the development dataset. In other words, the predictions offered by the ANN will be overly optimistic in the original population of patients, and this good performance will not generalize well to populations to which the ANN is naïve. Jaimes and coworkers [1] do not offer a prediction in a naïve population, nor do those investigators clearly indicate whether techniques were applied to minimize the risk of overfitting. Thus, their conclusion may be overstated. Limited measures guarding against overfitting can be applied within a development set, without specific recourse to an independent validation set. Finally, preparing input data for an ANN requires some amount of a priori knowledge. For example, white blood cell count or fever typically have a 'U'-shaped relationship to outcome. Both low and high values portend a bad outcome, whereas intermediate values are normal. The predictions of an ANN will be significantly improved if this unintuitive relationship between risk factor and outcome is already 'known' to the modeler ahead of time. ANNs have been used by many investigators to predict outcomes in the critically ill [7-11]. Clearly, ANNs – as prediction tools – require significant expertise to build, have significant shortcomings, and should be developed and validated as rigorously as logistic regression models [12]. ANNs do offer greater flexibility and may allow the identification of unforeseen interactions. ANNs can predict continuous outcomes, and thus offer an alternative to multivariable regression. Similarly, ANNs are easily scalable to ordinal or categorical outcomes, and to survival analysis [13], which is much less familiar territory for the critical care physician. However, ANNs can easily be misused, typically because their limitations are not well understood [14,15].

Conclusion

Therefore, in settings in which familiar tools such as logistic regression can be applied, ANNs should be reserved for those situations where standard models do not perform well; where one suspects the presence of intense but poorly characterized interactions between risk factors; where one does not particularly care about quantifying the relative contributions of risk factors; where appropriate validation is possible; and, most importantly, where proper expertise is readily available. The science of developing prediction models is best left to the expert, but the clinician can contribute invaluable content knowledge to the process.

Abbreviations

ANN = artificial neural network.

Competing interests

Dr Clermont is a paid consultant to Amgen, Altana, and Immunetrics, LLC. The current manuscript was realized without any financial or other material support from commercial entities.

14 in total

1. On the misuses of artificial neural networks for prognostic and diagnostic classification in oncology.

Authors: G Schwarzer; W Vach; M Schumacher
Journal: Stat Med Date: 2000-02-29 Impact factor: 2.373

Review 2. Predicting outcome after ICU admission. The art and science of assessing risk.

Authors: D P Schuster
Journal: Chest Date: 1992-12 Impact factor: 9.410

3. Artificial neural network prediction of ventilator-induced lung edema formation.

Authors: John R Hotchkiss; Alain F Broccard; Philip S Crooke
Journal: Crit Care Med Date: 2003-08 Impact factor: 7.598

4. Predicting hospital mortality for patients in the intensive care unit: a comparison of artificial neural networks with logistic regression models.

Authors: G Clermont; D C Angus; S M DiRusso; M Griffin; W T Linde-Zwirble
Journal: Crit Care Med Date: 2001-02 Impact factor: 7.598

5. A technique for using neural network analysis to perform survival analysis of censored data.

Authors: M De Laurentiis; P M Ravdin
Journal: Cancer Lett Date: 1994-03-15 Impact factor: 8.679

6. Predicting outcomes after liver transplantation. A connectionist approach.

Authors: H R Doyle; I Dvorchik; S Mitchell; I R Marino; F H Ebert; J McMichael; J J Fung
Journal: Ann Surg Date: 1994-04 Impact factor: 12.969

7. A review of goodness of fit statistics for use in the development of logistic regression models.

Authors: S Lemeshow; D W Hosmer
Journal: Am J Epidemiol Date: 1982-01 Impact factor: 4.897

8. A comparison of statistical and connectionist models for the prediction of chronicity in a surgical intensive care unit.

Authors: T G Buchman; K L Kubos; A J Seidler; M J Siegforth
Journal: Crit Care Med Date: 1994-05 Impact factor: 7.598

9. Modeling mortality in the intensive care unit: comparing the performance of a back-propagation, associative-learning neural network with multivariate logistic regression.

Authors: G S Doig; K J Inman; W J Sibbald; C M Martin; J M Robertson
Journal: Proc Annu Symp Comput Appl Med Care Date: 1993

10. Comparison between logistic regression and neural networks to predict death in patients with suspected sepsis in the emergency room.

Authors: Fabián Jaimes; Jorge Farbiarz; Diego Alvarez; Carlos Martínez
Journal: Crit Care Date: 2005-02-17 Impact factor: 9.097

1 in total

1. In Silico Modeling in Infectious Disease.

Authors: Silvia Daun; Gilles Clermont
Journal: Drug Discov Today Dis Models Date: 2007-10-01

1 in total