| Literature DB >> 26484146 |
Paul Dickinson1, Claire L Smith2, Thorsten Forster1, Marie Craigon3, Alan J Ross3, Mizan R Khondoker3, Alasdair Ivens4, David J Lynn5, Judith Orme2, Allan Jackson2, Paul Lacaze3, Katie L Flanagan6, Benjamin J Stenson2, Peter Ghazal1.
Abstract
Neonatal infection remains a primary cause of infant morbidity and mortality worldwide and yet our understanding of how human neonates respond to infection remains incomplete. Changes in host gene expression in response to infection may occur in any part of the body, with the continuous interaction between blood and tissues allowing blood cells to act as biosensors for the changes. In this study we have used whole blood transcriptome profiling to systematically identify signatures and the pathway biology underlying the pathogenesis of neonatal infection. Blood samples were collected from neonates at the first clinical signs of suspected sepsis alongside age matched healthy control subjects. Here we report a detailed description of the study design, including clinical data collected, experimental methods used and data analysis workflows and which correspond with data in Gene Expression Omnibus (GEO) data sets (GSE25504). Our data set has allowed identification of a patient invariant 52-gene classifier that predicts bacterial infection with high accuracy and lays the foundation for advancing diagnostic, prognostic and therapeutic strategies for neonatal sepsis.Entities:
Keywords: Gene expression profiling; Microarray; Neonatal sepsis; Whole blood
Year: 2014 PMID: 26484146 PMCID: PMC4535963 DOI: 10.1016/j.gdata.2014.11.003
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Patient demographics of samples used, microorganisms identified from infected patients and reasons for blood sampling in controls.
| Patient demographics of samples used | |||||||
|---|---|---|---|---|---|---|---|
| Sample set | Training set | Platform test set | Validation test set | ||||
| Infection status | Infected ( | Control ( | Infected ( | Control ( | Infected ( | Control ( | |
| Male | 15 (54%) | 22 (63%) | 10 (56%) | 15 (63%) | 10 (63%) | 9 (90%) | |
| Gestation completed at birth (week): range (mean) | 24–38 (28.5) | 26–42 (37.9) | 24–38 (28.8) | 26–42 (37.3) | 23–40 (28.3) | 24–41 (31) | |
| Gestation completed at sampling (week): range (mean) | 26–39 (31.1) | 31–44 (39.4) | 26–39 (30.8) | 31–44 (39.1) | 25–41 (33.8) | 29–42 (34.9) | |
| Birthweight (g): range (mean) | 430–3380 (1126) | 650–4570 (3080) | 430–3380 (1236) | 650–4350 (2941) | 635–3160 (1134) | 800–4220 (1932) | |
A. Patient demographics of samples used. Patient sample details are shown displaying the demographics of the population studied. B. Microorganisms identified from infected patients. Organisms detected for each infected infant are shown — these samples were taken at, or within 6 h of, the time of clinical suspicion of infection. C. Reasons for blood sampling in controls. The reasons for clinical blood sampling in the control group are shown — all of the screening tests in these infants were normal. Table 1 was adapted from Supplementary Table 3 of Smith et al. 2014 [2] by permission from Macmillan Publishers Ltd: Nature Communications [2], copyright (2014).
Clinical details of patient samples used in the study. Table 2 was adapted from Supplementary Data 4 of Smith et al. 2014 [2] by permission from Macmillan Publishers Ltd: Nature Communications [2], copyright (2014).
Fig. 1Study recruitment and sample processing. This flow diagram depicts process of neonatal subject recruitment over sample processing and microarray hybridization. Boxes and arrows are color-coded as follows. Healthy (presenting for clinical reasons other than suspected infection) control neonate samples = blue; neonate samples of suspected but unconfirmed infections = gray; neonate samples with blood-culture test confirmed infection = pink; neonate samples with blood-culture negative test but confirmed viral infection = striped pink. Figure 1 was adapted from Supplementary Figure 9 of Smith et al. 2014 [2] by permission from Macmillan Publishers Ltd: Nature Communications [2], copyright (2014).
Fig. 2Sequence of study analyses prior to validating 52-gene set as a classifier. This flow diagram identifies the sequence of analyses carried out on Illumina microarray data. The gray box indicates that the analyses within are used in combination to inform a subsequent result. Figure 2 was adapted from Supplementary Figure 10 of Smith et al. 2014 [2] by permission from Macmillan Publishers Ltd: Nature Communications [2], copyright (2014).
Fig. 3Training and testing of 52-gene classifier of sepsis in neonates. This diagram details the stages comprising training and testing of the ROC-based classifier. Top box represents processes in the training of the classifier; bottom box represents processes in the testing of the classifier on various types of test sets. LOOCV stands for leave-one-out-cross-validation, which is the iterative process in which a single sample of the training set is predicted based on the classifier trained on all remaining samples. Black arrows are data processing steps; red arrows indicate classifier training and prediction steps. Sample color coding: healthy (presenting for other clinical reasons than suspected infection) control neonate samples = blue; neonate samples of suspected but unconfirmed infections = gray; neonate samples with blood-culture test confirmed infection = pink; neonate samples with blood-culture negative test but confirmed viral infection = striped pink. Figure 3 was adapted from Supplementary Figure 11 of Smith et al. 2014 [2] by permission from Macmillan Publishers Ltd: Nature Communications [2], copyright (2014).
| Specifications | |
|---|---|
| Organism/cell line/tissue | Homo sapiens/whole blood |
| Sex | Male and female |
| Sequencer or array type | Illumina HT-12V3.0 Whole Human Genome microarray, CodeLink 55K Whole Human Genome microarray, Affymetrix U219 Whole Human Genome microarray and Affymetrix HG U133 Plus 2.0 Whole Human Genome microarray |
| Data format | Raw data (Tab delimited text files of background subtracted signals and .CEL files) |
| Experimental factors | Blood culture or cerebrospinal fluid positive bacterial sepsis vs. healthy control whole blood samples and culture negative suspected infected samples |
| Experimental features | A case–control gene expression profiling study of whole blood taken from neonates at the first clinical sign of sepsis and control healthy neonates. Study includes training and replication sets for blood culture positive samples and clinical evaluation set of blood culture negative sepsis cases. Results compared blood culture or cerebrospinal fluid positive septic neonates, blood culture negative septic neonates and healthy control neonates. Prior power calculations were based on Healthy Edinburgh neonates using the CodeLink platform and Gambian infants (9 months of age) were used for further refinement of power calculations using Illumina HT-12 platform. |
| Consent | Written informed consent was obtained from parents of all enrolled infants in accordance with approval granted by the Lothian Research Ethics Committee for blood samples for RNA isolation obtained at the first time of clinical signs of suspected sepsis (reference 05/s1103/3). Samples obtained from The Gambia conformed to MRC policy regarding ethical research in children and were approved by the local scientific coordinating committee (SCC), the Joint Gambia Government/MRC Ethics Committee and by the London School of Hygiene and Tropical Medicine Ethics Committee (reference SCC1085 Pilot Study 1 (L2008.63)) |
| Sample source location | Edinburgh, UK and The Gambia |