| Literature DB >> 19852842 |
Tina Raman1, Timothy P O'Connor, Neil R Hackett, Wei Wang, Ben-Gary Harvey, Marc A Attiyeh, David T Dang, Matthew Teater, Ronald G Crystal.
Abstract
BACKGROUND: Microarray technology provides a powerful tool for defining gene expression profiles of airway epithelium that lend insight into the pathogenesis of human airway disorders. The focus of this study was to establish rigorous quality control parameters to ensure that microarray assessment of the airway epithelium is not confounded by experimental artifact. Samples (total n = 223) of trachea, large and small airway epithelium were collected by fiberoptic bronchoscopy of 144 individuals and hybridized to Affymetrix microarrays. The pre- and post-chip quality control (QC) criteria established, included: (1) RNA quality, assessed by RNA Integrity Number (RIN) > or = 7.0; (2) cRNA transcript integrity, assessed by signal intensity ratio of GAPDH 3' to 5' probe sets < or = 3.0; and (3) the multi-chip normalization scaling factor < or = 10.0.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19852842 PMCID: PMC2774870 DOI: 10.1186/1471-2164-10-493
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Demographic of the Study Population and Biologic Samples1
| n | 17 | 21 | 35 | 15 | 32 | 44 | 3 | 4 | 10 | 22 | 20 |
| Age | 42 ± 7 | 42 ± 9 | 43 ± 10 | 43 ± 7 | 44 ± 6 | 44 ± 6 | 36 ± 6 | 39 ± 6 | 41 ± 10 | 49 ± 7 | 52 ± 8 |
| Gender2 | 13/4 | 15/6 | 26/9 | 13/2 | 22/10 | 31/13 | 2/1 | 3/1 | 5/5 | 16/6 | 16/4 |
| Ancestry3 | 6/4/0/7 | 8/3/1/9 | 15/4/1/15 | 5/5/0/5 | 7/4/0/21 | 11/4/0/29 | 0/2/0/1 | 1/2/0/1 | 3/3/0/4 | 4/1/0/17 | 8/4/1/7 |
| Smoking history | <1.0 | <1.0 | <1.0 | 28 ± 16 | 28 ± 18 | 28 ± 16 | 14 ± 4 | 16 ± 9 | 21 ± 13 | 31 ± 18 | 38 ± 23 |
| Pulmonary function parameters4 | |||||||||||
| FVC | 111 ± 16 | 105 ± 11 | 109 ± 11 | 108 ± 11 | 109 ± 12 | 109 ± 12 | 116 ± 6 | 113 ± 7 | 110 ± 109 | 102 ± 11 | 93 ± 23 |
| FEV1 | 111 ± 18 | 101 ± 26 | 105 ± 21 | 108 ± 13 | 109 ± 13 | 109 ± 14 | 112 ± 16 | 112 ± 13 | 108 ± 20 | 97 ± 12 | 72 ± 22 |
| FEV1/FVC | 83 ± 7 | 82 ± 6 | 80 ± 7 | 82 ± 6 | 81 ± 5 | 81 ± 5 | 80 ± 6 | 81 ± 3 | 81 ± 13 | 79 ± 4 | 61 ± 9 |
| TLC | 106 ± 17 | 99 ± 14 | 104 ± 13 | 100 ± 8 | 102 ± 12 | 100 ± 12 | 106 ± 4 | 108 ± 4 | 104 ± 19 | 93 ± 13 | 105 ± 22 |
| DLCO | 110 ± 9 | 101 ± 18 | 101 ± 17 | 94 ± 7 | 96 ± 11 | 96 ± 11 | 92 ± 14 | 95 ± 13 | 94 ± 18 | 65 ± 8 | 73 ± 19 |
| Average # of cells recovered (×106) | 5.4 | 6.8 | 5.7 | 4.4 | 6.4 | 6.4 | 6.6 | 7.6 | 6.2 | 5.9 | 6.3 |
| Cell differential5 | |||||||||||
| % epithelial | 100 ± 0.2 | 100 ± 0.7 | 100 ± 0.6 | 100 ± 0.2 | 100 ± 0.7 | 100 ± 0.5 | 100 ± 0.0 | 100 ± 0.6 | 100 ± 0.4 | 99 ± 0.8 | 99 ± 1.7 |
| % inflammatory | 0.1 ± 0.2 | 0.3 ± 0.7 | 0.3 ± 0.6 | 0.1 ± 0.2 | 0.3 ± 0.7 | 0.2 ± 0.5 | 0.0 | 0.4 ± 0.6 | 0.4 ± 0.4 | 0.6 ± 0.8 | 1.5 ± 1.7 |
| % ciliated | 49 ± 7.1 | 55 ± 3.9 | 77 ± 5.6 | 27 ± 8.2 | 49 ± 9.0 | 72 ± 6.7 | 26 ± 2.8 | 47 ± 16 | 76 ± 5.4 | 73 ± 8.7 | 69 ± 2.8 |
| % secretory | 6.6 ± 4.0 | 12 ± 4.0 | 6.8 ± 3.5 | 8.8 ± 4.4 | 11 ± 4.1 | 7.1 ± 3.0 | 12 ± 4.6 | 14 ± 1.2 | 5.9 ± 3.0 | 9.7 ± 7.2 | 12 ± 2.9 |
| % basal | 29 ± 8.6 | 20 ± 3.4 | 9.1 ± 3.4 | 39 ± 5.2 | 24 ± 5.6 | 9.9 ± 3.3 | 37 ± 5.4 | 17 ± 10 | 10 ± 2.5 | 9.9 ± 4.8 | 8.2 ± 2.3 |
| % undifferentiated | 15 ± 6.0 | 13 ± 3.8 | 7.3 ± 3.6 | 25 ± 11 | 15 ± 7.7 | 11 ± 5.6 | 25 ± 1.0 | 22 ± 6.0 | 7.8 ± 1.1 | 7.3 ± 2.4 | 9.6 ± 1.8 |
1 Data is presented as mean ± standard deviation.
2 Male/Female.
3 European/Hispanic/Asian/African.
4 Pulmonary function testing parameters are given as percent of predicted value with the exception of FEV1/FVC, which is reported as % observed; FVC - forced vital capacity, FEV1 - forced expiratory volume in 1 sec, TLC - total lung capacity, DLCO - diffusing capacity. For individuals with COPD, FVC, FEV1, and FEV1/FVC are post-bronchodilator values.
5 % epithelial and inflammatory are based on the total number of cells recovered; % ciliated, secretory, basal and undifferentiated cells are based on the total number of epithelial cells recovered.
6 Lone emphysema with normal spirometry smokers.
7 COPD smoker, GOLD stage I n = 9, II n = 9, III n = 2.
Figure 1Assessment of RNA quality in airway epithelial samples. Integrity of 180 RNA samples was scored using the RNA Integrity Number (RIN) generated by Agilent 2100 Bioanalyzer Software (1 = highly degraded; 10 = intact). Samples are grouped by phenotype as defined in Methods, and within each phenotype the site of the epithelial sample is indicated (trachea; large airway; small airway). Samples with RIN ≥ 7.0, shown by the dotted line, passed QC criterion, while the 5 samples below the dotted line failed the QC criterion.
Figure 2Assessment of GAPDH 3'/5' and Chip scaling factor. Ratios of signal intensities for GAPDH 3' and 5' probe sets for 223 samples were extracted from the GeneChip Operating Software (GCOS) Quality Report and plotted against the Scaling Factors analyzed with a target intensity value of 500 extracted from the GCOS Quality Report. Samples with GAPDH 3'/5' ratio ≤ 3.0, to the left of the vertical dotted line, passed QC criterion, while the one sample to the right of the dotted line failed the QC criterion. Samples with scaling factor values ≤ 10.0 passed QC criterion (below the horizontal dashed line) while the 7 samples above the dashed line failed the QC criterion.
Classification of Quality Control Failures by Criterion 1
| 2 | 0 | 5 | |
| 0 | 1 | 2 | |
| 1 | 0 | 0 | |
| 2 | 0 | 0 | |
| 5 | 1 | 7 | |
1 The 233 samples were assessed by the established QC criteria and all those failing one or more were classified by which QC criteria were failed.
2 Only 191 of the 223 samples were assessed for RIN.
Figure 3Pairwise correlations of expression levels for 100 maintenance genes. Expression levels for 100 maintenance genes were determined for 34 airway epithelial samples of which 24 randomly selected samples passed the pre-determined QC criteria and 10 failed one or more of the criteria. The vertical and horizontal numbers refer to the 34 samples, categorized as "pass" or "fail"; LA = large airway; SA = small airway. Pearson correlation coefficients for all pairwise comparisons between the 34 samples were determined and are plotted in grey-scale, with each cell representing a single correlation between two samples (white, r > 0.94; gray, 0.92 ≤ r ≤ 0.94; black, r < 0.92). Shown are the 24 × 24 comparison of samples both passing QC, the 24 × 10 between samples passing QC and samples failing QC, and the 10 × 10 comparison of samples both failing QC. Note that all of the correlation values <0.92 are derived only from pairwise comparisons including samples failing the QC criteria.
Figure 4Frequency distribution of correlation coefficients calculated for pairwise comparisons. Shaded dark grey region represents pairwise comparisons (n = 285) where at least 1 sample failed the QC criteria. Light grey region represent pairwise comparisons (n = 276) where both samples pass QC criteria. The majority of samples passing the QC criteria have correlation values >0.94.
Figure 5Variability in maintenance gene expression levels in samples that pass or fail QC criteria. The coefficients of variation for each of the 100 maintenance genes were calculated across 2 data sets: a data set of 10 samples failing QC criteria (red squares), and a randomly selected data set of 10 samples that pass QC criteria (blue triangles). Upper and lower boundaries of shaded regions represent 95th and 5th percentiles, respectively, of coefficient of variation across samples failing the QC criteria (red box) and coefficient of variation across samples passing the QC criteria (blue box).
Figure 6Principal components analysis of genome-wide gene transcriptome data in failed and passed COPD subjects. The axes have been rotated presenting a top view to highlight the 2 standard deviation ovoid clustering of expression from failed and passed COPD subjects. Each axis represents one principal component (PC), with PC1 on the x axis, PC3 on the y axis and PC2 on the z axis. Failed COPD subjects are represented by red spheres and passed COPD subjects by green spheres.