| Literature DB >> 34193041 |
Kristin Hieronymus1, Benjamin Dorschner1, Felix Schulze1, Neeta L Vora2, Joel S Parker3, Jennifer Lucia Winkler4, Angela Rösen-Wolff1, Stefan Winkler5.
Abstract
BACKGROUND: Preterm birth is the leading cause of neonatal morbidity and mortality, but research efforts in neonatology are complicated due to the unavailability of large volume blood samples. Whole blood assays can be used to overcome this problem by performing both functional and gene expression studies using small amounts of blood. Gene expression studies using RT-qPCR estimate mRNA-levels of target genes normalized to reference genes. The goal of this study was to identify and validate stable reference genes applicable to cord blood samples obtained from developing neonates of different gestational age groups as well as to adult peripheral blood samples. Eight reference gene candidates (ACTB, B2M, GAPDH, GUSB, HPRT, PPIB, RPLP0, RPL13) were analyzed using the three published software algorithms Bestkeeper, GeNorm and NormFinder.Entities:
Keywords: Comparative gene expression analysis; Housekeeping genes; Infants; Neonates; Validation; Whole blood
Mesh:
Substances:
Year: 2021 PMID: 34193041 PMCID: PMC8244134 DOI: 10.1186/s12864-021-07801-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Maternal and newborn characteristics by group
| Measure | Preterm neonates (group 1, | Full-term neonates (group 2, | |
|---|---|---|---|
| Maternal age, mean (range), years | 30 (23–36) | 32.3 (23–39) | 0.289 |
| Receipt of antenatal corticosteroids | 0.003 | ||
| 4 (40) | 10 (100) | ||
| 0 (0) | 0 (0) | ||
| 6 (60) | 0 (0) | ||
| Mode of delivery | > 0.999 | ||
| 7 (70) | 7 (70) | ||
| 3 (30) | 3 (30) | ||
| Gestational age, mean (range), weeks | 30.6 (23.4–35.9) | 38.9 (37.3–40.4) | < 0.0001 |
| Birth weight, mean (range), g | 1419 (465–2260) | 3313 (2500–4570) | < 0.0001 |
| < 10th centile birth weight, n (%) | 3 (30) | 1 (10) | 0.264 |
| Umbilical artery pH, mean (range) | 7.31 (7.13–7.43) | 7.31 (7.22–7.37) | 0.954 |
| Infant sex | 0.361 | ||
| 5 (50) | 7 (70) | ||
| 5 (50) | 3 (30) |
Name, function, primer sequences and amplicon characteristics of candidate reference genes (bp, base pairs; E, PCR efficiency)
| Name | Accession No. | Function | Sequence (5‘ -> 3‘) | Amplicon (bp) | |
|---|---|---|---|---|---|
| NM_001101 | Cytoskeletal structural protein | F: AGA GCT ACG AGC TGC CTG AC R: AGC ACT GTG TTG GCG TAC AG | 184 | 92.48 | |
| NM_004048 | Beta-chain of MHC-I molecules | F: CAC TGA ATT CAC CCC CAC TGA R: CTG CTT ACA TGT CTC GAT CCC A | 104 | 92.94 | |
| NM_002046 | Enzyme of glycolysis | F: CCA TGA GAA GTA TGA CAA CAG CC R: GGG TGC TAA GCA GTT GGT G | 70 | 96.28 | |
| NM_000181 | Exoglycosidase in lysosomes | F: ACT TCT CTG ACA ACC GAC GC R: AGG ATC ACC TCC CGT TCG TA | 172 | 95.21 | |
| NM_000194 | Glycosyltransferase mutase | F: GAC CAG TCA ACA GGG GAC AT R: AAG CTT GCG ACC TTG ACC AT | 167 | 93.83 | |
| NM_000942 | Cyclosporine-binding protein | F: GCC GGG TGA TCT TTG GTC TC R: AAG TCT CCG CCC TGG ATC AT | 148 | 100.55 | |
| NM_001002 | Ribosomal protein | F: TGG CAG CAT CTA CAA CCC TG R: ATC TGC AGA CAG ACA CTG GC | 102 | 90.21 | |
| NM_012423 | Ribosomal protein | F: CGA GGT TGG CTG GAA GTA CC R: CTT CTC GGC CTG TTT CCG TAG | 121 | 92.44 |
Fig. 1RT-qPCR cycle threshold values of candidate reference genes differ between experimental groups. Cycle threshold values (ct) are depicted as Tukey boxplot (plotted are median, inter-quartile-range (IQR) (box), 75th percentile + 1.5 IQR or 25th percentile – 1.5 IQR (whiskers) and outliers). n = 10 per group
Fig. 2Algorithm based ranking of candidate reference genes. a – b BestKeeper based calculation of the coefficient of determination (r2) for groups 1 and 2 (neonates) or groups 1–3 (all). c Standard deviation and r2 for groups 1 and 2 (preterm and full-term neonates) or groups 1–3 (all). d – f NormFinder based calculation of stability values for groups 1 and 2 (preterm and full-term neonates) or groups 1–3 (all). g – i GeNorm based calculation of stability values for groups 1 and 2 (preterm and full-term neonates) or groups 1–3 (all). j – k Overall ranking of candidate reference genes for groups 1 and 2 (preterm and full-term neonates) or groups 1–3 (all). l – m GeNorm based calculation of stability values for groups 1 and 2 (preterm and full-term neonates) after exclusion of RPLP0 from the analysis. n - o GeNorm based calculation of the number of reference genes needed to build a stable normalization factor for groups 1 and 2 (preterm and full-term neonates) or groups 1–3 (neonates and healthy adults). BK, BestKeeper; NF, NormFinder; GN, GeNorm
Fig. 3Reference genes introduce systematic bias despite algorithm based validation. a Relative gene expression (normalized to the mean of group 1 (preterm neonates)) for each candidate reference gene or normalization factor. b Fold-change difference in gene expression of full-term neonates or healthy adults relative to preterm neonates. c Relative gene expression of TLR4 from cord blood of preterm or full-term neonates and peripheral blood of healthy adults is shown non-normalized or normalized to different candidate reference genes or normalization factors. a, c Data is presented as mean ± SD
Fig. 4RNAseq data can be used to assess performance of candidate reference genes and to screen for new reference genes. a Relative expression of candidate reference genes based on the RNA-seq data from cord blood of preterm and full-term neonates [15]. Data was normalized to the group mean gene expression of preterm neonates and presented as mean ± SD. b Fold-change gene expression difference between full-term and preterm neonates. c Following systematic analysis of the RNA-seq data, the relative expression of genes with the lowest CV are depicted. Data was normalized to the group mean gene expression of preterm neonates and presented as mean ± SD. d Fold-change gene expression difference between full-term and preterm neonates. e Relative gene expression of candidate reference genes based on the RNA-seq data from peripheral blood of full-term neonates taken at day of life (DOL) 0 and additionally at either DOL 1, 3 or 7 [17]. Data was normalized to the group mean gene expression of DOL 0 and presented as mean ± SD. f Fold-change gene expression difference between DOL 7 and DOL 0. g Systematic analysis of RNA-seq datasets GSE123070 (dataset “A”) and GSE111404 (dataset “B”) [17]. h Following systematic analysis of the RNA-seq data, the relative expression of genes with the lowest CV are depicted. Data was normalized to the group mean gene expression of DOL 0 and presented as mean ± SD. i Fold-change gene expression difference between DOL 7 and DOL 0