| Literature DB >> 35304543 |
Lise M Hardy1,2, Yosra Bouyacoub1,2, Antoine Daunay1, Mourad Sahbatou1,3, Laura G Baudrin1,2, Laetitia Gressin4, Mathilde Touvier5, Hélène Blanché2,4,6, Jean-François Deleuze1,2,7,6, Alexandre How-Kit8,9.
Abstract
Lymphoblastoid cell lines (LCLs) derive from blood infected in vitro by Epstein-Barr virus and were used in several genetic, transcriptomic and epigenomic studies. Although few changes were shown between LCL and blood genotypes (SNPs) validating their use in genetics, more were highlighted for other genomic features and/or in their transcriptome and epigenome. This could render them less appropriate for these studies, notably when blood DNA could still be available. Here we developed a simple, high-throughput and cost-effective real-time PCR approach allowing to distinguish blood from LCL DNA samples based on the presence of EBV relative load and rearranged T-cell receptors γ and β. Our approach was able to achieve 98.5% sensitivity and 100% specificity on DNA of known origin (458 blood and 316 LCL DNA). It was further applied to 1957 DNA samples from the CEPH Aging cohort comprising DNA of uncertain origin, identifying 784 blood and 1016 LCL DNA. A subset of these DNA was further analyzed with an epigenetic clock indicating that DNA extracted from blood should be preferred to LCL for DNA methylation-based age prediction analysis. Our approach could thereby be a powerful tool to ascertain the origin of DNA in old collections prior to (epi)genomic studies.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35304543 PMCID: PMC8933453 DOI: 10.1038/s41598-022-08663-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Descriptive statistics of the DNA samples used from the four collections used.
| Cohort characteristics | CEPH reference families (n = 316) | SU.VI.MAX (n = 364) | EFS (n = 93) | CEPH aging (n = 1813) | |
|---|---|---|---|---|---|
| Nonagenarians and centenarians (NC, n = 1346) | Nonagenarians and centenarians’ offspring (NCO, n = 467) | ||||
| Tissue-of-origin of DNA | LCL | Blood | Blood | LCL and blood | LCL and blood |
| Agea in years, M ± SD (range) | 48.9 ± 22.1 (18–97)b | 48.9 ± 5.9 (35–61) | 41.6 ± 13.4 (19–69) | 99.3 ± 3.8 (90–110+) | 68.4 ± 9.2 (48–90) |
| Females, n (%) | 158 (50%) | 182 (50%) | 40 (43%) | 1032 (76.7%) | 262 (56.1%) |
aAge at collection.
bKnown for 214 samples.
Figure 1Distribution of Ct /Ct ratios, Ct /Ct ratios and mean TCR Tm from blood DNA from EFS and SU.VI.MAX (n = 457) and LCL DNA from CEPH reference families (n = 316) using real-time PCR assays. (A) Distribution of Ct /Ct ratios based on EBV and GAPDH real-time PCR assays. (B) Distribution of Ct /Ct ratios based on TCR and GAPDH real-time PCR assays. (C) Distribution of mean TCR Tm based on TCR real-time PCR assay. The chosen thresholds for each test are given above the frameworks.
Calculations of sensitivity, specificity, PPV, NPV and accuracy for tissue-of-origin tests.
| Tissue-of-origin assay | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) | Accuracy |
|---|---|---|---|---|---|
| Ct | 98.5 | 100.0 | 100.0 | 97.8 | 0.991 |
| Ct | 94.3 | 99.4 | 99.5 | 92.4 | 0.962 |
| Tm
| 98.2 | 98.7 | 99.1 | 97.5 | 0.984 |
| Combined | 98.5 | 100.0 | 100.0 | 97.8 | 0.991 |
For our calculations, blood + was considered as the positive result, and LCL + and uncertain origin as negatives results.
Figure 2Venn diagrams of the results using combination of the three real-time PCR assays (EBV, TCR and TCR) from blood and LCL DNA of known origin. (A) LCL DNA samples from CEPH reference families (n = 316) distribution. (B) Blood DNA samples from EFS and SU.VI.MAX (n = 457) distribution. When there was a discrepancy between the results of the three tests, these samples were represented on both the left and right Venn diagrams. For each Venn Diagram, the percentages are calculated from the total number of blood (316 for panel A) and LCL (457 for panel B) reference DNA samples.
Figure 3Application of the three tissue-of-origin real-time PCR assays to DNA samples (n = 1957) of the CEPH Aging cohort. (A) Distribution of Ct /Ct ratios, Ct /Ct ratios and mean TCR Tm of CEPH Aging cohort DNA samples based on EBV, TCR and TCR real-time PCR assays. (B) Venn diagrams of the results using the combination of the three real-time PCR assays: EBV, TCR and TCR. When there was a discrepancy between the results of the three tests, the samples were represented on both the left and right Venn diagrams. The percentages were calculated from the total number samples present in each Venn Diagram (1152 for the left and 808 for the right).
Concordance between the information present in the CEPH Biobase database and results of real-time PCR tissue-of-origin assays.
| Origin of DNA according to the CEPH Biobase database | Tissue-of-origin PCR test | Concordance of DNA origin between database information and PCR test results | ||
|---|---|---|---|---|
| Blood | LCL | Uncertain | ||
Blood n = 269 (13.74%) | 263 (13.44%) | 0 (0%) | 6 (0.31%) | Blood DNA = 97.77% |
LCL n = 1035 (52.89%) | 3 (0.15%) | 1032 (52.73%) | 0 (0%) | LCL DNA = 99.71% |
Uncertain n = 653 (33.37%) | 530 (27.08%) | 116 (5.93%) | 7 (0.35%) | Uncertain DNA = 1.07% |
Total n = 1957 (100%) | 796 (40.67%) | 1148 (58.66%) | 13 (0.66%) | - |
Figure 4Age predictions of all DNA samples from the DNA methylation-based age prediction model of Thong[44] using three CpG loci and pyrosequencing. (A) Age predictions from EFS blood (n = 24) and CEPH families EBV cell line (n = 26) DNA samples. (B) Age predictions from blood (n = 24) and EBV cell line (n = 21) DNA samples of the CEPH Aging cohort. The mean absolute deviation (MAD) of the predicted age from the chronological age and the Pearson R coefficient are given on each graph in red and green for blood and LCL samples, respectively.