| Literature DB >> 21966361 |
Angela Stokes1, Ignat Drozdov, Eliete Guerra, Christos A Ouzounis, Saman Warnakulasuriya, Michael J Gleeson, Mark McGurk, Mahvash Tavassoli, Edward W Odell.
Abstract
The requirement for large amounts of good quality DNA for whole-genome applications prohibits their use for small, laser capture micro-dissected (LCM), and/or rare clinical samples, which are also often formalin-fixed and paraffin-embedded (FFPE). Whole-genome amplification of DNA from these samples could, potentially, overcome these limitations. However, little is known about the artefacts introduced by amplification of FFPE-derived DNA with regard to genotyping, and subsequent copy number and loss of heterozygosity (LOH) analyses. Using a ligation adaptor amplification method, we present data from a total of 22 Affymetrix SNP 6.0 experiments, using matched paired amplified and non-amplified DNA from 10 LCM FFPE normal and dysplastic oral epithelial tissues, and an internal method control. An average of 76.5% of SNPs were called in both matched amplified and non-amplified DNA samples, and concordance was a promising 82.4%. Paired analysis for copy number, LOH, and both combined, showed that copy number changes were reduced in amplified DNA, but were 99.5% concordant when detected, amplifications were the changes most likely to be 'missed', only 30% of non-amplified LOH changes were identified in amplified pairs, and when copy number and LOH are combined ∼50% of gene changes detected in the unamplified DNA were also detected in the amplified DNA and within these changes, 86.5% were concordant for both copy number and LOH status. However, there are also changes introduced as ∼20% of changes in the amplified DNA are not detected in the non-amplified DNA. An integrative network biology approach revealed that changes in amplified DNA of dysplastic oral epithelium localize to topologically critical regions of the human protein-protein interaction network, suggesting their functional implication in the pathobiology of this disease. Taken together, our results support the use of amplification of FFPE-derived DNA, provided sufficient samples are used to increase power and compensate for increased error rates.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21966361 PMCID: PMC3180289 DOI: 10.1371/journal.pone.0024503
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
SNP Call Rates as a percentage of the 131,429 SNPs remained after a 350 bp filter (n = 10).
| Call Rates(%) | Genomic DNA | Amplified DNA | ||||
| All | Homozygous | Heterozygous | All | Homozygous | Heterozygous | |
|
| 81.1–87.1 | 49.2–65.1 | 22.0–32.2 | 79.4–84.5 | 43.6–62.3 | 21.8–35.8 |
|
| 84.1 | 56.5 | 27.5 | 82.8 | 54.4 | 28.4 |
|
| 2.2 | 5.5 | 3.4 | 1.6 | 5.4 | 3.9 |
Genotype Distribution of the gDNA overall and the ‘No Calls’ in the amplified DNA (n = 10).
| Call Rates(%) | Genotypes in gDNA | Corresponding gDNA genotype of the No Call SNPs in amplified DNA | ||||
| AA | AB | BB | AA | AB | BB | |
|
| 30.5–37.9 | 26.6–39.5 | 29.9–36.8 | 27.1–35.8 | 25.0–43.9 | 28.6–38.3 |
|
| 33.9 | 32.9 | 33.2 | 31.7 | 35.6 | 32.7 |
|
| 2.5 | 4.8 | 2.4 | 3.4 | 7.1 | 3.8 |
SNP call concordance rates (n = 10).
| SNPs Called in both samples | Concordant SNPs | % Concordance | |
|
| 93905–105786 | 69135–95057 | 73.6–90.6 |
|
| 100599.2 | 83120.7 | 82.4 |
|
| 3732.2 | 8908.1 | 5.9 |
Paired copy number analysis - The number of gene changes and their concordance (n = 6).
| Number of Changes | Concordant Changes | ‘missed’ changes% of gDNA pair | ‘extra’ changes% of WGA pair | ||||||
| gDNA | WGA | in both | Number | % | Amplification | Deletion | Amplification | Deletion | |
|
| 1704–6429 | 1814–4897 | 836–3244 | 836–3220 | 98.8–100.0 | 2.3–69.3 | 5.0–48.7 | 1.71–26.17 | 10.4–52.2 |
|
| 4076.5 | 2777.8 | 1765.8 | 1755.3 | 99.5 | 34.9 | 15.4 | 10.2 | 26.2 |
|
| 2047.0 | 1195.9 | 811.9 | 804.0 | 0.4 | 27.7 | 16.5 | 9.0 | 13.8 |
Paired LOH analysis - The number of gene changes and their concordance (n = 6).
| Number of Changes | Concordant Changes | ‘missed’ LOH | ‘extra’ LOH | ||||
| gDNA | WGA | In both | Number | % | % of gDNA pair | % of WGA pair | |
|
| 77–15446 | 215–12624 | 15–10436 | 15–10436 | 100 | 32.4–96.0 | 17.3–97.5 |
|
| 3258.5 | 2725.8 | 1828.2 | 1828.2 | 100 | 73.2 | 74.5 |
|
| 6018.2 | 4861.8 | 4217.9 | 4219.9 | 0 | 22.2 | 29.0 |
Copy number and LOH overlap - The number of gene changes their concordance (n = 6).
| Number of Changes | Concordant Changes | ‘missed’ Overlap | ‘extra’ Overlap | ||||
| gDNA | WGA | In both | Number | % | % of gDNA pair | % of WGA pair | |
|
| 3857–16148 | 2490–13995 | 1553–12163 | 1435–9819 | 71.0–93.7 | 24.7–60.6 | 9.9–29.5 |
|
| 7426.7 | 5353.0 | 4380.8 | 3693.7 | 86.5 | 46.3 | 20.9 |
|
| 4710.3 | 4485.5 | 3997.9 | 3194.9 | 8.9 | 16.2 | 8.0 |
Figure 1Chromosomal distribution of ‘missed’ and ‘extra’ gene changes.
Showing the proportion of gene changes (from the copy number and LOH overlap analysis) identified as either ‘missed’ (white bars) or ‘extra’ (black bars) during the whole-genome amplification procedure and their chromosomal distribution (normalized to chromosome length). Graphs A–F show data from six different paired normal and dysplasia samples and compare the data generated using unamplfied (genomic) and amplified DNA. Proportion of changes is represented as a fraction with respect to all changes detected by a respective methodology.
Figure 2Automated PubMed search of 3112 genes with changes in oral dysplasia.
Of the 3112 genes, 1375 (44%) had at least one PubMed cited abstract, suggesting the experimental data reflected acceptable coverage of current molecular knowledge in oral dysplasia and carcinogenesis.
Figure 3Topological analysis of oral dysplasia associated genes.
3A) High confidence human protein-protein interaction (PPI) subnetwork consisting of 867 genes with changes in oral dysplasia and 1270 interactions. Node color corresponds to distinct gene communities identified by the Louvain method (see Methods). 3B–E) Comparison of topological properties of oral genes (n = 1551), oncogenes (n = 171), essential genes (n = 1331), and non-essential genes (n = 413) in the PPI network. *p<0.05, **p<0.001 vs. non-essential. MEAN±SEM.