| Literature DB >> 27054083 |
Giancarlo Russo1, Andrea Patrignani1, Lucy Poveda1, Frederic Hoehn2, Bettina Scholtka3, Ralph Schlapbach1, Alex M Garvin2.
Abstract
Colorectal cancer (CRC) represents one of the most prevalent and lethal malignant neoplasms and every individual of age 50 and above should undergo regular CRC screening. Currently, the most effective preventive screening procedure to detect adenomatous polyps, the precursors to CRC, is colonoscopy. Since every colorectal cancer starts as a polyp, detecting all polyps and removing them is crucial. By exactly doing that, colonoscopy reduces CRC incidence by 80%, however it is an invasive procedure that might have unpleasant and, in rare occasions, dangerous side effects. Despite numerous efforts over the past two decades, a non-invasive screening method for the general population with detection rates for adenomas and CRC similar to that of colonoscopy has not yet been established. Recent advances in next generation sequencing technologies have yet to be successfully applied to this problem, because the detection of rare mutations has been hindered by the systematic biases due to sequencing context and the base calling quality of NGS. We present the first study that applies the high read accuracy and depth of single molecule, real time, circular consensus sequencing (SMRT-CCS) to the detection of mutations in stool DNA in order to provide a non-invasive, sensitive and accurate test for CRC. In stool DNA isolated from patients diagnosed with adenocarcinoma, we are able to detect mutations at frequencies below 0.5% with no false positives. This approach establishes a foundation for a non-invasive, highly sensitive assay to screen the population for CRC and the early stage adenomas that lead to CRC.Entities:
Keywords: APC gene; Colorectal cancer; High sensitivity; Single molecule sequencing; Stool DNA; Third generation sequencing
Year: 2015 PMID: 27054083 PMCID: PMC4803778 DOI: 10.1016/j.atg.2015.08.006
Source DB: PubMed Journal: Appl Transl Genom ISSN: 2212-0661
Supplementary Table 3Experiment 3 : description of the samples.
Experiment 1: mutations reported based on the sequencing run on the PacBio RS II.
| Segment | Relative position | RA | AA | Cov. WT | AA Freq. WT (%) | Cov. DLD1-M | AA Freq. DLD1-M (%) | P | Adj. P |
|---|---|---|---|---|---|---|---|---|---|
| 214 | G | A | 2095 | 0 | 3335 | 1.29 | 0 | ||
| 94 | G | − C | 3346 | 0.04 | 4012 | 0.51 | 2.30E-05 | ||
| 161 | C | T | 2485 | 0.04 | 2569 | 0.70 | 7.80E-05 | ||
| APC2 | 26 | A | + T | 3380 | 0 | 3753 | 0.2 | 0.0021 | 0.750 |
Note: (*) True positive. RA: Reference allele. AA: Alternative allele. WT: wild type. DLD1-M: DLD1 — mixed sample.
Fig. 1Simulated stool sample: comparison of three different sequencing platforms. With extremely high CCS-reads quality, the results obtained using PacBio are 100% specific and sensitive. On the contrary, both IonTorrent PGM and Illumina MiSeq report several false positive and miss to identify one variant (Supplementary Table 1, Supplementary Table 2).
Simulated stool sample: comparison of three different sequencing platforms. With extremely high CCS-reads quality, the results obtained using PacBio are 100% specific and sensitive. On the contrary, both IonTorrent PGM and Illumina MiSeq report several false positive and miss to identify one variant (Supplementary Tables 1 and 2).
Experiment 2: mutations found in the DNA samples extracted from 37 y/o patient diagnosed with adenocarcinoma.
| Germline mutation | Somatic mutation | |
|---|---|---|
| Gene | APC | TP53 |
| Codon | 1309 | 237 |
| Reference allele | T | G |
| Alternative allele (AA) | -AAAAG | A |
| Coverage in the wild type (WT) | 2022 | 1728 |
| AA frequency in the WT | 48.22% | 0.00% |
| Coverage in the patient sample | 1271 | 872 |
| AA frequency in the patient sample | 68.61% | 12.96% |
| P-value | 0 | 1.45E-56 |
| Adjusted P-value | 0 | 4.98E-54 |
Fig. 2Confirmation by Sanger sequencing of the germline, 5 bp deletion in the tumor sample from a 37 y/o patient diagnosed with adenocarcinoma. This mutation represents the most frequently mutated codon in the APC gene. The DNA samples extracted form both the tumor and the healthy, adjacent tissues were sequenced on one SMRT cell. By means of third generation sequencing, the deletion was identified with frequencies of 48.22% and 68.61% in the normal (A) and the tumor (B) tissues, respectively (P < 1e-256 in both cases, Fisher's exact test).
Fig. 3Confirmation by Sanger sequencing of the somatic point substitution in the tumor sample from a 37 y/o patient diagnosed with adenocarcinoma. This mutation represents the third most frequently mutated codon in the TP53 gene. The mutation in the tumor sample (B) is a heterozygous substitution which occurs at a frequency of 15.96% (highlighted in purple). Despite this rather low cellularity, the total absence of background noise allows for an easy detection of the green A signal in the Sanger sequencing track, which is instead clearly absent in the healthy tissue (A).
Experiment 3: mutations found in the DNA samples extracted from patient 1.
| Tumor | Stool | |
|---|---|---|
| Gene | APC6 | APC6 |
| Codon | 1309 | 1309 |
| Reference allele | T | T |
| Alternative allele (AA) | -AAAAG | -AAAAG |
| Coverage in the wild type (WT) | 4658 | 4658 |
| AA frequency in the WT (%) | 0.00 | 0.00 |
| Coverage in the patient sample | 3897 | 4550 |
| AA frequency in the patient sample (%) | 82.22 | 0.57 |
| P-value | 0 | 1.22E-08 |
| Adjusted P-value | 0 | 3.66E-06 |
Experiment 3: mutations found in the DNA samples extracted from patient 2.
| Tumor | Stool | ||
|---|---|---|---|
| Gene | APC | APC | APC |
| Codon | 1491 | 1491 | 1556 |
| Reference allele | A | A | G |
| Alternative allele (AA) | − T | − T | + A |
| Coverage in the wild type (WT) | 7372 | 7372 | 7359 |
| AA frequency in the WT (%) | 0.00 | 0.00 | 0.04 |
| Coverage in the patient sample | 8005 | 8005 | 7984 |
| AA frequency in the patient sample (%) | 19.05 | 0.37 | 1.32 |
| P-value | 0 | 3.04E-09 | 2.72E-26 |
| Adjusted P-value | 0 | 3.43E-07 | 8.16E-24 |