| Literature DB >> 25054131 |
Abstract
We describe a new signature definition and analysis method to be used as biomarker for early cancer detection. Our new approach is based on the construction of a reference map of transcriptional signatures of both healthy and cancer affected individuals using circulating miRNA from a large number of subjects. Once such a map is available, the diagnosis for a new patient can be performed by observing the relative position on the map of his/her transcriptional signature. To demonstrate its efficacy for this specific application we report the results of the application of our method to published datasets of circulating miRNA, and we quantify its performance compared to current state-of-the-art methods. A number of additional features make this method an ideal candidate for large-scale use, for example, as a mass screening tool for early cancer detection or for at-home diagnostics. Specifically, our method is minimally invasive (because it works well with circulating miRNA), it is robust with respect to lab-to-lab protocol variability and batch effects (it requires that only the relative ranking of expression value of miRNA in a profile be accurate not their absolute values), and it is scalable to a large number of subjects. Finally we discuss the need for HPC capability in a widespread application of our or similar methods.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25054131 PMCID: PMC4087284 DOI: 10.1155/2014/192646
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Wiring diagram of our overall diagnostic signature method (a) and pseudocode of the core procedure (b).
Figure 2Maps for the two groups of subjects: (a) Caucasian subjects and (b) African American subjects. Each node in the graph represents a subject, whose transcriptional signature was derived from a profile of circulating miRNA. The length of an edge is approximately proportional to the inverse of the distance between two signatures as computed by the algorithm. The nodes spontaneously cluster in well-defined and easily identifiable control/disease groups, with only two misclassifications in the Caucasian subjects case. Node label legend: red = early breast cancer subject; green = control subject.
Figure 3Map of patients based on their serum miRNA profiles included in the Bianchi et al. dataset [13]. We applied our method using a signature length of 25 + 25 and the top 10% quantile as threshold for distances (N = 10%). Using a neighbor majority rule, the accuracy of the classification is 84%. Node color legend: green = healthy subjects; red = early stage subjects.