Literature DB >> 27832243

Evaluation of the Feasibility of Screening Patients for Early Signs of Lung Carcinoma in Web Search Logs.

Ryen W White1, Eric Horvitz1.   

Abstract

IMPORTANCE: A statistical model that predicts the appearance of strong evidence of a lung carcinoma diagnosis via analysis of large-scale anonymized logs of web search queries from millions of people across the United States.
OBJECTIVE: To evaluate the feasibility of screening patients at risk of lung carcinoma via analysis of signals from online search activity. DESIGN, SETTING, AND PARTICIPANTS: We identified people who issue special queries that provide strong evidence of a recent diagnosis of lung carcinoma. We then considered patterns of symptoms expressed as searches about concerning symptoms over several months prior to the appearance of the landmark web queries. We built statistical classifiers that predict the future appearance of landmark queries based on the search log signals. This was a retrospective log analysis of the online activity of millions of web searchers seeking health-related information online. Of web searchers who queried for symptoms related to lung carcinoma, some (n = 5443 of 4 813 985) later issued queries that provide strong evidence of recent clinical diagnosis of lung carcinoma and are regarded as positive cases in our analysis. Additional evidence on the reliability of these queries as representing clinical diagnoses is based on the significant increase in follow-on searches for treatments and medications for these searchers and on the correlation between lung carcinoma incidence rates and our log-based statistics. The remaining symptom searchers (n = 4 808 542) are regarded as negative cases. MAIN OUTCOMES AND MEASURES: Performance of the statistical model for early detection from online search behavior, for different lead times, different sets of signals, and different cohorts of searchers stratified by potential risk.
RESULTS: The statistical classifier predicting the future appearance of landmark web queries based on search log signals identified searchers who later input queries consistent with a lung carcinoma diagnosis, with a true-positive rate ranging from 3% to 57% for false-positive rates ranging from 0.00001 to 0.001, respectively. The methods can be used to identify people at highest risk up to a year in advance of the inferred diagnosis time. The 5 factors associated with the highest relative risk (RR) were evidence of family history (RR = 7.548; 95% CI, 3.937-14.470), age (RR = 3.558; 95% CI, 3.357-3.772), radon (RR = 2.529; 95% CI, 1.137-5.624), primary location (RR = 2.463; 95% CI, 1.364-4.446), and occupation (RR = 1.969; 95% CI, 1.143-3.391). Evidence of smoking (RR = 1.646; 95% CI, 1.032-2.260) was important but not top-ranked, which was due to the difficulty of identifying smoking history from search terms. CONCLUSIONS AND RELEVANCE: Pattern recognition based on data drawn from large-scale web search queries holds opportunity for identifying risk factors and frames new directions with early detection of lung carcinoma.

Entities:  

Mesh:

Year:  2017        PMID: 27832243     DOI: 10.1001/jamaoncol.2016.4911

Source DB:  PubMed          Journal:  JAMA Oncol        ISSN: 2374-2437            Impact factor:   31.777


  13 in total

1.  The Premise and Promise of Big Data for Tracking Population Health: Big Deal or Big Disappointment?

Authors:  Emad Mansoor; Sadeer G Al-Kindi
Journal:  Dig Dis Sci       Date:  2017-03       Impact factor: 3.199

2.  Learning to Identify Rare Disease Patients from Electronic Health Records.

Authors:  Rich Colbaugh; Kristin Glass; Christopher Rudolf; Mike Tremblay Volv Global Lausanne Switzerland
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

Review 3.  Innovations in research and clinical care using patient-generated health data.

Authors:  Heather S L Jim; Aasha I Hoogland; Naomi C Brownstein; Anna Barata; Adam P Dicker; Hans Knoop; Brian D Gonzalez; Randa Perkins; Dana Rollison; Scott M Gilbert; Ronica Nanda; Anders Berglund; Ross Mitchell; Peter A S Johnstone
Journal:  CA Cancer J Clin       Date:  2020-04-20       Impact factor: 508.702

Review 4.  How Machine Learning Will Transform Biomedicine.

Authors:  Jeremy Goecks; Vahid Jalili; Laura M Heiser; Joe W Gray
Journal:  Cell       Date:  2020-04-02       Impact factor: 41.582

Review 5.  Artificial intelligence for clinical oncology.

Authors:  Benjamin H Kann; Ahmed Hosny; Hugo J W L Aerts
Journal:  Cancer Cell       Date:  2021-04-29       Impact factor: 38.585

6.  The Role of Web-Based Health Information in Help-Seeking Behavior Prior to a Diagnosis of Lung Cancer: A Mixed-Methods Study.

Authors:  Julia Mueller; Caroline Jay; Simon Harper; Chris Todd
Journal:  J Med Internet Res       Date:  2017-06-08       Impact factor: 5.428

7.  Detecting Impending Stroke From Cognitive Traits Evident in Internet Searches: Analysis of Archival Data.

Authors:  Sigal Shaklai; Ran Gilad-Bachrach; Elad Yom-Tov; Naftali Stern
Journal:  J Med Internet Res       Date:  2021-05-28       Impact factor: 5.428

8.  Inducing Behavioral Change in Seekers of Pro-Anorexia Content Using Internet Advertisements: Randomized Controlled Trial.

Authors:  Elad Yom-Tov; Anat Brunstein-Klomek; Or Mandel; Arie Hadas; Silvana Fennig
Journal:  JMIR Ment Health       Date:  2018-02-22

9.  Utilizing Machine Learning on Internet Search Activity to Support the Diagnostic Process and Relapse Detection in Young Individuals With Early Psychosis: Feasibility Study.

Authors:  Michael Leo Birnbaum; Prathamesh Param Kulkarni; Anna Van Meter; Victor Chen; Asra F Rizvi; Elizabeth Arenare; Munmun De Choudhury; John M Kane
Journal:  JMIR Ment Health       Date:  2020-09-01

10.  Population-scale hand tremor analysis via anonymized mouse cursor signals.

Authors:  Ryen W White; Eric Horvitz
Journal:  NPJ Digit Med       Date:  2019-09-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.