Literature DB >> 31983179

An Overview of Lung and Breast Cancer Using the National Cancer Database.

Leigh Deshotels1, Gerard Chaaya2, Takefumi Komiya3.   

Abstract

INTRODUCTION: The National Cancer Database (NCDB) is a clinical oncology database utilized by many researchers and clinicians internationally. We sought to investigate the various trends in data of two of the most common cancers, breast and lung, published using the NCDB.
MATERIALS AND METHODS: We selected a multitude of pre-determined variables for analysis. We then performed two separate literature searches using an advanced PubMed search builder, and the data was combined to determine each variables' association with journal impact factor (IF) using both univariate and multivariate analyses.
RESULTS: A total of 191 published studies were identified. We found that a journal IF > 5 was associated with a publication year prior to 2017 (univariate analysis OR 2.68, 95% CI 1.38-5.21, p-value 0.004 and multivariate analysis OR 3.47, 95% CI 1.62-7.42, p-value 0.001) and a sample size > 10,000 (univariate analysis OR 3.27, 95% CI 1.43-7.50, p-value 0.005 and multivariate analysis OR 4.68, 95% CI 1.89-11.6, p-value 0.0008). Variables such as number of authors, region, cancer type, stage, treatment outcome and treatment incidence were not significant for an association with an IF >5.
CONCLUSION: Based on our data, studies published after 2017 using the NCDB were associated with a lower IF. This could suggest that the quality of the NCDB data may be declining over time, or NCDB is becoming more widely used.<br />.

Entities:  

Keywords:  Lung cancer; National Cancer Database; breast cancer; national registry data

Year:  2020        PMID: 31983179      PMCID: PMC7294020          DOI: 10.31557/APJCP.2020.21.1.163

Source DB:  PubMed          Journal:  Asian Pac J Cancer Prev        ISSN: 1513-7368


Introduction

National registry databases are an important component of clinical research as they are heavily relied upon by clinicians and researchers internationally. One of those is the National Cancer Database (NCDB), a joint program of the Commission on Cancer (CoC) of the American College of Surgeons and the American Cancer Society, which uses hospital registry data to track patients with malignant oncologic conditions. (Boffa et al., 2017). It was first established in 1988, and since then, it has incorporated data from more than 34 million historical patient records which covers more than 70% of newly diagnosed cancer cases nationwide (Boffa et al., 2017). This data is used to further oncologic treatment and patient care as well as explore trends in cancer care and serve as quality improvement. The NCDB collects data on all types of cancer. Data elements are collected and submitted to the NCDB using a number of web-based data applications developed by the CoC. These hospitals that submit the data must be a designated CoC-accredited hospital. This could limit generalizability as safety-net hospitals are underrepresented, and COC-accredited hospitals only represent about one-third of hospitals nationally (Jagsi et al., 2014). Also, NCDB data is not population-based as is another popular registry database Surveillance, Epidemiology, and End Results Program or SEER (Mohanty and Bilimoria, 2014). NCDB does, however, incorporate various forms of data such as treatment plans, comorbidities, insurance status and overall survival. We sought to investigate the various trends in data published using the NCDB, and more importantly, the quality of this data as this has only been evaluated in a limited manner. We chose to focus on data surrounding two of the most common types of cancers, breast and lung.

Materials and Methods

We first selected a number of pre-determined variables that we intended to analyze. We then performed two separate literature searches in March 2019 using an advanced PubMed search builder: one using “breast cancer” and “national cancer database” and another using “lung cancer” and “national cancer database.” Included articles for analysis were those that were published in English and those that used lung and breast cancer cases derived from NCDB. The following variables were extracted from each article: number of authors (< 5 or > 5), year of publication (< 2017 or > 2017), sample size (< 10,000 or > 10,000), region (US or non-US), cancer type (lung or breast), stage (excluded stage IV or included stage IV), treatment modality incidence (surgery, chemotherapy, radiation therapy) and treatment outcome (survival and mortality). We then searched for each journals’ most recent impact factor (IF) listed on their website. IF was divided into two groups, IF < 5 and IF > 5. All data points were charted in an excel document, and a chi-square analysis was performed looking at each variable and its association with IF. A p-value <0.05 was considered as statistically significant. This study was not considered as human subject research that requires Institutional Review Board (IRB) approval.

Results

There were a total of 191 combined breast and lung cancer studies that met our criteria for analysis. According to Figure I, the number of publications steadily increased over time after 2013 with the exception of an isolated downtrend of lung cancer publications in 2016. The journal with the most publications overall was Annals of Surgical Oncology, and the institution responsible for producing the most publications overall was Yale University School of Medicine in New Haven, Connecticut (Table 1).
Figure 1

Trends in the Number of Publications Over the Years

Table 1

Characteristics of NCDB Studies Published in PubMed

CharacteristicsYear of PublicationNTop JournalsRankNameIF
<2017621Lancet Oncology36.148
>20171292J Clin Oncol26.303
Sample size3JAMA Oncol20.871
<10,000614Ann Oncol13.93
>10,0001305J Natl Cancer Inst11.238
No. of authors
≤552Top InstitutionsRankNameN
>51391Yale Univ36
Region2Washington Univ19
Non-US263MD Anderson18
US1654U of Chicago15
Cancer type5U of Penn9
Lung1095Mayo Clinic9
Breast82
Stage
including IV40
excluding IV151
Treatment outcome
Yes141
No50
Treatment incidence
Yes36
No155
Impact factor
<5141
>550

IF, impact factor; N, number

A journal impact factor > 5 had a significant association with a publication year prior to 2017 (univariate analysis OR 2.68, 95% CI 1.38-5.21, p-value 0.004 and multivariate analysis OR 3.47, 95% CI 1.62-7.42, p-value 0.001) and a sample size > 10,000 (univariate analysis OR 3.27, 95% CI 1.43-7.50, p-value 0.005 and multivariate analysis OR 4.68, 95% CI 1.89-11.6, p-value 0.0008) (Table 2 and Figure 3). Variables such as number of authors, region, cancer type, stage, treatment outcome and treatment incidence were not significant for an association with an impact factor >5.
Table 2

Univariate and Multivariate Analyses for Association with Impact Factors Higher than 5.0.

VariableUnivariate analysis
Multivariate analysis
OR95% CIP-valueOR95% CIP-value
Year (<2017 vs. 2017≤)2.681.38-5.210.0043.471.62-7.420.001
Sample size (10,000< vs. <10,000)3.271.43-7.500.0054.681.89-11.60.0008
No. of authors (5< vs. ≤5)1.510.70-3.220.2911.560.66-3.650.306
Region (non-US vs. US)1.260.51-3.110.6152.240.78-6.410.133
Cancer type (lung vs. breast)1.730.89-3.390.1081.770.83-3.780.138
Stage (including IV vs. excluding IV)1.050.48-2.300.8981.160.45-3.000.765
Treatment outcome (Yes vs. No)2.301.00-5.320.0512.380.47-12.00.292
Treatment incidence (No vs. Yes)2.620.96-7.150.0611.630.27-9.750.592

OR, odds ratio; CI, confidence interval

Figure 3

Top Ten Journals with the Most Publications

Characteristics of NCDB Studies Published in PubMed IF, impact factor; N, number Univariate and Multivariate Analyses for Association with Impact Factors Higher than 5.0. OR, odds ratio; CI, confidence interval Trends in the Number of Publications Over the Years Year and Sample Size and Their Association with Impact Factor Top Ten Journals with the Most Publications

Discussion

Based on our data, studies published after 2017 using the NCDB were associated with a lower impact factor. This is despite the fact that the number of publications using NCDB has been increasing over time. This could suggest that either the quality of the NCDB data is declining over time, or the NCDB is becoming more widely used given the increase in publications since 2013. We reported a similar outcome for lung cancer data using the SEER-Medicare database discussed below. We found a significant association between early publication year and high impact factor (Komiya et al., 2018). This was thought to be explained by increased utilization of the SEER database amongst lower impact journals. We expected that smaller studies were to be underrepresented in journals with an impact factor > 5 given the fact that higher impact factor journals are more likely to publish large data analysis. It, however, may also indicate that the use of the NCDB as a research tool has become more common amongst oncology researchers, and that its impact in the oncology society may be decreasing. The majority of journals with the highest impact factor were not among the top journals with the most publications as seen when comparing Table I to Figure II. It appears that only JAMA Oncology, which has an impact factor of 20.871, is represented amongst the top ten journals with the most publications. This is despite the fact that these top ten journals represent 61% of the total articles analyzed. This observation was also noted (Su et al., 2018) which analyzed characteristics of all types of cancers published using the NCDB. In their study, around 20% of NCDB papers analyzed by the group were published in journals with IF > 10 (Su et al., 2018). It was expected to see less publications in the highest IF journals, but again, our data is showing a limited acceptance of papers using NCDB data when submitted to the most prominent journals.
Figure 2

Year and Sample Size and Their Association with Impact Factor

We also observed that there was a decline in the number of lung cancer publications in 2016 followed by a sharp increase in 2017. The theory behind this is unclear. Lung cancer researchers might have become more aware of NCDB around this time. Yale University School of Medicine produced the most number of publications overall. Publications affiliated with the university made up almost 20% of the total publications analyzed. This is followed by Washington University in St. Louis. According to Table I, it appears most of the top academic institutions are utilizing NCDB frequently. Again, it is important to highlight the distribution and quality of data that comes from large registry databases as they are the cornerstone for oncologic research. Retrospective studies rely heavily on national registry databases, especially the NCDB as it is one of the largest in the nation. As we mentioned earlier, SEER is its main competitor; however, there are major differences between the two. The NCDB allows for cancer surveillance and monitoring outcomes whereas SEER is epidemiological in that it is utilized mainly for measuring cancer incidence (Mohanty and Bilimoria, 2014). However, an additional program, SEER-Medicare, more closely resembles the NCDB in that it provides a link between patient information provided in the Medicare database and SEER database which allows for some monitoring of outcomes (Komiya et al., 2018). One major strength of NCDB compared to SEER is that it allows for quality improvement. Since the NCDB reports cancer outcomes, hospitals are able to use this data to measure their performance compared to the other 1500 CoC-accredited hospitals (Mohanty and Bilimoria, 2014). This is not possible with the SEER database. We realize the limitation of our analysis. Publications must have been written in English and listed in PubMed. This search method may overlook studies published in non-English or journals not listed in PubMed. Also the cut-off value of IF of 5 was arbitrarily set because its close to a mean number of 4.01. IF has been known to fluctuate over the years, and it is influenced by all the articles and journals in the research field at the time of publication. In addition, our study only analyzed lung and breast cancer publications. Nevertheless, it is widely used and is one of the few tools to judge quality of the journals at this time. In conclusion, our study found that recent studies using NCDB for lung and breast cancers have a trend of decreasing IF over the last few years. It suggests its becoming a more widely used tool, or its impact in oncology field is deteriorating. As a convenient tool in retrospective oncology research, NCDB will still remain valuable for academic researchers.
  5 in total

1.  Overview of publications on lung cancer using the SEER database.

Authors:  Takefumi Komiya; Achuta K Guddati; Gerard Chaaya
Journal:  Respir Investig       Date:  2018-07-25

Review 2.  Using the National Cancer Database for Outcomes Research: A Review.

Authors:  Daniel J Boffa; Joshua E Rosen; Katherine Mallin; Ashley Loomis; Greer Gay; Bryan Palis; Kathleen Thoburn; Donna Gress; Daniel P McKellar; Lawrence N Shulman; Matthew A Facktor; David P Winchester
Journal:  JAMA Oncol       Date:  2017-12-01       Impact factor: 31.777

3.  Comparing national cancer registries: The National Cancer Data Base (NCDB) and the Surveillance, Epidemiology, and End Results (SEER) program.

Authors:  Sanjay Mohanty; Karl Y Bilimoria
Journal:  J Surg Oncol       Date:  2014-01-25       Impact factor: 3.454

Review 4.  Considerations for observational research using large data sets in radiation oncology.

Authors:  Reshma Jagsi; Justin E Bekelman; Aileen Chen; Ronald C Chen; Karen Hoffman; Ya-Chen Tina Shih; Benjamin D Smith; James B Yu
Journal:  Int J Radiat Oncol Biol Phys       Date:  2014-09-01       Impact factor: 7.038

Review 5.  Publication trend, resource utilization, and impact of the US National Cancer Database: A systematic review.

Authors:  Chang Su; Cuiying Peng; Ena Agbodza; Harrison X Bai; Yuqian Huang; Giorgos Karakousis; Paul J Zhang; Zishu Zhang
Journal:  Medicine (Baltimore)       Date:  2018-03       Impact factor: 1.889

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.