Literature DB >> 35860050

Terminal digit preference and the accuracy of breast cancer diameter reporting based on Benford's law.

Shahzaib Ahmad1, Amber Latif2, Mehwish Mehmood3, Ramisha Aslam1, Zain Ul Abiddin1, Hassan Mumtaz4,5, Khadija Ahmed6, Waqas Mehdi1,7, Waheeda Begum8.   

Abstract

Background: Breast cancer is the most frequent cancer in women all over the world, and it is one of the leading causes of cancer-related deaths in women. A pathologist's partiality for the last digit of a patient's name can lead to errors in the measurement of malignancies. This means that, rather than recording the exact measurement of a tumor, a pathologist might round it off to his preferred terminal digit.
Methods: It is a retrospective cross-sectional study in which data on primary tumor resection for 1000 breast cancer patients was obtained from KRL Hospital's patient directory from November 2016 to December 2020. The tumors were measured in cm to one decimal point along their longest dimension. Ki-67 markers were used to categorize the tumors into nine categories. Terminal digit preference was evaluated using Benford's law.
Results: The recording of the Ki-67 index revealed evidence of pentameric preference. The numbers three, five, and six appeared more frequently in the histogram of the Ki-67 index distribution measured in percentage. The frequency of nine dropped dramatically. However, the influence of tumor size terminal digits on Ki-67 staining scores (low proliferative vs high proliferative) assessed using the Mann-Whitney U Test demonstrated that tumor size terminal digits had no significant effect on Ki-67 staining scores (p = 0.114).
Conclusion: The Ki-67 index shows evidence of pentameric preference for digits three, five, and six. The frequency of nine has dropped dramatically. The influence of tumor size on terminal digits on staining scores (low proliferative vs. high proliferative) was assessed using the Mann-Whitney U Test.
© 2022 The Authors.

Entities:  

Year:  2022        PMID: 35860050      PMCID: PMC9289319          DOI: 10.1016/j.amsu.2022.103993

Source DB:  PubMed          Journal:  Ann Med Surg (Lond)        ISSN: 2049-0801


Introduction

Breast cancer represents the most prevalent form of cancer in women all over the world, making it one of the leading causes of cancer-related deaths in women [1]. However, considering recent advances in the field of research, the prognosis and treatment plan for breast cancer have changed dramatically. Hence, physicians must choose from a wide variety of treatment options by the condition of the patient. TNM staging is often used to classify a breast tumor, which in turn determines the management of the patient [3]. The TNM staging system is based on the size of the tumor and its extent of dissemination along with other factors [4]. Hence, the size of a tumor as well as its cytology have a major impact on its staging and subsequently guide the physician in its management. However, errors can be made during the measurement of these tumors. One of these errors is the terminal digit preference of the pathologist. This means that instead of recording the exact measurement of a tumor, a pathologist might round off the measurement to his preferred terminal digit, usually 0 or 5. This error has been documented in previous studies for different kinds of carcinomas [5]. In the case of breast carcinoma, however, more research is required. The analysis of breast cancer 0eas4re0ents established terminal digital preference utilizing histograms in a previous investigation [6]. This terminal digital preference can also occur during a sample's Ki-67 indexing. The Ki-67 index assesses the expression of the Ki-67 marker on cancer cells, which indicates their proliferative ability [7]. This study aims to evaluate terminal digital preference among pathologists using Benford's Law. Benford's Law has previously been used to evaluate COVID-19 data [8,9] and other fabrications in the data [10]. It states that in a large dataset containing naturally recorded measurements, the incidence of digits 1 to 9 is not distributed equally. Instead, the distribution of these digits follows a counter-intuitive trend in which smaller digits occur more frequently than larger ones [11]. As a result, rather than following the normal frequency distribution, in which each digit occurs at a frequency of 11.11%, the digit "1″ occurs at 30.1%, "2″ at 17.6%, "3″ at 12.5%, and so on [12]. By comparing the frequencies of different digits (especially 0 and 5) with the expected frequencies according to Benford's Law, we can establish if terminal digit preference occurs in breast cancer size measurements and the Ki-67 index.

Materials and methods

It is a retrospective cross-sectional study. All the recorded data was obtained from the patient directory of KRL Hospital. At KRL Hospital, primary tumor resection for breast cancer patients was done in 1000 breast cancer patients from November 2016 to December 2020. Data related to these specimens were recorded and later interpreted by a histopathologist. The calculated sample size for the study was 1000 breast cancer specimens by the WHO sample size calculator. Hence, the sizes and Ki-67 indexes of 1000 breast carcinoma tumors were included in the study. Fine needle aspiration tests and benign breast lesions were not included in the study. The expected frequencies of different digits in the data set were calculated according to Benford's Law via the mathematical equation:F(d) = log (1 + 1/d)where F is the frequency and d are the digits in question [[1], [2], [3], [4], [5], [6], [7], [8], [9]]. CAP protocols were followed in all the specimens included in the dataset. The recorded specimens were obtained during breast conservative procedures and modified radical mastectomies. The tumors were measured along their greatest dimension and recorded in cm to one decimal point. The tumor diameter was calculated from microscopic slides by measuring the outermost borders of the invasive lesion and measuring to the nearest millimeter using a transparent ruler. If no slides were available, the measurement was based on a macroscopic inspection (formalin-fixed material), either of a single tissue slice or as an average of all tissue slices containing microscopically validated invasive tumor tissue. The study proposal was approved by the ethical review committee of KRL Hospital. Our study is fully compliant with the STROCSS 2021 guidelines [13]. A complete STROCSS 2021 checklist has been provided as a supplementary file. Our study has been registered on Research Registry with the following UIN: researchregistry7969 [14]. Our study is by the Declaration of Helsinki. The tumors were also stained for Ki-67 markers and were divided into nine groups depending upon the percentage of staining, i.e., (11–29%, 20–29%, 30–39%, 40–49%, 50–59%, 60–69%, 70–79%, 80–89%, 90–99%). A Pearson's chi-square goodness-of-fit test was performed. The null hypothesis is that the observed distribution of digits in the data set for breast tumor size and Ki-67 index follows Benford's law; hence a distribution having a p-value > 0.05 is considered to adhere to Benford's distribution. Mann–Whitney The U test was used to compare the calculated frequencies of digits 1 to 9 with the expected frequencies calculated according to Benford's Law.

Results

Data from 1000 patients with primary malignant breast tumors was retrieved from KRL Hospital and analyzed. We found no pentameric preference for histopathological measurement of tumor size. On computing the last digit frequency for histopathological measurements of tumor size recorded in centimeters to one decimal place, the histogram displayed a higher frequency of digits three and nine. There was a decrease in the frequency around the peak of digit nine which correlates to digits seven, eight, and one. The evidence of pentameric preference was found in the recording of the Ki-67 index. The histogram of the distribution of the Ki-67 index recorded in percentage revealed that the digits three, five, and six were in higher frequency. There was a marked drop in the frequency of digit nine. The rest of the terminal digits, although differing in frequency from that predicted by Benford's law, somewhat followed Benford's distribution curve, as shown in Fig. I.
Fig. 1

Benford's law breakdown of KI-67%..

Benford's law breakdown of KI-67%.. The effect of terminal digits of tumor size on Ki-67 staining scores (low proliferative vs high proliferative) was evaluated using Mann – Whitney U Test. The test revealed the insignificant effect of terminal digits of tumor size on the Ki-67 staining scores (p = 0.114), as shown in Fig. II.
Fig. 2

Benford's distribution curve.

Benford's distribution curve.

Discussion

It has not been thoroughly researched. Our study using 1000 specimens of primary malignant breast tumors collected from November 2016 to December 2020 at KRL Hospital aimed to assessment of terminal digit preference in breast tumor size and Ki–67 indices using Benford's law. Studies have shown that terminal digit preference occurs in pathological reporting of malignant tumors, including colorectal adenocarcinomas and breast tumors [5,6]. This type of preference can affect T size categorization and thus have serious implications in patient management and prognosis, particularly in tumor types where macroscopic tumor size affects staging, such as breast and thyroid tumors. The histograms showing the distribution of the first digits of the largest diameter of tumor size and that of Ki-67% showed that some digits were reported at a higher frequency than others. The preferred terminal digits were 3 and 9 in the reporting of histopathological breast tumor size and 3, 5, and 6 in Ki-67 staining scores. The result of digit preference in tumor diameter is inconsistent with the previous nationwide study done in Norway, in which there was clear evidence of terminal digit preference for 0 and 5 (pentameric preference) in the measurements of mammographic and histopathologic tumor diameter [6]. The increased frequency of digits 3 and 9 in tumor size measurement found in our study does not affect T category classification according to the TNM system as 30 mm and 90 mm values do not lie at the border values. For example, a tumor diameter of 28 mm rounded off to 30 mm due to a digit preference of 3 would fall into the T2 category either way. Similarly, tumor diameters falling in the range of 80 mm–89 mm rounded off to 90 mm owing to digit preference would be classified as T3. We can say that histopathologists adopted a practical approach by avoiding T category border values, a finding consistent with a recent Dutch study [15]. The Ki-67% score is defined as the percentage of positively stained cancer cells among the total number of malignant cells evaluated. It is used as a prognostic marker in breast cancer. A higher Ki-67% score shows that the tumor is aggressive, more likely to spread, and is associated with a worse prognosis. In our study, there was a preference in the reporting of Ki-67 staining scores for digits 3, 5, and 6. As there is no common consensus on a cut-off value of Ki-67, the effects of terminal digit preference in the reporting of these scores remain unpredictable and need to be investigated further. The Mann–Whitney U test was used to assess the effect of tumor size on Ki-67 staining scores, with 10% set as the cut-off value for low and high Ki-67 scores. It has been reported that a higher Ki-67 index is correlated with increasing tumor size [16]. However, in our study, the relationship between tumor size and Ki-67 values was found to be insignificant. Using immunohistochemistry, researchers from Anyang Tumor Hospital have found evidence that distinguishing between TNBC subtypes with varying degrees of aggressiveness and prognosis can be done using p53 and ki-67 proteins. The prognosis is significantly worse for patients with high p53 or Ki-67 indices or a history of cancer in the family. Prognostic markers in TNBC include p53, Ki-67, and family history [17].

Limitations

The limitation of our study is that it included data from a single hospital with small sample size. The study did not include information about individual histopathologists who performed the measurements, and it was not possible to study intra-observer or inter-observer trends. With no knowledge of the specific cases affected by terminal digit preference, it is not possible to postulate the clinical implications of the measurement error. The current TNM staging guidelines do not consider terminal digit preference. More explicit guidelines are required to minimize errors in the pathological reporting of malignant tumors. To make our findings generally applicable, there should be uniformity in using the cut-off value for Ki-67. Further studies are required to probe the consequences resulting from this source of measurement error.

Strengths

It is essential to measure the impact of terminal digit preference especially when treatment guidelines can change the course of management. This study quantifies the impact of this covert tendency to alter the exact dimensions of the tumors. Hence, this study is an approach to unveiling the impact of terminal digit preference.

Conclusion

The recording of the Ki-67 index revealed evidence of pentameric preference. The numbers three, five, and six appeared more frequently in the histogram of the Ki-67 index distribution measured in percentage. The frequency of nine has dropped dramatically. The influence of tumor size terminal digits on Ki-67 staining scores (low proliferative vs. high proliferative) was assessed using the Mann–Whitney U test, which demonstrated that tumor size terminal digits did not affect Ki-67 staining scores.

Ethical approval

The ethical approval was obtained from ethical review committee of Shifa International Hospital Islamabad. Reference number: 09-ERC/17/10/2015.

Sources of funding

No funding received.

Author contribution

All the authors contributed to the outlines of the research proposal, data collection, Analysis, writing, editing, proofing, and final approval of the paper.

Availability of data and material

Yes.

Registration of research studies

Name of the registry: Not Applicable. Unique Identifying number or registration ID: Not Applicable. Hyperlink to your specific registration (must be publicly accessible and will be checked):

Consent

Not Applicable.

Guarantor

Hassan Mumtaz.

Provenance and peer review

Not commissioned, externally peer-reviewed.

Declaration of competing interest

None to disclose.
  14 in total

1.  Terminal digit preference: beware of Benford's law.

Authors:  T W Beer
Journal:  J Clin Pathol       Date:  2009-02       Impact factor: 3.411

2.  Pentameric last-digit preference and stage border avoidance in pathology measurement.

Authors:  Michael A den Bakker; Ronald A M Damhuis
Journal:  Histopathology       Date:  2018-06-21       Impact factor: 5.087

3.  Terminal digit preference occurs in pathology reporting irrespective of patient management implication.

Authors:  S J Hayes
Journal:  J Clin Pathol       Date:  2008-09       Impact factor: 3.411

Review 4.  Treatment of breast cancer.

Authors:  Karen L Maughan; Mark A Lutterbie; Peter S Ham
Journal:  Am Fam Physician       Date:  2010-06-01       Impact factor: 3.292

5.  Terminal digit preference: a source of measurement error in breast cancer diameter reporting.

Authors:  Kaitlyn M Tsuruda; Solveig Hofvind; Lars A Akslen; Solveig R Hoff; Marit B Veierød
Journal:  Acta Oncol       Date:  2019-09-30       Impact factor: 4.089

6.  STROCSS 2021: Strengthening the reporting of cohort, cross-sectional and case-control studies in surgery.

Authors:  Ginimol Mathew; Riaz Agha
Journal:  Int J Surg       Date:  2021-11-11       Impact factor: 6.071

Review 7.  The new TNM-based staging of breast cancer.

Authors:  Gábor Cserni; Ewa Chmielik; Bálint Cserni; Tibor Tot
Journal:  Virchows Arch       Date:  2018-01-27       Impact factor: 4.064

8.  COVID-19 deaths in the USA: Benford's law and under-reporting.

Authors:  Michele Campolieti
Journal:  J Public Health (Oxf)       Date:  2022-06-27       Impact factor: 5.058

9.  On the authenticity of COVID-19 case figures.

Authors:  Adrian Patrick Kennedy; Sheung Chi Phillip Yam
Journal:  PLoS One       Date:  2020-12-08       Impact factor: 3.240

10.  Association of Ki-67 With Clinicopathological Factors in Breast Cancer.

Authors:  Jitendra S Nigam; Tarun Kumar; Shreekant Bharti; Ruchi Sinha; Punam P Bhadani
Journal:  Cureus       Date:  2021-06-13
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.