Literature DB >> 35800508

Ensconcing a biostatistics clinic in tertiary care research institute of India: A descriptive study.

Kishore Kamal1, Meenakshi Sharma2, Mahajan Rahul3, Kapoor Rakesh4, Vipin Koushal5.   

Abstract

Context: Data collection and statistical analysis are integral components of research. The beauty of statistics lies in its ability to evaluate evidence in the face of uncertainty. However, lack of dedicated biostatistical consultation units, rote academic teaching and training lead to poor statistical analysis. Thus, we aim to explore and understand the challenges of establishing a Biostatistics Clinic (BC) in a tertiary care research institute. A secondary aim is to identify the stage of research at which participants approach biostatisticians. Material and
Methods: The data for the current study came from a consultancy unit named as "Biostatistics Clinic" in the department of biostatistics from Post Graduate Institute of Medical Education and Research (PGIMER), Chandigarh, India. The residents and students who approached the department of biostatistics for consultancy regarding study design, sample size, statistical analysis and other analytical work were the sampling units.
Results: A total of 208 residents and students made 404 visits to biostatistics clinic. The male and female visits were 118 (56.7%) and 90 (43.5%), respectively. Majority of visitors approached for data analysis (171; 75%) followed by study design and sample size calculation for protocol preparation (43; 18.9%). Leading reference to approach the biostatistics clinic was through a faculty (91; 43.8%) followed by self (54; 26%).
Conclusion: Despite the thrust and apparent advantages of contacting statistician at the beginning of the study, majority approached only at the data analysis stage. Therefore, repeated and improved efforts are required to spread the message of approaching statistician early. Copyright:
© 2022 Journal of Family Medicine and Primary Care.

Entities:  

Keywords:  Analytical support clinic; biostatistical consulting; biostatistics clinic; statistical consultancy

Year:  2022        PMID: 35800508      PMCID: PMC9254788          DOI: 10.4103/jfmpc.jfmpc_1653_21

Source DB:  PubMed          Journal:  J Family Med Prim Care        ISSN: 2249-4863


Introduction

Data collection and statistical analysis are integral components of research. The individual, family, and community practitioner must decide between competing interventions based on evidence. The decision to provide the best patient outcome requires a robust data-based decision to segregate uncertainty from certainty. The beauty of statistics lies in its ability to evaluate evidence in the face of uncertainty.[12] However, its improper usage to obtain P value and confidence intervals clouds the reliability and validity of the studies. The unscientific research and data from the same hamper patient care. Additionally, biostatistics’ incomplete and incorrect usage has emerged as a significant reason for rejecting articles.[34567] The reasons for this could be the lack of rigorous statistical training, statistical collaborations, and the use of software without technical understanding among researchers.[89] Another reason can be consulting a statistician after the data collection instead of the planning stage. The literature recommends approaching a statistician at the beginning of the research.[101112] The replication crisis[1314] often occur due to not involving a statistician at the conceptualisation stage that resulted in suboptimal research practices such as small sample, p-hacking, and post hoc reasoning.[1516] Numerous studies emphasised the need and setting up of biostatistical consultations.[171819] Costing studies in the literature have suggested the worth of such consultation units.[202122] Many tertiary care institutes in developed countries have separate biostatistical consultation units. However, the biostatistics departments in India were established primarily for teaching purposes, and there are no particular consultation units with dedicated time for consultations. Hence Biostatistics Clinic (BC), a first of its kind, was established at a tertiary care institution in northern India in 2018. The primary objective of the current study is to explore and understand the challenges of establishing a BC at a tertiary care institution in India. A secondary aim is to identify the stage of research at which participants approach biostatisticians.

Methodology

The current study was conducted in the Post Graduate Institute of Medical Education and Research (PGIMER), a 2,200 bedded multi-speciality tertiary care research institute and “Institute of National Importance” situated in Chandigarh, India. The communication to initiate the biostatistics clinic was circulated offline (official letters to each department, oral communication) and online (social media, email, and message) modules to the resident doctors, faculty, and students for broader reach. Subsequently, emails, messages, and WhatsApp were used as reminders to utilise the biostatistics clinic service. The consultancy time was kept from 2:00 to 5:00 pm every working day to facilitate the clinicians. The timings of consultations were flexible and adjusted on individual requests. The beneficiaries were requested to officially email or call to fix an appointment to avoid inconvenience and time waste. Although participants were asked to visit departments, few consultations were given through WhatsApp, Skype, and emails. In general, we decided to keep half an hour for each consultation-the same was extended or reduced depending on the problems and requests for consultations. The participants’ visit record, such as the purpose of the visit, date, time, speciality, designation, mobile number, and email, was maintained.

Sample size

We did not formally calculate sample size as the current study is hypothesis-generating in nature rather than hypothesis confirmation.

Data analysis

The resident doctors and students approached the BC for consultancy regarding study design, sample size calculation, statistical analysis, and other analytical work were the units of data collection and analysis. The variables of interest were statistical consultation types (statistical analysis, sample size, study design), refer by whom (faculty, self, friends), stage of consultation (protocol conceptualisation, after data collection or all the stages), and time (before lunch, after lunch or after duty hours). Subsequently, a spreadsheet was used to enter data from the hard copy specifying the purpose of the visits. We used descriptive measures such as frequency (%), bar chart, and pie-chart to report study findings using MS excel® (Microsoft, Washington (US)).

Results

A total of 208 students made 404 visits to the biostatistics clinic [Table 1]. The male and females visits were 118 (56.7%) and 90 (43.5%), respectively. Leading reference to the biostatistics clinic was through a faculty (43.8%) followed by self (26%). More than 2/3rd (143) of the visitors approached once, followed by 2-4 times (24%) and ≥ five times (7.2%). An overwhelming majority of visits came for data analysis (75%) followed by study design and sample size calculation for protocol preparation (18.9%). The most frequent visitors were the junior residents (42.7%), followed by senior residents and doctorate students (18.8%). Majority (69.1%) of visitors approached post-lunch (2:00 – 5:00 pm) followed (13.6%) by before lunch (9:00 – 1:00 pm) and after (9.9%) office hours (5:00 – 8:00 pm) respectively. Most (90%) of the visitors called and fixed an appointment before visiting.
Table 1

Characteristics of Visitors to Biostatistics Clinic in a Tertiary Care Medical Research Institute

VariablesCategoriesCount (%)
GenderMale118 (56.7)
Female90 (43.3)
ReferenceFaculty91 (43.8)
Self54 (26.0)
Friend36 (17.3)
Not Specified23 (11.1)
Notice4 (1.8)
Consultation frequency1 Time143 (68.8)
2-4 Times50 (24.0)
≥ 5 Times15 (7.2)
Purpose* (n=228)Data Analysis171 (75.0)
Protocol Preparation43 (18.9)
Interpretation13 (5.7)
Software & Data Entry1 (0.4)
Designation# (n=213)Junior Residents91 (42.7)
PhD40 (18.8)
Senior Residents40 (18.8)
Research Staff24 (11.3)
Postgraduate18 (8.4)
Appointment^ (n=404)09:00-01:00 pm55 (13.6)
01:00-02:00 pm30 (7.4)
02:00-05:00 pm279 (69.1)
05:00-08:00 pm40 (9.9)

*Few residents and students came for more than one purpose. #Few Students designation changes during the study period. ^A total of 404 appointments was given to 208 residents and students

Characteristics of Visitors to Biostatistics Clinic in a Tertiary Care Medical Research Institute *Few residents and students came for more than one purpose. #Few Students designation changes during the study period. ^A total of 404 appointments was given to 208 residents and students We found that only 65 (31.3%) students out of 208 came for repeated consultations. However, only five (2.4%) students visited the clinic, from conceptualisation to data analysis. Residents and students flow to the clinic were regular in each month. However, there were spikes during the thesis and protocol submission months [Figure 1]. The thesis and protocol submission months in the institute are June and December. The participation by the clinical (39%) and non-clinical (34%) departments were almost the same [Figure 2]. Despite the availability of consultants and repeated reminders, most researchers (97.6%) do not follow the standard protocol of approaching at the planning stage. Moreover, numerous investigators (>90%) come for data analysis without a reference proposal. Table 2 summarises the challenges faced during consultations at the established clinic.
Figure 1

Distribution of participants visits on monthly basis

Figure 2

Specialisation wise Visits

Table 2

Challenges during biostatistics consultations and possible adverse outcomes

ChallengesPossible adverse outcomes
The absence of the free flow of discussionThe unidirectional discussion without a holistic overview of study characteristics may lead to Garbage (low powered studies, flawed study design, ill-conceived master sheet) In, Garbage (non-replicable studies, inappropriate statistical analysis, Incorrect results) Out (GIGO).
Datasets without objectivesAnalysis of datasets without a written document creates confusion in preparing master sheets, coding, data dictionary, identification of dependent and independent variables and leads to multiple testing for hunting statistically significant P.
Incomplete or Partial datasetsThe partial or incomplete dataset leads to repetitive preparation and coding of the same dataset, multiple testing, repetition of similar analysis on different occasions. It can blur the distinction between independent and dependent variables.
Last Minute ApproachThe emergency approach in analysis often leads to ignorance of vital intricacies of the study design, assumptions of the datasets. Further, it may lead to incorrect statistical analysis and interpretations of the data.
AuthorshipThe offer of authorship at data analysis stages often leads to data torturing to hunt for a P<0.05 or other statistically significant criteria. It eventually leads to inflation of Type-I error and is a substantial contributor to the replicability crisis.
Distribution of participants visits on monthly basis Specialisation wise Visits Challenges during biostatistics consultations and possible adverse outcomes

Discussion

The current study showed that, despite the emphasis on consultation during the planning stage, most visitors (75%) approached at analysis stage after data collection. This situation is known as “Data repairer.” A significant challenge was balancing study design, objectives and variables, sample size, and selecting appropriate statistical techniques. During the analysis stage, approaching students were the most fixated for specific analysis, sample size justification despite apparent flaws. Most people coming at this stage were restless to hunt for P < 0.05, alternatively known as “p-value hacking.” Many methodologists have raised the alarm against the perils of P value hacking.[232425] This stage frequently leads to a precarious situation of slicing and dicing the data by researchers without a significant P value (p < 0.05) from the data. It eventually inflates type-I error. The inflation of error leads to non-replication of study findings and has emerged as a significant threat off late.[26272829] The majority (43.8%) of the participants seeking biostatistical consultations were referred by faculty. Therefore, it is crucial to periodically disseminate the information about the biostatistics clinic among the faculty colleagues, at least in the beginning, till the system becomes self-sustained. The majority of the studies emphasised developing collaborations between researchers and biostatistics consultants.[171830] However, ours is perhaps the first study to empirically demonstrate the importance of collaborations. Self-reference (26%) indicated that participants were aware of the need for an expert opinion on biostatistics. It could be possible due to increasing emphasis by methodologists, prior exposure, felt need, or lack of expertise in statistical analysis. However, there was a spur in seeking consultations during protocol and thesis submission months. Such exigencies were managed with flexible timings and a checklist for discussion in our clinic. There were many challenges faced at the various stages of the biostatistics clinic. The timing and flexibility play a significant role in establishing the biostatistics clinic. Despite an emphasis on prior appointments to save time, many dropped in both morning and evening. Similarly, many of them were not punctual about time which increased waiting time. Moreover, many residents and students turned up with data and specific queries rather than protocols. Few analysed data and others from freelance researchers, but a lack of P < 0.05 (p-hacking) led them to visit our clinic. These were also the most difficult to handle as if they were living in the P value paradigm, and they will not achieve nirvana (salvation) if the P value does not turn out to be significant. They required the gentlest touch, persuasion, teaching the difference between clinical and statistical significance, the importance of negative trials, and the need to approach the statistician at the conceptual stage of the study. In the initial stages, many residents came with a pre mindset that statisticians enter the numbers in the computer and then computer do the analysis. Therefore, it is going to be a 5-minute job. This misapprehension guided them in contacting biostatistical consultancy 10-15 days before submitting protocols and theses. However, the single-visit approach to solving all the problems is flawed because it is difficult to understand the subtle intricacies of the studies and apply appropriate statistics in one go, despite, availability of computers. Kreft has succinctly summarised, “no piece of hardware can solve the challenging statistical issues underlying decisions about model specification”.[31] Many resident doctors and students reported P value without confidence intervals (CI). Moreover, the mean and standard deviation were standard reporting measures irrespective of the skewed nature of the data. In our clinic, this was an opportunity to educate researchers to carefully plan and sync “research question, study design, sample size, data collection and entry, analysis and display of results with appropriate descriptive and inferential measures” at the conceptual stage. Despite recommendations, individual, family, and community practitioners do not involve biostatisticians during study conceptualization. The studies without biostatisticians lead to many methodological flaws in data collection, analysis and interpretations that may adversely affect patient care.

Conclusion

Despite the thrust and apparent advantages of contacting statisticians at the beginning of the study, the majority approached only at the data analysis stage. Therefore, repeated and improved efforts are required to spread the message of consulting a statistician at the planning stage through BCs. Further, consultation units may be established at other institutions to strengthen and consolidate the research in the country. The strengthening of BCs and collaboration with biostatisticians will also enhance the best possible care for patients.

Circular

The lead author of this article officially started the biostatistics clinic on 10th May 2018 vide letter no. PGI/BIOSTAT/2018/87-143 after pilot testing.

Financial support and sponsorship

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Conflicts of interest

There are no conflicts of interest.
  25 in total

1.  Towards a curriculum for the consultant biostatistician: identification of central disciplines.

Authors:  H Tobi; D J Kuik; P D Bezemer; P Ket
Journal:  Stat Med       Date:  2001-12-30       Impact factor: 2.373

2.  What to Do When Your Paper Is Rejected.

Authors:  Gail M Sullivan
Journal:  J Grad Med Educ       Date:  2015-03

3.  Biostatistics Faculty and NIH Awards at U.S. Medical Schools.

Authors:  Guangxiang Zhang; John J Chen
Journal:  Am Stat       Date:  2015-02       Impact factor: 8.710

4.  Manuscript rejection: causes and remedies.

Authors:  Javed Ali
Journal:  J Young Pharm       Date:  2010-01

5.  Statistics and medicine: the indispensable know-how of the researcher.

Authors:  R Romano; E Gambale
Journal:  Transl Med UniSa       Date:  2013-01-04

6.  Why do you need a biostatistician?

Authors:  Antonia Zapf; Geraldine Rauch; Meinhard Kieser
Journal:  BMC Med Res Methodol       Date:  2020-02-05       Impact factor: 4.615

Review 7.  How many scientists fabricate and falsify research? A systematic review and meta-analysis of survey data.

Authors:  Daniele Fanelli
Journal:  PLoS One       Date:  2009-05-29       Impact factor: 3.240

8.  Advancing collaborations in health research and clinical trials in Sub-Saharan Africa: development and implementation of a biostatistical collaboration module in the Masters in Biostatistics Program at Stellenbosch University.

Authors:  Tonya M Esterhuizen; Guowei Li; Taryn Young; Jie Zeng; Rhoderick Machekano; Lehana Thabane
Journal:  Trials       Date:  2021-07-22       Impact factor: 2.279

9.  A Tutorial on Hunting Statistical Significance by Chasing N.

Authors:  Denes Szucs
Journal:  Front Psychol       Date:  2016-09-22

10.  P-hacking in clinical trials and how incentives shape the distribution of results across phases.

Authors:  Jérôme Adda; Christian Decker; Marco Ottaviani
Journal:  Proc Natl Acad Sci U S A       Date:  2020-06-02       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.