| Literature DB >> 31780584 |
Henry J Leese1, Thozhukat Sathyapalan2, Victoria Allgar3, Daniel R Brison4, Roger Sturmey5.
Abstract
Numerical data in biology and medicine are commonly presented as mean or median with error or confidence limits, to the exclusion of individual values. Analysis of our own and others' data indicates that this practice risks excluding 'Goldilocks' effects in which a biological variable falls within a range between 'too much' and 'too little' with a region between where its function is 'just right'; a concept captured by the Swedish term 'Lagom'. This was confirmed by a narrative search of the literature using the PubMed database, which revealed numerous relationships of biological and clinical phenomena of the Goldilocks/Lagom form including quantitative and qualitative examples from the health and social sciences. Some possible mechanisms underlying these phenomena are considered. We conclude that retrospective analysis of existing data will most likely reveal a vast number of such distributions to the benefit of medical understanding and clinical care and that a transparent approach of presenting each value within a dataset individually should be adopted to ensure a more complete evaluation of research studies in future. © Author(s) (or their employer(s)) 2019. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.Entities:
Keywords: data analysis and interpretation; data distributions; statistics & research methods
Mesh:
Substances:
Year: 2019 PMID: 31780584 PMCID: PMC6887037 DOI: 10.1136/bmjopen-2018-027767
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1An illustrative example of a comparison of cell lines is described, which shows that bar charts do not give the reader adequate information on the variability and distribution of each sampled ‘n’. The extent of activation of a receptor in three cell lines a, b and c under baseline (drug‐naïve) conditions and following the addition of a drug is given in arbitrary units. The same datasets are presented in three different ways: (A) bar chart, (B) grouped column scatter plot with means and error and (C) before–after scatter plot. n=10 (ie, biological replicates and not technical replicates). In this example, error bars represent the SEM although authors should consider the sampling size and distribution of ‘n’ when choosing the most appropriate way of showing experimental error (eg, SD or CI).