| Literature DB >> 26539290 |
Abstract
The opportunities for both subtle and profound errors in software and data management are boundless, yet they remain surprisingly underappreciated. Here I estimate that any reported scientific result could very well be wrong if data have passed through a computer, and that these errors may remain largely undetected. It is therefore necessary to greatly expand our efforts to validate scientific software and computed results.Entities:
Keywords: data management; software error
Year: 2014 PMID: 26539290 PMCID: PMC4629271 DOI: 10.12688/f1000research.5930.2
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Number of lines of code in typical classes of computer programs (via informationisbeautiful.net).
| Software Type | Lines of Code |
|---|---|
| Research code supporting a typical bioinformatics study, e.g. one graduate
| O(1000) – O(10,000) |
| Core scientific software (e.g. Matlab and R, not including add-on libraries). | O(100,000) |
| Large scientific collaborations (e.g. LHC, Hubble, climate models). | O(1,000,000) |
| Major software infrastructure (e.g. the Linux kernel, MS Office, etc.). | O(10,000,000) |