| Literature DB >> 33206676 |
Wei Jeng1, Shih-Hung Wang2, Hung-Wei Chen2, Po-Wei Huang3, Yu-Jen Chen4, Hsu-Chun Hsiao2,5.
Abstract
Research transparency has been advocated as a key means of addressing the current crisis of reproducibility. This article proposes an enhanced form of research transparency, termed lifecycle transparency. Over the entire lifecycle of a research effort, this approach captures the syntactical contexts of artifacts and stakeholders, such as timestamps, agreements, and/or dependency requirements for completing each research phase. For example, such contexts might include when, where, and from whom patients' consent and institutional review board approvals were received before a clinical trial was carried out. However, as existing open-science tools are often dedicated to certain research phases or disciplines, and thus insufficient to support lifecycle transparency, we propose a novel decentralized framework to serve as a common medium for interaction among open-science tools, and produces irrefutable and immutable proofs of progress that can be verified automatically.Entities:
Year: 2020 PMID: 33206676 PMCID: PMC7673512 DOI: 10.1371/journal.pone.0241496
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Stakeholders and associated research artifacts in clear transparency.
(Note: Use case adopted from Benchoufi [5]).
Fig 2BLT ecosystem.
Fig 3Example BLT workflow.
Fig 4User tasks during creation of a BLT workflow.
Proposed smart-contract template.
| Actions | Descriptions |
|---|---|
| Creates a new project, initialized with customized templates. The variable isFrozen is used to determine whether the project workflow (e.g., phase relations) can be modified during the project. | |
| Adds a member, identified by her Ethereum address, to the research group. Only members of the research group are allowed to perform write operations on the contract. | |
| Removes a member from the research group. | |
| Creates a new phase of the project. If isFrozen is set to “false”, new phases are allowed to be created dynamically during the project. | |
| Logs a new record to a certain phase. | |
| Commits a certain phase to indicate that it is complete, and logs the committer and committed time. Once a phase is committed, new records can no longer be logged to it. | |
| Specifies a parent-child relationship between two phases. The child phase should not be committed until all of its parent phases are committed. This function is an implementation of the phase-dependency requirement. | |
| Sets a phase’s time constraint. The phase should be committed before—or, conversely, not before—a certain time. This function is an implementation of the time-constraint requirement. | |
| Calculates the hash of all records in a certain phase as the phase hash. The parties who agree on the phase should sign the phase hash with their respective private keys, and send their signatures to the contract. | |
| Stores a party’s signature on a phase hash, which serves as his/her/its agreement regarding that phase. This function is an implementation of the agreement requirement. | |
| Checks if all of a project’s phase-dependency, time, or agreement constraints have been satisfied, as of the time that this function is called. |
Sample use case of BLT.
| Research Stage | Description | Active Logging | Passive Logging |
|---|---|---|---|
| Early | 1. Alice filed an IRB application for her new grant. | BLT client | IRB application system |
| 2. Alice met an instrument vendor and got a quote. | BLT client | – | |
| Middle | 3. Alice’s research assistant logged new data from quantitative analysis software. | Lab notebook | Internet-connected instrument |
| 4. Alice’s postdoc updated the open protocol in Protocols.io | – | Protocols.io | |
| 5. Alice’s camera periodically takes a photo of the laboratory mice to record their reactions to a new drug every four hours. The hash of the photo is logged on BLT. | BLT client | – | |
| Last | 6. Alice received a rejection letter from a journal. | BLT client | Journal’s submission center |
Estimated gas required by BLT smart-contract methods.
| Method | # of para. | Min | Max | Init |
|---|---|---|---|---|
| constructor | 2 | 2,844,710 | 2,848,806 | 0 |
| addMember | 2 | 94,206 | 98,302 | 0 |
| delMember | 1 | 29,525 | 31,573 | 0 |
| addPhase | 1 | 92,477 | 94,525 | 0 |
| addPhaseDependency | 4 | 114,855 | 123,047 | 15,000 |
| addPhaseRecord | 7 | 154,435 | 168,771 | 15,000 |
| addTimeConstraint | 4 | 73,089 | 79,233 | 15,000 |
| addAgreement | 4 | 95,583 | 103,775 | 15,000 |
| calculatePhaseHash | 1 | 45,617 | 47,665 | 0 |
| commitPhase | 1 | 72,378 | 74,426 | 0 |
* has an additional cost of 2,728 gas per record.
Threats to reproducible science and research-misconduct types.
| Early stages of research lifecycle, e.g., idea, research design | ||
| T1 | Cognitive bias [ | In a research context, a tendency for researchers to make systematic errors in thinking and reasoning, such as unintentionally introducing bias, or having bias toward the treatment group in a clinical trial. Also includes attention bias. |
| T2 | Low statistical power [ | A small sample size and/or small effect can increase the likelihood of a Type II error (false negative). This, in turn, negatively influences a study’s statistical power. |
| M1 | Plagiarism (ideas, data, results) | The taking of ideas, data, or results from others without attributing the source and claiming them as one’s own. |
| Middle stage of research lifecycle, e.g., data collection, quality control, analysis | ||
| T3 | Poor execution / quality-control | Poor execution of data collection, and/or poor data-quality control. |
| T4 | Data dredging [ | Unlike the hypothetico-deductive approach, which requires researchers to collect evidence before making a null hypothesis or other hypotheses, data dredging is the “hacking” of the data-analysis process with the aim of finding post-hoc patterns that can pass tests of statistical significance. Data dredging increases the risk of Type I errors (false positive). |
| M2 | Fabrication | The creation or invention of nonexistent data or information related to the research. |
| M3 | Falsification | An alteration/misrepresentation of the observed result of a scientific experiment. |
| Late stage of research lifecycle, e.g., reporting, publishing, data sharing | ||
| T5 | HARKing [ | In research that purportedly utilizes a hypothetico-deductive approach, HARKing refers to hypothesizing after the results are known: i.e., introducing a post-hoc hypothesis into one’s report as if it were an a priori one. |
| T6 | Publication bias [ | Selective reporting and publication of only positive results. As such, it can be a combination of threats T1-T5. |
| T7 | Lack of data-sharing [ | Non-sharing of data can be considered a threat to the evaluation of a study’s reproducibility. Nevertheless, since data-sharing currently remains optional in many disciplines, it is not considered misconduct for our purposes. |
| M4 | Plagiarism (writing) | The copying of others’ reported results or other written work without proper attribution. |
| M5 | Redundant or duplicate publication | The submission of similar or identical articles to more than one venue simultaneously. A variant form, salami-slicing, means intentionally spreading the results of a single research project across multiple articles, to increase one’s overall number of publications. |
| M6 | Ghost and Guest authorship [ | Ghost and guest authorship refer to improper author attribution of work contribution. Specifically, ghost authorship means that a significant contributor is not listed as an author, whereas guest authorship means that a listed author does not make sufficient contribution to the work. |
Threats and misconduct types mitigated by BLT features.
| Features of BLT | Threat(s) and Misconduct Type(s) |
|---|---|
| F1. Proof of existence | M1, M4, M5, M6 |
| F2. Verifiable workflow | T5, M2 |
| F3. Immutability of logged data | T4, T6, M3 |
| F4. Agreement on phases | T1, T2 |
| F5. Accessibility constraint to researchers | T7 |