Preprint Review Version 1 This version is not peer-reviewed

Realistic and Robust Reproducible Research for Biostatistics

Version 1 : Received: 1 June 2020 / Approved: 2 June 2020 / Online: 2 June 2020 (09:24:25 CEST)

How to cite: Hejblum, B.P.; Kunzmann, K.; Lavagnini, E.; Hutchinson, A.; Robertson, D.S.; Jones, S.C.; Eckes-Shephard, A.H. Realistic and Robust Reproducible Research for Biostatistics. Preprints 2020, 2020060002 (doi: 10.20944/preprints202006.0002.v1). Hejblum, B.P.; Kunzmann, K.; Lavagnini, E.; Hutchinson, A.; Robertson, D.S.; Jones, S.C.; Eckes-Shephard, A.H. Realistic and Robust Reproducible Research for Biostatistics. Preprints 2020, 2020060002 (doi: 10.20944/preprints202006.0002.v1).

Abstract

The complexity of analysis pipelines in biomedical sciences poses a severe challenge for the transparency and reproducibility of results. Researchers are increasingly incorporating software development technologies and methods into their analyses, but this is a quickly evolving landscape and teams may lack the capabilities to set up their own complex IT infrastructure to aid reproducibility. Basing a reproducible research strategy on readily available solutions with zero or low set-up costs whilst maintaining technological flexibility to incorporate domain-specific software tools is therefore of key importance. We outline a practical approach for robust reproducibility of analysis results. In our examples, we rely exclusively on established open-source tools and free services. Special emphasis is put on the integration of these tools with best practices from software development and free online services for the biostatistics domain.

Subject Areas

Biostatistics; Data management; Reproducibility; Workflow automation

Comments (1)

Comment 1
Received: 25 June 2020
Commenter: Jeremy Leipzig
The commenter has declared there is no conflict of interests.
Comment: I thought this is a decent article and covers most of the basic components of reproducibility. Great sections on RDM. It plays a little fast and loose with the workflow stuff and adds some niche solutions as though they were ubiquitous (Cyverse and Gigantum are great but neither dominates the scene). Definitely a review worth reading.
+ Respond to this comment

We encourage comments and feedback from a broad range of readers. See criteria for comments and our diversity statement.

Leave a public comment
Send a private comment to the author(s)
Views 0
Downloads 0
Comments 1
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.