Preprint Review Version 1 Preserved in Portico This version is not peer-reviewed

Realistic and Robust Reproducible Research for Biostatistics

Version 1 : Received: 1 June 2020 / Approved: 2 June 2020 / Online: 2 June 2020 (09:24:25 CEST)

How to cite: Hejblum, B.P.; Kunzmann, K.; Lavagnini, E.; Hutchinson, A.; Robertson, D.S.; Jones, S.C.; Eckes-Shephard, A.H. Realistic and Robust Reproducible Research for Biostatistics. Preprints 2020, 2020060002. https://doi.org/10.20944/preprints202006.0002.v1 Hejblum, B.P.; Kunzmann, K.; Lavagnini, E.; Hutchinson, A.; Robertson, D.S.; Jones, S.C.; Eckes-Shephard, A.H. Realistic and Robust Reproducible Research for Biostatistics. Preprints 2020, 2020060002. https://doi.org/10.20944/preprints202006.0002.v1

Abstract

The complexity of analysis pipelines in biomedical sciences poses a severe challenge for the transparency and reproducibility of results. Researchers are increasingly incorporating software development technologies and methods into their analyses, but this is a quickly evolving landscape and teams may lack the capabilities to set up their own complex IT infrastructure to aid reproducibility. Basing a reproducible research strategy on readily available solutions with zero or low set-up costs whilst maintaining technological flexibility to incorporate domain-specific software tools is therefore of key importance. We outline a practical approach for robust reproducibility of analysis results. In our examples, we rely exclusively on established open-source tools and free services. Special emphasis is put on the integration of these tools with best practices from software development and free online services for the biostatistics domain.

Keywords

Biostatistics; Data management; Reproducibility; Workflow automation

Subject

Computer Science and Mathematics, Software

Comments (1)

Comment 1
Received: 25 June 2020
Commenter: Jeremy Leipzig
The commenter has declared there is no conflict of interests.
Comment: I thought this is a decent article and covers most of the basic components of reproducibility. Great sections on RDM. It plays a little fast and loose with the workflow stuff and adds some niche solutions as though they were ubiquitous (Cyverse and Gigantum are great but neither dominates the scene). Definitely a review worth reading.
+ Respond to this comment

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 1
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.