Preprint, Essay, Version 1. Preserved in Portico. This version is not peer-reviewed.

Using Directed Acyclic Graphs (DAGs) to Represent the Data Generating Mechanisms of Disease and Healthcare Pathways: A Guide for Educators, Students, Practitioners and Researchers

Version 1 : Received: 23 September 2022 / Approved: 8 October 2022 / Online: 8 October 2022 (02:59:34 CEST)

A peer-reviewed article of this Preprint also exists.

Ellison, G.T.H. (2023). Using Directed Acyclic Graphs (DAGs) to Represent the Data Generating Mechanisms of Disease and Healthcare Pathways: A Guide for Educators, Students, Practitioners and Researchers. In: Farnell, D.J.J., Medeiros Mirra, R. (eds) Teaching Biostatistics in Medicine and Allied Health Sciences. Springer, Cham. https://doi.org/10.1007/978-3-031-26010-0_6

Abstract

Directed acyclic graphs (DAGs) are nonparametric causal path diagrams that have substantial utility as principled representations of disease and healthcare pathways, and of the underlying ‘data generating mechanisms’ these pathways involve. As such, DAGs provide a valuable bridge between: the aetiological knowledge, operational insight and professional experience on which clinical training and practice depend; and the more abstract epistemological and analytical considerations required to extract robust statistical insight from health and healthcare data. DAGs are nonetheless vulnerable to imperfect biomedical paradigms, partial clinical knowledge and limited empirical data. DAGs drawn under such circumstances offer limited scope for statistical insight free from cognitive, analytical or inferential bias if they: misrepresent the data generating mechanisms involved; or ignore the important role that omitted variables (whether measured, unmeasured or unacknowledged) might play therein. To address these weaknesses and broaden the appeal and application of DAGs, this chapter provides ten simple steps that educators can use to improve the analytical competence and statistical confidence of the healthcare students, qualified practitioners and experienced researchers they support. These steps use temporal logic to draw DAGs so as to: reduce reliance on uncertain knowledge, incomplete information, flawed assumptions or guesswork; and avoid, mitigate or acknowledge the errors and biases that each of these incurs. The chapter comprises an accessible, non-technical overview of the perspective and thoughtfulness required to generate temporally coherent DAGs as objective representations of the probabilistic causal paths involved in context-specific data generating mechanisms. It encourages a focus on those variables operating as potential sources of analytical or inferential bias when estimating the plausible, probabilistic causal relationship between two pre-specified variables; and specifically addresses the challenges posed by: omitted; time-variant; non-asynchronous; and temporally obscure variables. The chapter includes a worked example based on a published clinical study to demonstrate how each of the steps required to generate temporally-informed DAGs can be applied to: critically appraise the analytical decisions made during applied healthcare research; and inform the decisions required when designing, undertaking and analysing primary and secondary, prospective and retrospective research. The appendices include a summary of ten recommendations for improving the reporting and interrogability of DAGs and DAG-informed analyses.

Keywords

Directed Acyclic Graph; DAG; confounding; collider bias; epistemology; inferential statistics

Subject

Computer Science and Mathematics, Probability and Statistics

Comments (1)

Comment 1
Received: 16 February 2023
Commenter:
The commenter has declared there is no conflict of interests.
Comment: The "v2" version contains final edits and corrections made in proof to the Chapter below:

Ellison GTH. Using directed acyclic graphs (DAGs) to represent the data generating mechanisms of disease and healthcare pathways: a guide for educators, students, practitioners and researchers. Chapter 6 in: Medeiros Mirra RJ and Farnell D (eds.) Teaching Biostatistics in Medicine and Allied Health Sciences. Springer Verlag: 2023 (ISBN: 978-3-031-26009-4). DOI: 10.1007/978-3-031-26010-0_6.