Submitted:
04 January 2025
Posted:
06 January 2025
You are already at the latest version
Abstract
Keywords:
Introduction
Methods
- represents the baseline log-odds of neonatal jaundice;
- represents hospital-specific baseline log-odds of neonatal jaundice and can be viewed as a latent feature that captures hospital’s characteristics, including their practices;
- and are respectively the set of the mother-level and hospitals-level covariates for the baby born in the hospital as mentioned earlier; and quantify how these set of covariates affect the odds of neonatal jaundice;
- represent our main exposure, prematurity and the Log odds-ratio comparing the premature () and non-premature babies ().
- Intra-class correlation
- Effect size
- Cluster size
- Covariates space
- Response surface
- Prematurity status
- Candidate methods
- Unobserved cluster-level confounders
- Performance metrics
Results
| Effect | ICC | CS | Imbalance | Method | Relative bias | RMSE | SE | Direction(%) |
|---|---|---|---|---|---|---|---|---|
| Small | Low | 25 | 0 | C_A | 0.475 - 0.836 | 0.045 - 0.078 | 0.001 - 0.002 | 0.846 - 0.961 |
| C_U | 0.487 - 0.634 | 0.046 - 0.06 | 0.001 - 0.002 | 0.886 - 0.961 | ||||
| 1 | C_A | 0.511 - 0.86 | 0.05 - 0.081 | 0.002 - 0.003 | 0.826 - 0.946 | |||
| C_U | 0.518 - 0.688 | 0.053 - 0.065 | 0.002 - 0.002 | 0.865 - 0.941 | ||||
| 100 | 0 | C_A | 0.246 - 0.399 | 0.024 - 0.038 | 0.001 - 0.001 | 0.978 - 0.998 | ||
| C_U | 0.266 - 0.332 | 0.025 - 0.032 | 0.001 - 0.001 | 0.982 - 0.997 | ||||
| 1 | C_A | 0.249 - 0.4 | 0.024 - 0.038 | 0.001 - 0.001 | 0.971 - 0.998 | |||
| C_U | 0.267 - 0.339 | 0.025 - 0.033 | 0.001 - 0.001 | 0.984 - 0.997 | ||||
| High | 25 | 0 | C_A | 0.528 - 0.966 | 0.044 - 0.08 | 0.001 - 0.002 | 0.813 - 0.946 | |
| C_U | 0.679 - 0.825 | 0.055 - 0.068 | 0.002 - 0.002 | 0.828 - 0.872 | ||||
| 1 | C_A | 0.562 - 0.944 | 0.047 - 0.077 | 0.001 - 0.002 | 0.791 - 0.92 | |||
| C_U | 0.684 - 0.851 | 0.059 - 0.069 | 0.002 - 0.002 | 0.826 - 0.881 | ||||
| 100 | 0 | C_A | 0.269 - 0.449 | 0.022 - 0.037 | 0.001 - 0.001 | 0.961 - 0.998 | ||
| C_U | 0.422 - 0.491 | 0.035 - 0.04 | 0.001 - 0.001 | 0.94 - 0.967 | ||||
| 1 | C_A | 0.268 - 0.431 | 0.023 - 0.036 | 0.001 - 0.001 | 0.95 - 0.997 | |||
| C_U | 0.422 - 0.488 | 0.035 - 0.041 | 0.001 - 0.001 | 0.941 - 0.964 | ||||
| Large | Low | 25 | 0 | C_A | 0.127 - 0.224 | 0.045 - 0.079 | 0.002 - 0.003 | 0.998 - 1 |
| C_U | 0.13 - 0.168 | 0.046 - 0.06 | 0.002 - 0.002 | 1 - 1 | ||||
| 1 | C_A | 0.137 - 0.226 | 0.051 - 0.081 | 0.002 - 0.003 | 0.995 - 1 | |||
| C_U | 0.139 - 0.184 | 0.054 - 0.066 | 0.002 - 0.002 | 0.998 - 0.999 | ||||
| 100 | 0 | C_A | 0.064 - 0.105 | 0.023 - 0.038 | 0.001 - 0.002 | 1 - 1 | ||
| C_U | 0.069 - 0.088 | 0.025 - 0.032 | 0.001 - 0.001 | 1 - 1 | ||||
| 1 | C_A | 0.067 - 0.107 | 0.024 - 0.038 | 0.001 - 0.002 | 1 - 1 | |||
| C_U | 0.072 - 0.09 | 0.026 - 0.032 | 0.001 - 0.002 | 1 - 1 | ||||
| High | 25 | 0 | C_A | 0.141 - 0.256 | 0.045 - 0.081 | 0.002 - 0.003 | 0.997 - 1 | |
| C_U | 0.177 - 0.216 | 0.055 - 0.068 | 0.002 - 0.002 | 1 - 1 | ||||
| 1 | C_A | 0.145 - 0.242 | 0.047 - 0.076 | 0.002 - 0.003 | 0.995 - 1 | |||
| C_U | 0.176 - 0.221 | 0.058 - 0.069 | 0.002 - 0.002 | 0.997 - 0.999 | ||||
| 100 | 0 | C_A | 0.07 - 0.12 | 0.022 - 0.038 | 0.001 - 0.002 | 1 - 1 | ||
| C_U | 0.111 - 0.13 | 0.035 - 0.04 | 0.001 - 0.002 | 1 - 1 | ||||
| 1 | C_A | 0.071 - 0.115 | 0.023 - 0.036 | 0.001 - 0.002 | 1 - 1 | |||
| C_U | 0.115 - 0.13 | 0.036 - 0.042 | 0.001 - 0.002 | 1 - 1 |
| Mean Cluster size | Imbalance Level | ICC value | Relative absolute deviation from theta |
|---|---|---|---|
| 25 | 0.00 | 0.03 | 0.22 |
| 0.14 | 0.28 | ||
| 0.28 | 0.32 | ||
| 0.50 | 0.02 | 0.44 | |
| 0.13 | 0.40 | ||
| 0.27 | 0.32 | ||
| 0.96 | 0.02 | 0.17 | |
| 0.12 | 0.29 | ||
| 0.26 | 0.36 | ||
| 50 | 0.00 | 0.03 | 0.13 |
| 0.14 | 0.16 | ||
| 0.28 | 0.29 | ||
| 0.50 | 0.03 | 0.13 | |
| 0.14 | 0.30 | ||
| 0.28 | 0.25 | ||
| 0.96 | 0.03 | 0.12 | |
| 0.13 | 0.25 | ||
| 0.27 | 0.37 | ||
| 100 | 0.00 | 0.04 | 0.11 |
| 0.15 | 0.14 | ||
| 0.28 | 0.24 | ||
| 0.50 | 0.04 | 0.10 | |
| 0.14 | 0.13 | ||
| 0.28 | 0.25 | ||
| 0.96 | 0.03 | 0.10 | |
| 0.14 | 0.17 | ||
| 0.27 | 1.46 |
Discussion
Supplementary Materials
References
- Colli, A., Pagliaro, L. & Duca, P. The ethical problem of randomization. Internal and emergency medicine 9, 799–804 (2014). [CrossRef]
- Hernan, M. & Robins, J. Causal inference: What if. boca raton: Chapman & hill/crc. (2020).
- Guo, S. & Fraser, M. W. Propensity Score Analysis: Statistical Methods and Applications. vol. 11 (SAGE publications, 2014).
- Rubin, D. B. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology 66, 688 (1974). [CrossRef]
- Sande, G. & Rothman, K. Modern Epidemiology. (1998).
- Rubin, D. B. Inference and missing data. Biometrika 63, 581–592 (1976).
- Holland, P. W. Statistics and causal inference. Journal of the American statistical Association 81, 945–960 (1986).
- Hill, J. L. Bayesian nonparametric modeling for causal inference. Journal of Computational and Graphical Statistics 20, 217–240 (2011). [CrossRef]
- Li, F., Zaslavsky, A. M. & Landrum, M. B. Propensity score weighting with multilevel data. Statistics in medicine 32, 3373–3387 (2013). [CrossRef]
- Chang, T.-H. & Stuart, E. A. Propensity score methods for observational studies with clustered data: a review. Statistics in medicine 41, 3612–3626 (2022). [CrossRef]
- Hahn, P. R., Murray, J. S. & Carvalho, C. M. Bayesian regression tree models for causal inference: Regularization, confounding, and heterogeneous effects (with discussion). Bayesian Analysis 15, 965–1056 (2020). [CrossRef]
- Prosperi, M. et al. Causal inference and counterfactual prediction in machine learning for actionable healthcare. Nature Machine Intelligence 2, 369–375 (2020). [CrossRef]
- Thoemmes, F. J. & West, S. G. The use of propensity scores for nonrandomized designs with clustered data. Multivariate behavioral research 46, 514–543 (2011). [CrossRef]
- Samanta, M. & Welsh, A. H. Bootstrapping for highly unbalanced clustered data. Computational Statistics & Data Analysis 59, 70–81 (2013). [CrossRef]
- W. Van der Elst, V. N., L. Hermans, G. Verbeke, M. G. Kenward & Molenberghs, G. Unbalanced cluster sizes and rates of convergence in mixed-effects models for clustered data. Journal of Statistical Computation and Simulation 86, 2123–2139 (2016). [CrossRef]
- Arpino, B. & Cannas, M. Propensity score matching with clustered data. An application to the estimation of the impact of caesarean section on the Apgar score. Statistics in medicine 35, 2074–2091 (2016). [CrossRef]
- Fuentes, A., Lüdtke, O. & Robitzsch, A. Causal inference with multilevel data: A comparison of different propensity score weighting approaches. Multivariate Behavioral Research 57, 916–939 (2022). [CrossRef]
- Scott, M. A., Diakow, R., Hill, J. L. & Middleton, J. A. Potential for bias inflation with grouped data: a comparison of estimators and a sensitivity analysis strategy. Observational Studies 4, 111–149 (2018). [CrossRef]
- Tovissodé, C. F., Honfo, S. H., Doumatè, J. T. & Glèlè Kakaï, R. On the discretization of continuous probability distributions using a probabilistic rounding mechanism. Mathematics 9, 555 (2021). [CrossRef]
- Rosenbaum, P. R. & Rubin, D. B. The central role of the propensity score in observational studies for causal effects. Biometrika 70, 41–55 (1983).
- D’Agostino Jr, R. B. Propensity score methods for bias reduction in the comparison of a treatment to a non-randomized control group. Statistics in medicine 17, 2265–2281 (1998).
- Middleton, J. A., Scott, M. A., Diakow, R. & Hill, J. L. Bias amplification and bias unmasking. Political Analysis 24, 307–323 (2016). [CrossRef]
- Hirano, K., Imbens, G. W. & Ridder, G. Efficient estimation of average treatment effects using the estimated propensity score. Econometrica 71, 1161–1189 (2003). [CrossRef]
- Horvitz, D. G. & Thompson, D. J. A generalization of sampling without replacement from a finite universe. Journal of the American statistical Association 47, 663–685 (1952).











| Methods | Description | C_U | C_A |
|---|---|---|---|
| 1: PSMPM | PSM with Single-level model with matching on pooled data [21] | x | |
| 2: WPSM | PSM with Single-level model and matching purely withing cluster matching [21] | x | |
| 3: PWPSM | PSM with Single-level model and matching preferentially within cluster matching [21] | x | |
| 4: FEPSM | PSM with Fixed effect model and matching on pooled data [21] | x | |
| 5: REPSM | PSM with Random intercept model and matching on pooled data [21] | x | |
| 6: PSWSLma | PSW with weights from Single-level model, on pooled data [9] | x | |
| 7: PSWFEma | PSW with weights from Fixed effect model, on pooled data [9] | x | |
| 8: PSWREma | PSW with weights from Random intercept model, on pooled data [9] | x | |
| 9: PSWSLwc | PSW with weights from Single-level model with aggregated within clustered effect [9] | x | |
| 10: PSWFEwc | PSW with weights from Fixed effect model with aggregated within clustered effect [9] | x | |
| 11: PSWREwc | PSW with weights from Random intercept model with aggregated within clustered effect [9] | x | |
| 12: SL-SL-PSW | Double robust PSW: Single-level-based PS - Single-level-based PO [9] | x | |
| 13: SL-FE-PSW | Double robust PSW: Single-level-based PS - Fixed-effect model-based PO [9] | x | |
| 14: SL-RE-PSW | Double robust PSW: Single-level-based PS - Random effect model-based PO [9] | x | |
| 15: FE-SL-PSW | Double robust PSW: Fixed-effect model-based PS - Single-level-based PO [9] | x | |
| 16: FE-FE-PSW | Double robust PSW: Fixed-effect model-based PS - Fixed-effect model-based PO [9] | x | |
| 17: FE-RE-PSW | Double robust PSW: Fixed-effect model-based PS - Random effect model-based PO [9] | x | |
| 18: RE-SL-PSW | Double robust PSW: Random effect model-based PS - Single-level-based PO [9] | x | |
| 19: RE-FE-PSW | Double robust PSW: Random effect model-based PS - Fixed-effect model-based PO [9] | x | |
| 20: RE-RE-PSW | Double robust PSW: Random effect model-based PS - Random effect model-based PO [9] | x |
| Mean cluster size | Imbalance | Confounders | Outcome ICC | True effect | Top 2 Methods | RMSE |
|---|---|---|---|---|---|---|
| 25 | 0.96 | All measured | 0.02 | -0.075 | att_RERE | 0.05 |
| att_RESL | 0.05 | |||||
| 0.26 | -0.064 | att_RERE | 0.05 | |||
| att_REFE | 0.05 | |||||
| One unmeasured cluster-level covariate | 0.02 | -0.075 | re_att_SLRE | 0.05 | ||
| att_RERE | 0.05 | |||||
| 0.26 | -0.064 | att_SLRE | 0.05 | |||
| att_SLFE | 0.05 | |||||
| 0.50 | All measured | 0.02 | -0.074 | att_SLRE | 0.05 | |
| re_att_SLRE | 0.05 | |||||
| 0.27 | -0.065 | att_SLRE | 0.04 | |||
| att_SLFE | 0.04 | |||||
| One unmeasured cluster-level covariate | 0.02 | -0.074 | att_SLRE | 0.05 | ||
| re_att_SLRE | 0.05 | |||||
| 0.27 | -0.065 | att_SLRE | 0.04 | |||
| att_SLFE | 0.04 | |||||
| 0.00 | All measured | 0.03 | -0.075 | att_SLRE | 0.04 | |
| att_SLFE | 0.04 | |||||
| 0.28 | -0.065 | att_SLFE | 0.04 | |||
| att_SLRE | 0.04 | |||||
| One unmeasured cluster-level covariate | 0.03 | -0.075 | att_SLFE | 0.04 | ||
| att_SLRE | 0.04 | |||||
| 0.28 | -0.065 | att_SLFE | 0.04 | |||
| att_SLRE | 0.04 | |||||
| 100 | 0.96 | All measured | 0.03 | -0.075 | att_SLRE | 0.02 |
| re_att_SLRE | 0.02 | |||||
| 0.27 | -0.066 | att_SLRE | 0.02 | |||
| att_SLFE | 0.02 | |||||
| One unmeasured cluster-level covariate | 0.03 | -0.075 | att_SLRE | 0.02 | ||
| re_att_SLRE | 0.02 | |||||
| 0.27 | -0.066 | att_SLRE | 0.02 | |||
| re_att_SLRE | 0.02 | |||||
| 0.49 | All measured | 0.04 | -0.073 | att_SLRE | 0.02 | |
| att_SLFE | 0.02 | |||||
| 0.28 | -0.064 | att_SLRE | 0.02 | |||
| att_SLFE | 0.02 | |||||
| One unmeasured cluster-level covariate | 0.04 | -0.073 | att_SLFE | 0.02 | ||
| att_SLRE | 0.02 | |||||
| 0.28 | -0.064 | att_SLRE | 0.02 | |||
| att_SLFE | 0.02 | |||||
| 0.00 | All measured | 0.04 | -0.075 | att_SLRE | 0.02 | |
| re_att_SLRE | 0.02 | |||||
| 0.28 | -0.065 | att_SLFE | 0.02 | |||
| att_SLRE | 0.02 | |||||
| One unmeasured cluster-level covariate | 0.04 | -0.075 | att_SLRE | 0.02 | ||
| re_att_SLRE | 0.02 | |||||
| 0.28 | -0.065 | att_SLFE | 0.02 | |||
| att_SLRE | 0.02 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).