Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

On the Discretization of Continuous Probability Distributions for Flexible Count Regression

Version 1 : Received: 15 January 2021 / Approved: 18 January 2021 / Online: 18 January 2021 (12:23:53 CET)

How to cite: Tovissodé, C.F.; Glèlè Kakaï, R.; Honfo, S.H.; Doumatè, J.T. On the Discretization of Continuous Probability Distributions for Flexible Count Regression. Preprints 2021, 2021010332 (doi: 10.20944/preprints202101.0332.v1). Tovissodé, C.F.; Glèlè Kakaï, R.; Honfo, S.H.; Doumatè, J.T. On the Discretization of Continuous Probability Distributions for Flexible Count Regression. Preprints 2021, 2021010332 (doi: 10.20944/preprints202101.0332.v1).

Abstract

Most existing flexible count regression models allow only approximate inference. This work proposes a new framework to provide an exact and flexible alternative for modeling and simulating count data with various types of dispersion (equi-, under- and overdispersion). The new method, referred as “balanced discretization”, consists in discretizing continuous probability distributions while preserving expectations. It is easy to generate pseudo random variates from the resulting balanced discrete distribution since it has a simple stochastic representation in terms of the continuous distribution. For illustrative purposes, we have developed the family of balanced discrete gamma distributions which can model equi-, under- and overdispersed count data. This family of count distributions is appropriate for building flexible count regressionmodels because the expectation of the distribution has a simple expression in terms of the parameters of the distribution. Using the Jensen–Shannon divergence measure, we have shown that under equidispersion restriction, the family of balanced discrete gamma distributions is similar to the Poisson distribution. Based on this, we conjecture that while covering all types of dispersion, a count regression model based on the balanced discrete gamma distribution will allow recovering a near Poisson distribution model fit when the data is Poisson distributed.

Subject Areas

flexible count models; balanced gamma distribution; Jensen–Shannon divergence; latent equidispersion

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our diversity statement.

Leave a public comment
Send a private comment to the author(s)
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.