Submitted:
02 June 2026
Posted:
03 June 2026
You are already at the latest version
Abstract
Keywords:
MSC: 62F03; 62E20; 62P99
1. Introduction
2. Notation and Preliminaries
3. The Poisson–Three-Parameter Lindley Distribution
Probability generating function.
4. Parameter Estimation for the PTPL Distribution
4.1. Method of Moments
4.2. Maximum Likelihood Estimation
- (i)
- The PTPL model is correctly specified, i.e., the true distribution of X belongs to .
- (ii)
- Identifiability: implies on .
- (iii)
- The parameter space Θ is compact.
- (iv)
- The log-likelihood is upper semi-continuous in ϑ for each .
- (v)
- .
- (vi)
- The true parameter lies in the interior of Θ.
- (vii)
- The log-likelihood is twice continuously differentiable in ϑ on a neighbourhood of , for all . (This holds for the PTPL PMF (1) by inspection of its explicit form.)
- (viii)
- The Fisher information matrix is finite and non-singular.
4.3. Implications for the Goodness-of-Fit Test
5. The Proposed Goodness-of-Fit Test
6. Asymptotic Properties
- (i)
- for all .
- (ii)
- The parameter space Θ is compact and is continuous for each .
7. Bootstrap Implementation
7.1. Maximum Likelihood Estimation for the PTPL Model
8. Monte Carlo Study
8.1. Empirical Size
8.2. Empirical Power
- D1 — Poisson(), a strictly equidispersed model with dispersion index for all , representing the sharpest possible structural contrast with the PTPL family, which satisfies throughout its parameter space.
- D2 — Negative Binomial(), an overdispersed model with . We fix and vary , yielding .
- D3 — COM-Poisson(), a highly flexible count model that reduces to the Poisson when and is strongly overdispersed when [20]. We fix and vary . At , the resulting means range from approximately to with , far beyond the PTPL null range of .
- D4 — Zero-Inflated Poisson ZIP(), in which . Six configurations are evaluated, covering and zero-inflation proportions .
D1 — Poisson alternatives.
D2 — Negative Binomial alternatives.
D3 — COM-Poisson alternatives.
D4 — ZIP alternatives.
Overall assessment.
9. Real Data Application
- D1 — Industrial accidents [35]. Number of accidents suffered by female munitions workers during a twelve-week period, originally analyzed by Greenwood and Yule [35] and subsequently used as a standard benchmark for overdispersed count models. The sample mean is , the sample variance is , and the empirical dispersion index is , confirming mild overdispersion relative to the Poisson model.
- D4 — Deaths by horse kicks, Prussian army [38]. The classic dataset due to von Bortkiewicz [38], recording the number of soldiers killed by horse kicks per cavalry corps-year ( corps-year units, , , ). Although this dataset is only marginally overdispersed, it is included as a near-equidispersed benchmark to examine the calibration of the test under near-Poisson conditions.
- D5 — Products purchased per customer, Black Friday [39]. A contemporary retail dataset freely available on Kaggle [39], comprising transactions recorded at an ABC Pvt Ltd retail store during a Black Friday sales event. The count variable of interest is the number of distinct products purchased per unique customer, derived by grouping the raw transaction file by User_ID and counting the number of distinct Product_ID values per user. This yields customer-level observations with , , and , confirming moderate overdispersion. To the best of our knowledge, this is the first application of a PTPL-based model to a Black Friday e-commerce count variable.
10. Discussion
- 1.
- Local power analysis. Deriving the asymptotic power of against Pitman sequences would characterise the rate at which the test detects local alternatives and enable comparison with minimax-optimal procedures.
- 2.
- Multivariate and zero-inflated extensions. The PGF approach extends naturally to multivariate PTPL models and to zero-inflated variants, following the framework of Novoa-Muñoz and Jiménez-Gamero [9].
- 3.
- Model selection. Combining the proposed test with information criteria (AIC, BIC) and minimum-distance PGF estimators [41] may yield more robust model selection procedures for overdispersed count data.
- 4.
- Truncated and censored data. Extensions to zero-truncated or right-censored count data, common in reliability and actuarial applications, constitute a natural avenue for future work.
11. Conclusions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A. Algorithmic Descriptions of the Bootstrap Procedure


References
- Fisher, R.A. On the Interpretation of χ2 from Contingency Tables, and the Calculation of P. Journal of the Royal Statistical Society 1922, 85, 87–94.
- Cochran, W.G. The χ2 test of goodness of fit. Annals of Mathematical Statistics 1952, 23(3), 315–345.
- Moore, D.S. Chi-square tests with random probabilities. Journal of the American Statistical Association 1986, 81, 372–377.
- Read, T.R.C.; Cressie, N.A.C. Goodness-of-Fit Statistics for Discrete Multivariate Data. Springer-Verlag: New York, NY, USA, 1988.
- Cressie, N. Statistics for Spatial Data. Wiley, 1992.
- Rueda, R., O’Reilly, F., Pérez- Abreu, V. (1991). Goodness of fit for the poisson distribution based on the probability generating function. Communications in Statistics - Theory and Methods, 20(10), 3093-3110. [CrossRef]
- Henze, N. Empirical-distribution-function goodness-of-fit tests for discrete models. Journal of Multivariate Analysis 1996, 58, 145–168.
- Henze, N. Goodness-of-fit tests for discrete distributions based on generating functions. Statistics & Probability Letters 2001, 54, 41–50.
- Novoa-Muñoz, F.; Jiménez-Gamero, M.D. A goodness-of-fit test for the multivariate Poisson distribution. SORT 2016, 40, 113-138.
- Novoa-Muñoz, F.; Jiménez-Gamero, M.D. Testing for the bivariate Poisson distribution. Metrika 2014, 77, 771-793.
- Novoa-Muñoz, F. Goodness-of-fit tests for the bivariate Poisson distribution. Commun. Stat. Simul. Comput. 2019, 50, 1998-2014. [CrossRef]
- Nikitin, Ya. Yu. Asymptotic Efficiency of Nonparametric Tests; Cambridge University Press: Cambridge, UK, 1995.
- Baringhaus, L.; Henze, N. A consistent test for multivariate normality based on the empirical characteristic function. Journal of Multivariate Analysis 2004, 88, 152–170.
- Novoa-Muñoz, F. Tests de Bondad de Ajuste para la Distribución Poisson Bivariante. Editorial Académica Española, 2018.
- Johnson, N.L.; Kemp, A.W.; Kotz, S. Univariate Discrete Distributions, 3rd ed.; Wiley: Hoboken, NJ, USA, 2005.
- Sankaran, M. The discrete Poisson–Lindley distribution. Biometrics 1970, 26(1), 145–149.
- Ghitany, M.E.; Al-Mutairi, D.K.; Nadarajah, S. Zero-truncated Poisson-Lindley distribution and its application. Mathematics and Computers in Simulation 2008, 79, 279–287.
- Mahmoudi, E.; Zakerzadeh, H. Generalized Poisson-Lindley distribution. Communications in Statistics - Theory and Methods 2010, 39(10), 1785–1798. [CrossRef]
- Das, K.; Ahmed, I.; Bhattacharjee, S. A new three-parameter Poisson–Lindley distribution for modelling over-dispersed count data. International Journal of Applied Engineering Research 2018, 13, 16468–16477.
- Shmueli, G.; Minka, T.P.; Kadane, J.B.; Borle, S.; Boatwright, P. A useful distribution for fitting discrete data: revival of the COM-Poisson distribution. J. R. Stat. Soc. Ser. C. 2005, 54(1), 127-142. [CrossRef]
- van der Vaart, A.W. Asymptotic Statistics; Cambridge University Press: Cambridge, UK, 1998.
- Shanker, R.; Shukla, K.; Shanker, R.; Tekie, A.L. A three-parameter Lindley distribution. American Journal of Mathematics and Statistics 2017, 7, 15–26.
- Billingsley, P. Probability and Measure, 3rd ed.; Wiley: New York, NY, USA, 1995.
- Casella, G.; Berger, R.L. Statistical Inference, 2nd ed.; Duxbury Press: Pacific Grove, CA, USA, 2002.
- Teicher, H. Identifiability of mixtures. Annals of Mathematical Statistics 1961, 32(1), 244–248.
- Wald, A. Note on the consistency of the maximum likelihood estimate. Annals of Mathematical Statistics 1949, 20, 595–601.
- Lehmann, E.L.; Casella, G. Theory of Point Estimation, 2nd ed.; Springer: New York, NY, USA, 1998.
- Chow, Y.S.; Teicher, H. Probability Theory: Independence, Interchangeability, Martingales; Springer: New York, NY, USA, 1997.
- van der Vaart, A.W.; Wellner, J.A. Weak Convergence and Empirical Processes; Springer: New York, NY, USA, 1996.
- Royden, H.L.; Fitzpatrick, P.M. Real Analysis, 4th ed.; Pearson: Boston, MA, USA, 2010.
- Baringhaus, L.; Henze, N. Limit distributions for test statistics based on the empirical moment generating function. Statistics & Probability Letters 2004, 69, 79–88.
- Bickel, P.J.; Freedman, D.A. Some asymptotic theory for the bootstrap. Annals of Statistics 1981, 9, 1196–1217.
- R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2024. Available online: https://www.R-project.org/ (accessed on 28 January 2026).
- Novoa-Muñoz, F. Parallelization strategies for bootstrap-intensive goodness-of-fit simulations in R. [Complete with journal/proceedings details as appropriate], 2024.
- Greenwood, M.; Yule, G.U. An inquiry into the nature of frequency distributions representative of multiple happenings with particular reference to the occurrence of multiple attacks of disease or of repeated accidents. J. R. Stat. Soc. 1920, 83(2), 255-279. [CrossRef]
- McGuire, J.U; Brindley, T.A.; Bancroft, T.A. The distribution of European corn borer larvae Pyrausta nubilalis (Hüb.) in field corn. Biometrics. 1957, 13(1), 65-78. [CrossRef]
- Garman, P. Mite species on apple trees in Connecticut; Bulletin 535. New Haven, CT, USA, 1951.
- von Bortkiewicz, L. Das Gesetz der Kleinen Zahlen; Teubner, Leipzig, Germany, 1898.
- ABC Pvt Ltd and Analytics Vidhya. Black Friday Sales Dataset. Kaggle, 2019. Available online: https://www.kaggle.com/datasets/mehdidag/black-friday (accessed on 10 May 2026).
- Gelman, A.; Carlin, J.B.; Stern, H.S.; Dunson, D.B.; Vehtari, A.; Rubin, D.B. Bayesian Data Analysis, 3rd ed.; CRC Press: Boca Raton, FL, USA, 2013.
- Jiménez-Gamero, M.D.; Batsidis, A. Minimum distance estimators for count data based on the probability generating function with applications. Metrika 2017, 80(5), 503-545.

| Triple | ||||||
|---|---|---|---|---|---|---|
| 3.0 | 0.8 | 0.5 | 0.3908 | 0.5369 | 1.3739 | |
| 2.0 | 0.3 | 0.8 | 0.7857 | 1.2398 | 1.5779 | |
| 1.5 | 1.5 | 1.5 | 0.9333 | 1.6622 | 1.7810 | |
| 1.0 | 0.5 | 1.0 | 1.6667 | 3.5556 | 2.1333 | |
| 0.8 | 0.2 | 3.0 | 2.4367 | 5.5577 | 2.2808 | |
| 0.5 | 1.0 | 2.0 | 3.6000 | 11.4400 | 3.1778 |
![]() |
![]() |
| Dataset | Model | AIC | df | |||
|---|---|---|---|---|---|---|
|
D1: Industrial accidents Greenwood & Yule [35] , , |
Poisson | 2 | ||||
| Neg. Binomial | 2 | |||||
| COM-Poisson | 2 | |||||
| ZIP | 2 | |||||
| PTPL | 1 | |||||
| GOF-PGF:, [not rejected at 5%] | ||||||
|
D2:Pyrausta nubilalis McGuire et al. [36] , , |
Poisson | 4 | ||||
| Neg. Binomial | 5 | |||||
| COM-Poisson | 5 | |||||
| ZIP | 5 | |||||
| PTPL | 4 | |||||
| GOF-PGF:, [not rejected at 5%] | ||||||
|
D3: European red mites Garman [37] , , |
Poisson | 3 | ||||
| Neg. Binomial | 4 | |||||
| COM-Poisson | 4 | |||||
| ZIP | 4 | |||||
| PTPL | 3 | |||||
| GOF-PGF:, [not rejected at 5%] | ||||||
|
D4: Deaths by horse kicks Bortkiewicz [38] , , |
Poisson | 2 | ||||
| Neg. Binomial | 1 | |||||
| COM-Poisson | 1 | |||||
| ZIP | 1 | |||||
| PTPL | 1 | |||||
| GOF-PGF:, [not rejected at 5%] | ||||||
|
D5: Products per customer Black Friday [39] , , |
Poisson | 13 | ||||
| Neg. Binomial | 12 | |||||
| COM-Poisson | 12 | |||||
| ZIP | 13 | |||||
| PTPL | 11 | |||||
| GOF-PGF:, [rejected at 5%; see Note] | ||||||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

