Preprint
Article

This version is not peer-reviewed.

The New Generalized Odd Median Based Unit Rayleigh with a New Shape Oscillating Hazard Rate Function

Submitted:

24 February 2025

Posted:

25 February 2025

You are already at the latest version

Abstract
In this paper, the author presents the generalized form of the Median-Based Unit Rayleigh (MBUR) distribution, a novel statistical distribution that is specifically defined within the interval (0, 1) expressing oscillating hazard rate function. This generalization adds a new parameter to the MBUR distribution that significantly addresses the unique characteristics of data represented as ratios and proportions, which are commonly encountered in various fields of research. The establishment of this generalization aims to deepen our understanding of these phenomena by providing a robust framework for analysis. The paper offers a thorough and meticulous derivation of the probability density function (PDF) for the MBUR distribution, illuminating each phase of the process with clarity and precision. It delves deep into the intricacies of the MBUR distribution's properties, presenting a rigorous examination of the accompanying functions that are vital for robust statistical evaluation. These functions—comprising the cumulative distribution function (CDF), survival function, hazard rate and reversed hazard rate function. The paper discusses real data analysis and how the generalization improves such analysis.
Keywords: 
;  ;  ;  

Introduction

Fitting data to statistical distributions is crucial for understanding the underlying processes that generate the data. Researchers have successfully developed a variety of distributions to describe complex real-world phenomena effectively. Before 1980, the primary techniques for generating distributions included solving systems of differential equations, using transformations, and applying quantile function strategies. Since 1980, the focus has shifted to adding new parameters to existing distributions or combining known distributions. These methods have resulted in a wide range of adaptable and flexible distributions that can accommodate various types of asymmetrical data and outliers in data sets. Fitting distributions to data improves modeling in analyses related to regression, survival analysis, reliability analysis, and time series analysis.
A multitude of real-world phenomena can be elegantly captured as proportions, ratios, or fractions nestled within the bounded interval (0,1). These captivating representations are not merely abstract concepts; they reflect the intricate relationships found in various fields such as biology, where the delicate balance of ecosystems is analyzed; finance, where the ebb and flow of market ratios unfold; and mortality rates, which provide profound insights into human health and longevity. Additionally, recovery rates in medical science showcase the resilience of life, while economics delves into the nuanced distributions of wealth and resources. Engineering and hydrology further enrich this tapestry, modeling everything from structural integrity to the flow of water in our environments. The measurement sciences, too, rely on these continuous distributions, breathing life into data that inform our understanding of the world.
Some of these distributions include: the Johnson SB distribution (1), Beta distribution (2), Unit Johnson distribution (3), Topp-Leone distribution (4), Unit Gamma distribution (5),(6),(7),(8), Unit Logistic distribution (9), Kumaraswamy distribution (10), Unit Burr-III distribution (11), Unit Modified Burr-III distribution (12), Unit Burr-XII distribution (13), Unit-Gompertz distribution (14), Unit-Lindley distribution (15), Unit-Weibull distribution (16), Unit-Birnbaum-Saunders (17) and Unit Muth distribution (18).
This paper is structured into several sections for clarity and coherence. Section 1 provides a comprehensive discussion of the methodology employed to derive the new distribution. Section 2 delves into its fundamental characteristics, including the probability density function (PDF), cumulative distribution function (CDF), survival function (S), hazard function (HF), reversed hazard function (RHF), and quantile function. Section 3 offers an in-depth discussion that encompasses an analysis of real data as well as a detailed examination of the findings. In conclusion, Section 4 provides a comprehensive overview of our findings and offers valuable recommendations for future research, inviting further exploration and innovation in the field.

Section 1: Methodology:

Derivation of the PDF

The general formula of the median order statistics for an odd sample size can be written as in equation (1)
f i : m = m ! m 1 2 ! m 1 2 ! F x x m 1 2 1 F x x m 1 2 f x x ( 1 )
Where (i) is the ith odd order statistics and m is the sample size. Replacing (m) sample size (which is an odd number) with m=2n+1 as shown in equation (2.A) & (2.B)
f i : m = 2 n + 1 ! 2 n + 1 1 2 ! 2 n + 1 1 2 ! F x x 2 n + 1 1 2 1 F x x 2 n + 1 1 2 f x x . ( 2 . A )
f i : m = 2 n + 1 ! n ! n ! F x x n 1 F x x n f x x ( 2 . B )
Substituting equation 3 which is the CDF and the PDF of Rayleigh distribution in equation 2.B yields equation 4.
F x x = 1 e x p x 2 α 2   &   f x x = 2 x α 2   e x p x 2 α 2 . . ( 3 )
f i : m = 2 n + 1 ! n ! n ! 1 e x p x 2 α 2 n e x p x 2 α 2 n 2 x α 2   e x p x 2 α 2 . ( 4 )
Using the transformation in equation 5 and the Jacobian in equation 6 and then substituting both in equation 4 yields the new PDF in equation 7. This is the first version of generalization of the MBUR.
l e t   y = e x 2   . . 5 . A
ln y = x 2   ,   ln y 0.5 = x ( 5 . B )  
d x d y = 1 2 ln y 0.5 1 y . . ( 6 )
f y y = Γ 2 n + 2 Γ n + 1 Γ n + 1 1 α 2 1 y α 2 n y n + 1 α 2 1 , n 0   ,   α > 0 ,   0 < y < 1 ( 7 )
The second version of generalization is obtained by substituting the CDF and the PDF of Rayleigh in equation 8 which yields equation 9
f i : n = n ! n 1 2 ! n 1 2 ! F x x n 1 2 1 F x x n 1 2 f x x . . ( 8 )
f i : n = n ! n 1 2 ! n 1 2 ! 1 e x p x 2 α 2 n 1 2 e x p x 2 α 2 n 1 2 2 x α 2   e x p x 2 α 2 . . ( 9 )
Substituting the same transformation of equation 5 and the same Jacobian of equation 6 in equation 9 yields the new PDF in equation 10. This is the second version of generalization of the MBUR distribution.
f y y = Γ n + 1 Γ n 2 + 1 2 Γ n 2 + 1 2 1 α 2 1 y α 2 n 1 2 y n + 1 2 α 2 1   , n 1   ,   α > 0 ,   0 < y < 1 . . ( 10 )
Theorem 1: Both versions in equation 7 and 10 are valid PDF.
Proof: PDF version in equation 7:
To prove that the PDF in equation 7 is a valid PDF, the integral in equation 11 should equal 1. Applying the transformation in equation 12 and substitute in equation 11:
Γ 2 n + 2 Γ n + 1 Γ n + 1 0 1 1 α 2 1 y α 2 n y n + 1 α 2 1 d y = 1 . ( 11 )
l e t :     y 1 α 2 = w   ,   s o   y = w α 2   t h i s   g i v e s   d y = α 2 w α 2 1 d w . ( 12 )  
Γ 2 n + 2 Γ n + 1 Γ n + 1 0 1 1 α 2 1 w n w α 2 n + 1 α 2 1 α 2 w α 2 1 d w = 1
For the PDF version in equation 10, applying the same transformation will integrate the PDF in equation 10 to 1.

Section 2: Some Properties of the Generalized Odd MBUR Distribution

Theorem 2: the cumulative distribution function (CDF) of the generalized odd MBUR is:
P Y < y = I w n + 1 , n + 1 for version 1 & P Y < y = I w n + 1 2 , n + 1 2 for version 2.
Proof: for version 1:
P Y < y = Γ 2 n + 2 Γ n + 1 Γ n + 1 0 y 1 α 2 1 y α 2 n y n + 1 α 2 1 d y ( 13 )  
Apply the transformation of equation 12 and substitute in equation 13 yields equation 14
P Y < y = Γ 2 n + 2 Γ n + 1 Γ n + 1 0 w 1 w n w n d w = B w n + 1 , n + 1 B n + 1 , n + 1 ( 14 )
For version 2:
P Y < y = Γ n + 1 Γ n 2 + 1 2 Γ n 2 + 1 2   0 y 1 α 2 1 y α 2 n 1 2 y n + 1 2 α 2 1   d y . ( 15 )      
Apply the transformation of equation 12 and substitute in equation 15 yields equation 16
P Y < y = Γ n + 1 Γ n + 1 2 Γ n + 1 2 0 w 1 w n 1 2 w n 1 2 d w = B w n + 1 2 , n + 1 2 B n + 1 2 , n + 1 2 . . ( 16 )
Lemma 1: the survival function (S) for version 1 is shown in equation 17
S y = 1 P Y < y = 1 I w n + 1 , n + 1 . . ( 17 )
Lemma 2: the survival function (S) for version 2 is shown in equation 18
S y = 1 P Y < y = 1 I w n + 1 2 , n + 1 2   ( 18 )
Lemma 3: the Hazard function or rate (HF or hr) for version 1 is shown in equation 19
h r y = f Y y 1 P Y < y = Γ 2 n + 2 Γ n + 1 Γ n + 1 1 α 2 1 y 1 α 2 n y n + 1 α 2 1 1 I w n + 1 , n + 1 . ( 19 )
Lemma 4: the Hazard function or rate (HF or hr) for version 2 is shown in equation 20
h r y = f Y y 1 P Y < y = Γ n + 1 Γ n 2 + 1 2 Γ n 2 + 1 2 1 α 2 1 y α 2 n 1 2 y n + 1 2 α 2 1 1 I w n + 1 , n + 1 ( 20 )
Lemma 5: the reversed hazard function (RHF) or reversed hazard rate (rhr) for version 1 is shown in equation 21
r h r y = f Y y P Y < y = Γ 2 n + 2 Γ n + 1 Γ n + 1 1 α 2 1 y α 2 n y n + 1 α 2 1 I w n + 1 , n + 1 ( 21 )
Lemma 6: the reversed hazard function (RHF) or reversed hazard rate (rhr) for version 2 is shown in equation 22
r h r y = f Y y P Y < y = Γ n + 1 Γ n 2 + 1 2 Γ n 2 + 1 2 1 α 2 1 y α 2 n 1 2 y n + 1 2 α 2 1 I w n + 1 , n + 1 ( 22 )
The quantile function of the distribution has no closed explicit form.
Figure 1, Figure 2, Figure 3, Figure 4, Figure 5, Figure 6, Figure 7, Figure 8 and Figure 9 illustrate the PDF of the first version for different values of n {2, 3, 5, 10, 20, 30, 40, 50, 84} and different values of alpha {0.452, 1, 1.5, 1.8, 2.2, 4.5, 10.5}
Figure 1. shows PDF of the first version for different alpha and n=2. For alpha less than 1, the PDF is left skewed. If alpha is one the PDF is symmetric around y=0.5. For alpha values larger than 1 the PDF exhibits right skewness or decreasing .
Figure 1. shows PDF of the first version for different alpha and n=2. For alpha less than 1, the PDF is left skewed. If alpha is one the PDF is symmetric around y=0.5. For alpha values larger than 1 the PDF exhibits right skewness or decreasing .
Preprints 150412 g001
Figure 2. shows PDF of the first version for different levels of alpha and n=3.
Figure 2. shows PDF of the first version for different levels of alpha and n=3.
Preprints 150412 g002
Figure 3. shows PDF of the first version for different levels of alpha and n=5.
Figure 3. shows PDF of the first version for different levels of alpha and n=5.
Preprints 150412 g003
Figure 4. shows PDF of the first version for different levels of alpha and n=10.
Figure 4. shows PDF of the first version for different levels of alpha and n=10.
Preprints 150412 g004
Figure 5. shows PDF of the first version for different levels of alpha and n=20.
Figure 5. shows PDF of the first version for different levels of alpha and n=20.
Preprints 150412 g005
Figure 6. shows PDF of the first version for different levels of alpha and n=30.
Figure 6. shows PDF of the first version for different levels of alpha and n=30.
Preprints 150412 g006
Figure 7. shows PDF of the first version for different levels of alpha and n=40.
Figure 7. shows PDF of the first version for different levels of alpha and n=40.
Preprints 150412 g007
Figure 8. shows PDF of the first version for different levels of alpha and n=50.
Figure 8. shows PDF of the first version for different levels of alpha and n=50.
Preprints 150412 g008
Figure 9. shows PDF of the first version for different levels of alpha and n=84.
Figure 9. shows PDF of the first version for different levels of alpha and n=84.
Preprints 150412 g009
The figures of the PDF of the first version illustrates that for values of n larger than 30 but not exceeding 84 and associated with different values of alpha not exceeding 4, these PDFs are more or less symmetric around different values of the variable occupying the whole unit range. This is an advantage of the new added parameter that generalizes the distribution.
Figure 10, Figure 11, Figure 12, Figure 13, Figure 14, Figure 15, Figure 16, Figure 17, Figure 18, Figure 19 and Figure 20 illustrate the PDF of the second version for different values of n {2, 3, 5, 10, 20, 30, 40, 50, 100, 150, 170} and different values of alpha {0.452, 1, 1.5, 1.8, 2.2, 4.5, 10.5}
Figure 10. shows PDF of the second version for different levels of alpha and n=2.
Figure 10. shows PDF of the second version for different levels of alpha and n=2.
Preprints 150412 g010
Figure 11. shows PDF of the second version for different levels of alpha and n=3.
Figure 11. shows PDF of the second version for different levels of alpha and n=3.
Preprints 150412 g011
Figure 12. shows PDF of the second version for different levels of alpha and n=5.
Figure 12. shows PDF of the second version for different levels of alpha and n=5.
Preprints 150412 g012
Figure 13. shows PDF of the second version for different levels of alpha and n=10.
Figure 13. shows PDF of the second version for different levels of alpha and n=10.
Preprints 150412 g013
Figure 14. shows PDF of the second version for different levels of alpha and n=20.
Figure 14. shows PDF of the second version for different levels of alpha and n=20.
Preprints 150412 g014
Figure 15. shows PDF of the second version for different levels of alpha and n=30.
Figure 15. shows PDF of the second version for different levels of alpha and n=30.
Preprints 150412 g015
Figure 15. shows PDF of the second version for different levels of alpha and n=40.
Figure 15. shows PDF of the second version for different levels of alpha and n=40.
Preprints 150412 g016
Figure 16. shows PDF of the second version for different levels of alpha and n=50.
Figure 16. shows PDF of the second version for different levels of alpha and n=50.
Preprints 150412 g017
Figure 17. shows PDF of the second version for different levels of alpha and n=100.
Figure 17. shows PDF of the second version for different levels of alpha and n=100.
Preprints 150412 g018
Figure 18. shows PDF of the second version for different levels of alpha and n=150.
Figure 18. shows PDF of the second version for different levels of alpha and n=150.
Preprints 150412 g019
Figure 19. shows PDF of the second version for different levels of alpha and n=170.
Figure 19. shows PDF of the second version for different levels of alpha and n=170.
Preprints 150412 g020
The figures of the PDF of the second version illustrates that for values of n larger than 50 but not exceeding 170 and associated with different values of alpha not exceeding 4, these PDFs are more or less symmetric around different values of the variable occupying the whole unit range. This is an advantage of the new added parameter that generalizes the distribution
Figure 20 and Figure 21 illustrate the CDF of the first version for different values of n {2, 3, 5, 10, 20, 30, 40, 50, 100, 150, 170} and different values of alpha {0.272, 0.614, 1, 1.3, 1.8, 2.2, 3.5, 10.5}
Figure 20. shows CDF of the first version for different levels of alpha and n.
Figure 20. shows CDF of the first version for different levels of alpha and n.
Preprints 150412 g021
Figure 21. shows CDF of the first version for different levels of alpha and n.
Figure 21. shows CDF of the first version for different levels of alpha and n.
Preprints 150412 g022
Figure 22 and Figure 23 illustrate the CDF of the second version for different values of n {2, 3, 5, 10, 20, 30, 40, 50, 100, 150, 170} and different values of alpha {0.272, 0.614, 1, 1.3, 1.8, 2.2, 3.5, 10.5}
Figure 22. shows CDF of the second version for different levels of alpha and n.
Figure 22. shows CDF of the second version for different levels of alpha and n.
Preprints 150412 g023
Figure 23. shows CDF of the second version for different levels of alpha and n.
Figure 23. shows CDF of the second version for different levels of alpha and n.
Preprints 150412 g024
Figure 24 and Figure 25 illustrate the survival function of the first version for different values of n {2, 3, 5, 10, 20, 30, 40, 50, 100, 150, 170} and different values of alpha {0.272, 0.614, 1, 1.3, 1.8, 2.2, 3.5, 10.5}
Figure 24. shows survival function of the first version for different levels of alpha and n.
Figure 24. shows survival function of the first version for different levels of alpha and n.
Preprints 150412 g025
Figure 25. shows survival function of the first version for different levels of alpha and n.
Figure 25. shows survival function of the first version for different levels of alpha and n.
Preprints 150412 g026
Figure 26 and Figure 27 illustrate the survival function of the second version for different values of n {2, 3, 5, 10, 20, 30, 40, 50, 100, 150, 170} and different values of alpha {0.272, 0.614, 1, 1.3, 1.8, 2.2, 3.5, 10.5}
Figure 26. shows survival function of the second version for different levels of alpha and n.
Figure 26. shows survival function of the second version for different levels of alpha and n.
Preprints 150412 g027
Figure 27. shows survival function of the second version for different levels of alpha and n.
Figure 27. shows survival function of the second version for different levels of alpha and n.
Preprints 150412 g028
Figure 28, Figure 29, Figure 30, Figure 31, Figure 32, Figure 33, Figure 34, Figure 35 and Figure 36 illustrate the hazard rate function (hr) of the first version for different values of n {2, 3, 5, 10, 20, 30, 40, 50, 84} and different values of alpha {0.272, 0.614, 1, 1.3, 1.8, 2.2, 3.5, 10.5}
Figure 28. shows hazard rate function of the first version for different levels of alpha and n=2.
Figure 28. shows hazard rate function of the first version for different levels of alpha and n=2.
Preprints 150412 g029
Figure 29. shows hazard rate function of the first version for different levels of alpha and n=3.
Figure 29. shows hazard rate function of the first version for different levels of alpha and n=3.
Preprints 150412 g030
Figure 30. shows hazard rate function of the first version for different levels of alpha and n=5.
Figure 30. shows hazard rate function of the first version for different levels of alpha and n=5.
Preprints 150412 g031
Figure 31. shows hazard rate function of the first version for different levels of alpha and n=10.
Figure 31. shows hazard rate function of the first version for different levels of alpha and n=10.
Preprints 150412 g032
Figure 32. shows hazard rate function of the first version for different levels of alpha and n=20.
Figure 32. shows hazard rate function of the first version for different levels of alpha and n=20.
Preprints 150412 g033
Figure 33. shows hazard rate function of the first version for different levels of alpha and n=30.
Figure 33. shows hazard rate function of the first version for different levels of alpha and n=30.
Preprints 150412 g034
Figure 34. shows hazard rate function of the first version for different levels of alpha and n=40.
Figure 34. shows hazard rate function of the first version for different levels of alpha and n=40.
Preprints 150412 g035
Figure 35. shows hazard rate function of the first version for different levels of alpha and n=50.
Figure 35. shows hazard rate function of the first version for different levels of alpha and n=50.
Preprints 150412 g036
Figure 36. shows hazard rate function of the first version for different levels of alpha and n=84.
Figure 36. shows hazard rate function of the first version for different levels of alpha and n=84.
Preprints 150412 g037
Figure 37, Figure 38, Figure 39, Figure 40, Figure 41, Figure 42, Figure 43, Figure 44, Figure 45, Figure 46 and Figure 47 illustrate the hazard rate function (hr) of the second version for different values of n {2, 3, 5, 10, 20, 30, 40, 50, 100,150,170} and different values of alpha {0.272, 0.614, 1, 1.3, 1.8, 2.2, 3.5, 10.5}
Figure 37. shows hazard rate function of the second version for different levels of alpha and n=2.
Figure 37. shows hazard rate function of the second version for different levels of alpha and n=2.
Preprints 150412 g038
Figure 38. shows hazard rate function of the second version for different levels of alpha and n=3.
Figure 38. shows hazard rate function of the second version for different levels of alpha and n=3.
Preprints 150412 g039
Figure 39. shows hazard rate function of the second version for different levels of alpha and n=5.
Figure 39. shows hazard rate function of the second version for different levels of alpha and n=5.
Preprints 150412 g040
Figure 40. shows hazard rate function of the second version for different levels of alpha and n=10.
Figure 40. shows hazard rate function of the second version for different levels of alpha and n=10.
Preprints 150412 g041
Figure 41. shows hazard rate function of the second version for different levels of alpha and n=20.
Figure 41. shows hazard rate function of the second version for different levels of alpha and n=20.
Preprints 150412 g042
Figure 42. shows hazard rate function of the second version for different levels of alpha and n=30.
Figure 42. shows hazard rate function of the second version for different levels of alpha and n=30.
Preprints 150412 g043
Figure 43. shows hazard rate function of the second version for different levels of alpha and n=40.
Figure 43. shows hazard rate function of the second version for different levels of alpha and n=40.
Preprints 150412 g044
Figure 44. shows hazard rate function of the second version for different levels of alpha and n=50.
Figure 44. shows hazard rate function of the second version for different levels of alpha and n=50.
Preprints 150412 g045
Figure 45. shows hazard rate function of the second version for different levels of alpha and n=100.
Figure 45. shows hazard rate function of the second version for different levels of alpha and n=100.
Preprints 150412 g046
Figure 46. shows hazard rate function of the second version for different levels of alpha and n=150.
Figure 46. shows hazard rate function of the second version for different levels of alpha and n=150.
Preprints 150412 g047
Figure 47. shows hazard rate function of the second version for different levels of alpha and n=170.
Figure 47. shows hazard rate function of the second version for different levels of alpha and n=170.
Preprints 150412 g048
The figures of the hazard functions depict the different shapes the hazard function can attain. These shapes ranges from increasing, bath tub and J shaped appearance. The new finding is that when alpha level is large than or equal to one and with increasing n values the hazard rates exhibit oscillating pattern before it starts to approach infinity at the upper end of the unit interval. For example, at the same alpha level 1.8 and with increased values of n, this oscillating pattern is attained at lower values of the random variable . In other words, when n=50, the oscillating pattern is attained at y=0.8, when n=100, this oscillating patter is achieved at y= 0.6, when n=150, the oscillating pattern is captured at y=0.5, when n=170, the oscillating pattern is realized at y values between 0.4 and 0.5. This pattern is almost present at different values of n and alpha. The oscillating pattern in the different graphs abruptly end because the hazard rate values are infinite at these interrupts.
Another new finding is that at higher values of alpha more than two accompanied with higher n values between 20 and 30, the hazard rate function is concave; it starts at zero and then increasing until a specific peak after which it starts to fall down till reaching an oscillating pattern then it starts to approach infinity. This is obvious at alpha=2.2 and when n value is between 20 and 30 and thereafter up to n=150
Theorem 3: the rth raw moment of the first version of the distribution is given by
E y r = Γ 2 n + 2 Γ n + 1 Γ n + 1 + r α 2 Γ 2 n + 2 + r α 2
Proof: the expectation of the rth moment in equation (23) is obtained with the help of the transformation mentioned in equation (12)
E y r = 0 1   y r f y y d y = Γ 2 n + 2 Γ n + 1 Γ n + 1   1 α 2   0 1 y r   1 y α 2 n y n + 1 α 2 1   d y   . ( 23 )
= Γ 2 n + 2 Γ n + 1 Γ n + 1   1 α 2   0 1 y r   1 y α 2 n y n + 1 α 2 1 + r   d y  
= Γ 2 n + 2 Γ n + 1 Γ n + 1   0 1   1 w n w n + r α 2   d w  
= Γ 2 n + 2 Γ n + 1 Γ n + 1   Γ n + 1 Γ n + 1 + r α 2 Γ 2 n + 2 + r α 2 = Γ 2 n + 2 Γ n + 1 Γ n + 1 + r α 2 Γ 2 n + 2 + r α 2  
Theorem 4: the rth raw moment of the second version of the distribution is given by
E y r = Γ n + 1 Γ n + 1 2 Γ n + 1 2 + r α 2 Γ n + 1 + r α 2
Proof: the expectation of the rth moment in equation (24) is obtained with the help of the transformation mentioned in equation (12)
E y r = 0 1   y r f y y d y = Γ n + 1 Γ n 2 + 1 2 Γ n 2 + 1 2   1 α 2   0 1 y r   1 y α 2 n 1 2 y n + 1 2 α 2 1   d y   . ( 24 )
= Γ n + 1 Γ n 2 + 1 2 Γ n 2 + 1 2   1 α 2   0 1   1 y α 2 n 1 2 y n + 1 2 α 2 1 + r   d y
= Γ n + 1 Γ n 2 + 1 2 Γ n 2 + 1 2   0 1   1 w n 1 2 w n 1 2 + r α 2   d y
= Γ n + 1 Γ n + 1 2 Γ n + 1 2   Γ n + 1 2 Γ n + 1 2 + r α 2 Γ n + 1 + r α 2 = Γ n + 1 Γ n + 1 2 Γ n 2 + 1 2 + r α 2 Γ n + 1 + r α 2

Section 3: Real Data Analysis

Table 1 shows the flood data. These are 20 observations regarding the maximum flood level of the Susquehanna River at Harrisburg, Pennsylvania. (19).
Table 2 shows the descriptive statistics of the data with right skewness and mild positive excess kertosis (leprokurtic).
Table 3 shows the statistical analysis for first version of the Generalized Odd MBUR distribution and its competitors. It outperforms all of them as it has the highest value of log-likelihood and the most negative values for AIC, CAIC, and BIC. The K-S test fails to reject the null hypothesis which supports that the distribution fits the data well with p value 0.3297. Both the AD statistics and the CVM statistics has the lowest values among the other distribution which favors better fit of the data. These are the results for fitting the first version of the distribution.
The P-values for the estimators of alpha and beta parameters of the Beta distribution and Kumaraswamy distributions are significant p < 0.001 .
P-values for the estimators of alpha of the MBUR distribution is significant p < 0.001 .
P-values for the estimators of theta of the Unit Lindley distribution is significant p < 0.001 .
The generalization of the MBUR improves the statistical analysis of the previously analyzed data with the MBUR. It leverages the indices and this generalization makes the MBUR outperforms the beta and the Kumaraswamy distribution. Figure 48 shows the histogram and the fitted Generalized Odd MBUR which shows marked enhancement for fitting the data than the MBUR shown in Figure 49. This is also true if comparing the theoretical CDF of the Generalized Odd MBUR in Figure 50 than the one in Figure 51.
Figure 48. shows the histogram of the flood data and the fitted Generalized Odd MBUR distribution ( first version ) .
Figure 48. shows the histogram of the flood data and the fitted Generalized Odd MBUR distribution ( first version ) .
Preprints 150412 g049
Figure 49. shows the histogram of the flood data and the fitted MBUR distribution and other competitors.
Figure 49. shows the histogram of the flood data and the fitted MBUR distribution and other competitors.
Preprints 150412 g050
Figure 50. shows the empirical and the theoretical CDF after fitting the generalized Odd MBUR distribution (first version).
Figure 50. shows the empirical and the theoretical CDF after fitting the generalized Odd MBUR distribution (first version).
Preprints 150412 g051
Figure 51. shows the eCDF vs. theoretical CDF of the distributions for the flood data set. The purple curve is the MBUR. The generalization of MBUR markedly improves fitting the data as shown in figure 50.
Figure 51. shows the eCDF vs. theoretical CDF of the distributions for the flood data set. The purple curve is the MBUR. The generalization of MBUR markedly improves fitting the data as shown in figure 50.
Preprints 150412 g052
Table 4 shows the statistical analysis for second version of the Generalized Odd MBUR distribution and its competitors. The same results are obtained apart from the estimated n parameter value and its associated variance and standard error. Otherwise, it outperforms the other distributions for the same reasons as the first version of it. The value of the estimated n parameter is larger (17.2087) compared with (8.0302) while the value of the estimated alpha is the same(1.1168).
The P-values for the estimators of alpha and beta parameters of the Beta distribution and Kumaraswamy distributions are significant p < 0.001 .
P-values for the estimators of alpha of the MBUR distribution is significant p < 0.001 .
P-values for the estimators of theta of the Unit Lindley distribution is significant p < 0.001 .
Figure 52 shows the fitted PDF for the second version with slight difference than figure 48. The peak is larger than in figure 48. Figure 53 shows the CDF for this version.
Figure 52. shows the histogram of the flood data and the fitted Generalized Odd MBUR distribution (second version).
Figure 52. shows the histogram of the flood data and the fitted Generalized Odd MBUR distribution (second version).
Preprints 150412 g053
Figure 53. shows the empirical and the theoretical CDF after fitting the Generalized Odd MBUR distribution (second version).
Figure 53. shows the empirical and the theoretical CDF after fitting the Generalized Odd MBUR distribution (second version).
Preprints 150412 g054

Section 4: Conclusion

The addition of new parameter to the previously studied MBUR distribution enhances the capability of BMUR to fit the data. Two versions for generalizing the MBUR were discussed by the author. Both have a non-explicit closed form of the CDF, but they can be expressed using special function. Subsequently, the quantile functions for both versions are not expressed in closed form. Also the rth moments of both versions are expressed with the aid of special function. This is considered as a limitation for being used in applications like median based quantile regression or mean based regression in generalized linear model. The advantage of adding this new parameter is that it helps BMUR to fit near symmetric data. Moreover, the two versions of this Generalized Odd MBUR exhibit a new shape for the hazard rate in the form of oscillating patter at the end of the distribution before approaching infinity and at different values of the random variable depending on the level of the alpha and the n parameter.

Future work

Using Bayesian analysis can enhance the analysis of the data.

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable

Availability of data and material

Not applicable. Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.

Competing interests

The author declares no competing interests of any type.

Funding

No funding resource. No funding roles in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript are declared

Authors’ contribution

AI carried the conceptualization by formulating the goals, aims of the research article, formal analysis by applying the statistical, mathematical and computational techniques to synthesize and analyze the hypothetical data, carried the methodology by creating the model, software programming and implementation, supervision, writing, drafting, editing, preparation, and creation of the presenting work.

Acknowledgement

Not applicable

References

  1. Johnson, N.L. Systems of Frequency Curves Generated by Methods of Translation. Biometrika. 1949, 36, 149–176. [Google Scholar] [CrossRef] [PubMed]
  2. Eugene, N.; Lee, C.; Famoye, F. Beta-Normal Distribution And Its Applications. Communications in Statistics - Theory and Methods. 2002, 31, 497–512. [Google Scholar] [CrossRef]
  3. Gündüz, S.; Korkmaz, M.Ç. A New Unit Distribution Based On The Unbounded Johnson Distribution Rule: The Unit Johnson SU Distribution. Pak.j.stat.oper.res. 2020, 471–490. [Google Scholar] [CrossRef]
  4. Topp, C.W.; Leone, F.C. A Family of J-Shaped Frequency Functions. Journal of the American Statistical Association. 1955, 50, 209–219. [Google Scholar] [CrossRef]
  5. Consul, P.C.; Jain, G.C. On the log-gamma distribution and its properties. Statistische Hefte. 1971, 12, 100–106. [Google Scholar] [CrossRef]
  6. Grassia, A. On A Family Of Distributions With Argument Between 0 And 1 Obtained By Transformation Of The Gamma And Derived Compound Distributions. Australian Journal of Statistics. 1977, 19, 108–114. [Google Scholar] [CrossRef]
  7. Mazucheli, J.; Menezes, A.F.B.; Dey, S. Improved maximum-likelihood estimators for the parameters of the unit-gamma distribution. Communications in Statistics - Theory and Methods. 2018, 47, 3767–3778. [Google Scholar] [CrossRef]
  8. Tadikamalla, P.R. On a family of distributions obtained by the transformation of the gamma distribution. Journal of Statistical Computation and Simulation. 1981, 13, 209–214. [Google Scholar] [CrossRef]
  9. Tadikamalla, P.R.; Johnson, N.L. Systems of frequency curves generated by transformations of logistic variables. Biometrika. 1982, 69, 461–465. [Google Scholar] [CrossRef]
  10. Kumaraswamy, P. A generalized probability density function for double-bounded random processes. Journal of Hydrology. 1980, 46, 79–88. [Google Scholar] [CrossRef]
  11. Modi, K.; Gill, V. Unit Burr-III distribution with application. Journal of Statistics and Management Systems. 2020, 23, 579–592. [Google Scholar] [CrossRef]
  12. Haq, M.A.U.; Hashmi, S.; Aidi, K.; Ramos, P.L.; Louzada, F. Unit Modified Burr-III Distribution: Estimation, Characterizations and Validation Test. Ann Data Sci. 2023, 10, 415–440. [Google Scholar] [CrossRef]
  13. Korkmaz MÇ, Chesneau C. On the unit Burr-XII distribution with the quantile regression modeling and applications. Comp Appl Math 2021, 40, 29.
  14. Mazucheli, J.; Maringa, A.F.; Dey, S. Unit-Gompertz Distribution with Applications. Statistica 2019, 79, 25–43. [Google Scholar]
  15. Mazucheli, J.; Menezes, A.F.B.; Chakraborty, S. On the one parameter unit-Lindley distribution and its associated regression model for proportion data. Journal of Applied Statistics. 2019, 46, 700–714. [Google Scholar] [CrossRef]
  16. Mazucheli, J.; Menezes, A.F.B.; Fernandes, L.B.; De Oliveira, R.P.; Ghitany, M.E. The unit-Weibull distribution as an alternative to the Kumaraswamy distribution for the modeling of quantiles conditional on covariates. Journal of Applied Statistics. 2020, 47, 954–974. [Google Scholar] [CrossRef] [PubMed]
  17. Mazucheli, J.; Menezes, A.F.; Dey, S. The Unit-Birnbaum-Saunders distribution with applications. Chelian Journal of Statisitcs 2018, 9, 47–57. [Google Scholar]
  18. Maya, R.; Jodrá, P.; Irshad, M.R.; Krishna, A. The unit Muth distribution: statistical properties and applications. 2024, 73, 1843–1866. [Google Scholar] [CrossRef]
  19. Dumonceaux, R.; Antle, C.E. Discrimination Between the Log-Normal and the Weibull Distributions. Technometrics. 1973, 15, 923–926. [Google Scholar] [CrossRef]
Table 1. shows Flood Data set.
Table 1. shows Flood Data set.
0.26 0.27 0.3 0.32 0.32 0.34 0.38 0.38 0.39 0.4
0.41 0.42 0.42 0.42 045 0.48 0.49 0.61 0.65 0.74
Table 2. Descriptive statistics of the fourth data set.
Table 2. Descriptive statistics of the fourth data set.
min mean std skewness kurtosis 25p 50p 75p max
0.26 0.4225 0.1244 1.1625 4.2363 0.33 0.405 0.465 0.74
Table 3. statistical analysis for the flood data set with the first version of the Generalized Odd MBUR distribution:.
Table 3. statistical analysis for the flood data set with the first version of the Generalized Odd MBUR distribution:.
Beta Kumaraswamy Generalized odd MBUR Topp-Leone Unit-Lindley
theta α = 6.8318 α = 3.3777 n = 8.1044 2.2413 1.6268
β = 9.2376 β = 12.0057   α = 1.1168
Var 7.22 7.2316 0.3651 2.8825 8.0302 0.0177 0.2512 0.0819
7.2316 8.0159 2.8825 29.963 0.0177 0.0018
SE 0.6008 ( alpha) 0.1351 (alpha) 0.6336 (n) 0.1121 0.0639
0.6331 ( beta) 1.2239 ( beta) 0.0095 ( alpha)
AIC -24.3671 -21.9465 -24.4562 -12.7627 -12.3454
CAIC -23.6613 -21.2407 -23.7503 -12.5405 -12.1231
BIC -22.3757 -19.9551 -22.4647 -11.767 -11.3496
LL 14.1836 12.9733 14.2281 7.3814 7.1727
K-S 0.2063 0.2175 0.2040 0.3409 0.2625
H0 Fail to reject Fail to reject Fail to reject Reject Fail to reject
P-value 0.3174 0.2602 0.3297 0.0141 0.0311
AD 0.7302 0.9365 0.7153 2.9131 2.3153
CVM 0.1242 0.1653 0.1205 0.5857 0.4428
Table 4. statistical analysis for the flood data with the second version of the Generalized Odd MBUR.
Table 4. statistical analysis for the flood data with the second version of the Generalized Odd MBUR.
Beta Kumaraswamy Generalized odd MBUR Topp-Leone Unit-Lindley
theta α = 6.8318 α = 3.3777 n = 17.2087 2.2413 1.6268
β = 9.2376 β = 12.0057   α = 1.1168
Var 7.22 7.2316 0.3651 2.8825 32.1208 0.0354 0.2512 0.0819
7.2316 8.0159 2.8825 29.963 0.0354 0.0018
SE 0.6008 ( alpha) 0.1351 (alpha) 1.2673 (n) 0.1121 0.0639
0.6331 ( beta) 1.2239 ( beta) 0.0095 ( alpha)
AIC -24.3671 -21.9465 -24.4562 -12.7627 -12.3454
CAIC -23.6613 -21.2407 -23.7503 -12.5405 -12.1231
BIC -22.3757 -19.9551 -22.4647 -11.767 -11.3496
LL 14.1836 12.9733 14.2281 7.3814 7.1727
K-S 0.2063 0.2175 0.2040 0.3409 0.2625
H0 Fail to reject Fail to reject Fail to reject Reject Fail to reject
P-value 0.3174 0.2602 0.3297 0.0141 0.0311
AD 0.7302 0.9365 0.7153 2.9131 2.3153
CVM 0.1242 0.1653 0.1205 0.5857 0.4428
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated