Introduction
Unit distributions model and fit the proportionate data and ratio. They have wide applications in diverse fields like economics, finance, biology, medicine, hydrology, engineering, and sociology. A better understanding of the distribution fitting the data helps apply these data in various statistical applications like regression analysis, survival analysis, and time series analysis. Unit distribution can be derived by the following transformations . There are many continuous distributions that model these data. Some of these distributions are the unit Gamma Lindley distribution (Karakaya & Sağlam, 2025), the unit half logistic geometric distribution (Ramadan et al., 2022), the logit truncated exponential skew logistic distribution (Pang et al., 2021), the arc-secant hyperbolic Weibull distribution (Korkmaz et al., 2023), the Vasicek distribution (Mazucheli et al., 2022), the logit slash distribution (Korkmaz, 2020), and the median-based unit Rayleigh distribution and the references therein (I. M. Attia, 2025a).
The generalization of the unit distribution can be achieved through different mechanisms like power transformation to obtain the power Johnson B (Cancho et al., 2020), power Generalized Johnson SB (Gallardo et al., 2022), and power unit inverse Lindley distribution (Gemeay et al., 2024). The generalization can also be conducted using the T-X family method (transformed-transformer mechanism) like the transmuted power unit inverse Lindley distribution (Eldessouky et al., 2025), Kumaraswamy generalized family of distribution (Tahir et al., 2020), generalized distribution based on T-Topp –Leone family of distributions (Sudsila et al., 2022), and the generalized unit half logistic geometric distributions (Nasiru et al., 2023).
The author discusses in this paper a different method for adding new parameter to the unit distribution using the general formula for the order statistics. Then the author demonstrates this by examples of different new distributions.
The paper is arranged into the following section. Section 1 explores the derivations of the different unit distributions discussed in the paper. Section 2 elaborates on some of the basic functions and properties of these distributions. Section 3 discusses the Maximum likelihood Estimators. Section 4 demonstrates real data analysis with expanded discussion. Section 5 comprehends the conclusions. Section 6 suggests the future work.
Section One: Derivation of the Some Unit Distribution and Their Generalization:
Preposition 1: Kumaraswamy distribution has the following PDF seen in equation (1)
Proof: This distribution can be derived from the inverse Weibull distribution (IW) that has the following PDF and CDF in equation (2). Applying the transformation in equation (3) and the Jacobian in equation (4)
Substituting equation (3) & (4) into (2) gives the so called unit power distribution with its PDF & CDF seen in equation (5) & (6).
The general formula for the ith order statistics in sample size n is defined in equation (7)
Using the smallest order statistics formula, in other words, i=1, and substituting the parent distribution of unit power distribution derived from the IW distribution as previously explained gives the well-known Kumaraswamy distribution. Let
, so equation (8) is the PDF of the Kumaraswamy. Therefore, Kumaraswamy distribution can be considered as generalization of the unit power distribution derived from the transformation of the IW distribution into unit power distribution. Because the unit power distribution in equation (5) has only one parameter
Using the smallest order statistics formula and substituting the PDF and CDF for this unit power distribution yield Kumaraswamy.
What if we use the smallest order statistics and substitute the PDF and CDF of the IW distribution as the parent distribution, then apply the transformation and Jacobian in equations (3-4), this yields the same result, the well-known Kumaraswamy distribution.
Preposition 2: Fatima 1 distribution has the following PDF seen in equation (9).
Proof: Using the largest order statistics formula, substituting the PDF & CDF of the unit power distribution seen in equation (5), substituting n=i, and let
, the new generalized form of unit distribution is shown in equation (9).
Figure 1 illustrates the PDF and the hazard rate function of the distribution.
The same distribution is acquired if using the formula of the largest order statistics with the PDF & CDF of the IW distribution as the parent distribution then applying the transformation and the Jacobian in equation (3-4).
Preposition 3: Fatima2 distribution has the following PDF in equation (10).
Proof: Using the i
th order statistics formula will result in generalized form of the unit power distribution with 3 parameters as shown in equation (10). Let
. The same distribution is gained if using the i
th order statistics, replacing the PDF & CDF of the IW as the parent distribution then applying the transformation and Jacobian of equations (3-4).
Figure 2 shows the PDF and the hazard rate function.
Transforming the Rayleigh distribution which has PDF & CDF in equation (11) into unit Rayleigh distribution can be obtained by employing the transformation and the Jacobian seen in equation (12). The unit Rayleigh distribution has a PDF & CDF in equation (13) & (14) respectively.
Preposition 4: Fatima 3 distribution had PDF seen in equation 15
Proof: Substituting equation (13) & (14) into the smallest order statistics formula gives the new generalized unit distribution in equation (15). Let
This result can also be obtained if we start with the largest order statistics formula and substitute Rayleigh distribution as the parent distribution with the PDF & CDF in equation (11) then applying the transformation and Jacobian in equation (12).
Figure 3 depicts the PDF and the hazard rate function of the distribution.
Preposition 5: Fatima 4 distribution has the following PDF in equation 16
Proof: Substituting the PDF & CDF of equation (13-14) into the largest order statistics formula gives the new generalized unit distribution in equation (16). Let
.
The same result is achieved if we start with the smallest order statistics formula and substitute the Rayleigh distribution with the PDF & CDF in equation (11) then utilizing the transformation and Jacobian in equation (12).
Figure 4 illustrates the PDF and the hazard rate function of the distribution.
Preposition 6: the Median Based Unit Rayleigh (Fatima 5) and its generalized form (Fatima 6 & Fatima 7) have the PDF in equations (17-19) respectively.
Proof: Using the PDF & CDF of Rayleigh distribution in equation (11) as the parent distribution , applying the transformation and Jacobian in equation (12) then substituting in the i
th order statistics with i=2 and n=3 , gives the median based unit Rayleigh ( MBUR) distribution discussed by ( Attia,2025) as shown in equation (17). The author (I. M. Attia, 2025b) & (I. Attia, 2025) discussed the generalization of this distribution using the odd median order statistics as seen in equations (18-19).
Section Two: Some Basic Properties
In this section, the author demonstrates some of the basic properties for each of the new unit distributions. This includes the raw moments and the quantile function.
Preposition 7: Fatima 1 has the following raw moments and quantile function seen in equations (20-21) respectively.
_________________________________________________________________
Preposition 8: Fatima 2 has the following raw moments in equation (22) but it has no closed quantile function because it has no closed CDF.
Preposition 9: Fatima 3 has the following raw moments and quantile function seen in equations (23-24) respectively
_________________________________________________________________
Preposition 10: Fatima 4 has the following raw moments and quantile function seen in equations (25-26) respectively.
Section Three: Estimation Method (MLE)
Preposition 11: Fatima 1 has log of PDF in equation 27 and its partial derivative with respect to each parameter in equation (28) & (29)
Preposition 12: Fatima 2 has log of PDF in equation 30 and its partial derivative with respect to each parameter in equation (31), (32) & (33)
Preposition 13: Fatima 3 has log of PDF in equation 34 and its partial derivative with respect to each parameter in equation (35) & (36).
Preposition 14: Fatima 4 distribution has log of PDF in equation 37 and its partial derivative with respect to each parameter in equation (38) & (39).
Section Four: Real Data Analysis and Discussion
The data sets are derived from the OECD, or Organization for Economic Co-operation and Development.
https://stats.oecd.org/index.aspx?DataSetCode=BLI . It provides information on the economy, social events, education, health, labor, and the environment in the member countries. Matlab 2014 R was used for analysis where the MLE function utilizes the derivative free Nelder-Mead algorithm for optimization. The author analyzes the water quality indicator. This quality is measured through self-reported satisfaction or by tracking the availability of clean drinking water and the extent of water pollution. In this database, the water quality is expressed as percentage of population satisfied with the quality of water or subjective perception of water safety and cleanliness. It is also presented as the proportion of population with access to water that is free of contamination and available when needed. The dataset is 0.92, 0.92, 0.79, 0.90, 0.62, 0.82, 0.87, 0.89, 0.93, 0.86, 0.97, 0.78, 0.91, 0.67, 0.81, 0.97, 0.80, 0.77, 0.77, 0.87, 0.82, 0.83, 0.83, 0.85, 0.75, 0.91, 0.85, 0.98, 0.82, 0.89, 0.81, 0.93, 0.76, 0.97, 0.96, 0.62, 0.82, 0.88, 0.7, 0.62, and 0.72.
Table 1 demonstrates the descriptive analysis of the water quality dataset.
Figure (5) depicts the boxplot of the data. The data shows left skewness and mild platykurtic appearance.
Table 2 demonstrates that all the competitors’ distributions fit the data well as the null hypothesis test fails to reject the assumption that any of the distribution can generate the data. The P-value of the KS-test is significant for all of the fitted distributions. The beta distribution is the best to fit the data because it has the most negative AIC, CAIC, BIC and, HQIC followed by Fatima 2 distribution, then Kumaraswamy distribution and lastly Fatima 1 distribution. The variance of the estimated parameter obtained from fitting Fatima 1 and Fatima 2 is very large which may indicate correlation between the parameters and this requires more sophisticated methods to solve for parameter estimation.
Table 3 shows that all the distributions fit the data well. Fatima 6 and Fatima 7 fit the data better than Fatima 5, followed by Fatima 3 and lastly Fatima 4. This is supported by the higher Log-likelihood and the most negative AIC, CAIC, BIC and HQIC. Variance –covariance of the estimated parameter gained from fitting Fatima 4 is very large which necessitate deploying other methods to manage this large variance.
Figure 6 and
Figure 7 demonstrate the fitted PDF and CDF for the different unit distributions discussed in this paper.
From the above discussion, all the distributions are more or less comparable to each other; they are nearly equal to fit the dataset except Fatima 1 and Fatima 4, as they fit the data with lesser value of Log-likelihood (34.9508). Although Fatima 1 is different from Fatima 4, however; the statistical indices like AIC, CAIC, BIC, HQIC, LL, KS, CVM, and AD are almost equal. The variance of the estimated parameters obtained from fitting Fatima 1, Fatima 2, and Fatima 4 distributions are very large. This may point to correlation between the parameters and so other methods are needed to mitigate this correlation for better parameter estimation and hence better construction of the confidence interval.
Section Five: Conclusion
Generalization of the unit distribution is a challenge. The most frequent methods are the power transformation, methods utilizing Beta-generated or Kumaraswamy-generated families for a parent distribution of a random variable defined over the unit interval, or the transformer-transformed (T-X) family method. In this paper the author used a different method utilizing the general formula of the order statistics for a parent distribution of a variable defined over the unit interval. Kumaraswamy distribution is obtained from Inverse Weibull (IW) distribution by transforming this IW into unit power distribution and substituting the PDF and CDF of the new unit power distribution into the general formula of the smallest order statistics. We can also gain Kumaraswamy from general formula of the smallest order statistics substituting the IW PDF and CDF then applying the transformation of parent variable into a variable defined over unit interval as discussed in Section 1. The largest order statistics formula and IW distribution can be used to derive Fatima 1. While the ith order statistics and the IW distribution are utilized to derive Fatima 2 distribution. By transforming the IW distribution into one-parameter unit power distribution then applying the general order statistics, a new shape parameter can be added to this unit distribution yielding different unit distributions according to the formula of the order statistics used.
Rayleigh distribution can be transformed into one-parameter unit Rayleigh then replacing the PDF and the CDF of this unit Rayleigh into the smallest order statistics yields Fatima 3 distributions. Moreover, Fatima 4 can be derived using the unit Rayleigh PDF and CDF as the parent distribution in the largest order statistics formula. Fatima 5 which is the Median Based Unit Rayleigh (MBUR) distribution originates from the general formula of the median order statistics with sample size equals 3 and generalization of this MBUR can be attained from using the general formula with an odd sample size to get Fatima 6 and Fatima 7. This method leads to an addition of a new parameter to the one-parameter unit Rayleigh distribution.
These new unit distributions give different estimators of their parameters with different variances. Some of these variances are very large pointing to the possibility of correlation between the newly added parameter and the existing old parameter. Hence, other methods should be deployed to mitigate this correlation and deflate the variance for better construction of confidence interval.
Section Six: Future Work
Estimation of the parameters of these unit distributions can be evaluated with other methods like least square method, weighted least square method, maximum product of spacing, Cramer Von Mises Estimator and Anderson Darling estimator. Bayesian inference can also be evaluated. Other properties of these distributions like entropy, Lorenz curve, Bonferroni curve, Gini index, probability weighted moments and mean residual life function are candidates for further study. Also stochastic ordering is a booster for further investigation. These distributions can be applied in analysis of censored data as well as survival data. They can be utilized in regression analysis as well as in time series analysis.
Funding
No funding resource. No funding roles in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript are declared
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable
Availability of data and material
Not applicable. Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.
Competing interests
The author declares no competing interests of any type.
Authors’ contribution
AI carried the conceptualization by formulating the goals, aims of the research article, formal analysis by applying the statistical, mathematical and computational techniques to synthesize and analyze the hypothetical data, carried the methodology by creating the model, software programming and implementation, supervision, writing, drafting, editing, preparation, and creation of the presenting work.
Acknowledgement
Not applicable
References
- Attia, I. (2025). The New Generalized Odd Median Based Unit Rayleigh with a New Shape Oscillating Hazard Rate Function (No. arXiv:2503.11668). arXiv. [CrossRef]
- Attia, I. M. (2025a). A Novel Unit Distribution Named As Median Based Unit Rayleigh (MBUR): Properties and Estimations (No. arXiv:2410.04132). arXiv. [CrossRef]
- Attia, I. M. (2025b). Comparative Study of the Median Based Unit Rayleigh and its Generalized Form the Generalized Odd Median Based Unit Rayleigh (No. arXiv:2503.11700). arXiv. [CrossRef]
- Cancho, V. G., Bazán, J. L., & Dey, D. K. (2020). A new class of regression model for a bounded response with application in the study of the incidence rate of colorectal cancer. Statistical Methods in Medical Research, 29(7), 2015–2033. [CrossRef]
- Eldessouky, E. A., Hassan, O. H. M., Aloraini, B., & Elbatal, I. (2025). Modeling to medical and economic data using: The transmuted power unit inverse Lindley distribution. Alexandria Engineering Journal, 113, 633–647. [CrossRef]
- Gallardo, D. I., Bourguignon, M., Gómez, Y. M., Caamaño-Carrillo, C., & Venegas, O. (2022). Parametric Quantile Regression Models for Fitting Double Bounded Response with Application to COVID-19 Mortality Rate Data. Mathematics, 10(13), 2249. [CrossRef]
- Gemeay, A. M., Alsadat, N., Chesneau, C., & Elgarhy, M. (2024). Power unit inverse Lindley distribution with different measures of uncertainty, estimation and applications. AIMS Mathematics, 9(8), 20976–21024. [CrossRef]
- Karakaya, K., & Sağlam, Ş. (2025). Unit Gamma-Lindley Distribution: Properties, Estimation, Regression Analysis, and Practical Applications. Gazi University Journal of Science, 38(2), 1021–1040. [CrossRef]
- Korkmaz, M. Ç. (2020). A new heavy-tailed distribution defined on the bounded interval: The logit slash distribution and its application. Journal of Applied Statistics, 47(12), 2097–2119. [CrossRef]
- Korkmaz, M. Ç., Chesneau, C., & Korkmaz, Z. S. (2023). A new alternative quantile regression model for the bounded response with educational measurements applications of OECD countries. Journal of Applied Statistics, 50(1), 131–154. [CrossRef]
- Mazucheli, J., Alves, B., Korkmaz, M. Ç., & Leiva, V. (2022). Vasicek Quantile and Mean Regression Models for Bounded Data: New Formulation, Mathematical Derivations, and Numerical Applications. Mathematics, 10(9), 1389. [CrossRef]
- Nasiru, S., Chesneau, C., Abubakari, A. G., & Angbing, I. D. (2023). Generalized Unit Half-Logistic Geometric Distribution: Properties and Regression with Applications to Insurance. Analytics, 2(2), 438–462. [CrossRef]
- Pang, L., Tian, W., Tong, T., & Chen, X. (2021). Logit Truncated-Exponential Skew-Logistic Distribution with Properties and Applications. Modelling, 2(4), 776–794. [CrossRef]
- Ramadan, A. T., Tolba, A. H., & El-Desouky, B. S. (2022). A Unit Half-Logistic Geometric Distribution and Its Application in Insurance. Axioms, 11(12), 676. [CrossRef]
- Sudsila, P., Thongteeraparp, A., Aryuyuen, S., & Bodhisuwan, W. (2022). The Generalized Distributions on the Unit Interval based on the T-Topp-Leone Family of Distributions. Trends in Sciences, 19(19), 6186. [CrossRef]
- Tahir, M. H., Hussain, M. A., Cordeiro, G. M., El-Morshedy, M., & Eliwa, M. S. (2020). A New Kumaraswamy Generalized Family of Distributions with Properties, Applications, and Bivariate Extension. Mathematics, 8(11), 1989. [CrossRef]
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).