Flexible Hazard Modeling with Segmented Distributions

Guillermo Martínez Flórez; Rafael Bráz Azevedo Farias; Carlos Javier Barrera Causil

doi:10.20944/preprints202504.1011.v1

Submitted: 11 April 2025

Posted: 14 April 2025


Abstract

This paper introduces a flexible family of segmented proportional hazard distributions designed to model abrupt changes in hazard rates, which are often observed in medical and engineering applications. The proposed framework generalizes the proportional hazard transformation to segmented distributions, including new forms of the Rayleigh, log-logistic, Lindley, and Laplace PH models. We develop a maximum likelihood estimation procedure incorporating right censoring, a key feature of real-world survival data. The segmented hazard models effectively capture structural breaks in the hazard function, providing a robust alternative to traditional survival models that assume constant hazard dynamics. A case study based on IQ score data illustrates the improved flexibility and interpretability of the segmented Laplace PH model in detecting latent change points. The proposed models enhance the capacity to model complex survival patterns with abrupt changes in risk, contributing to a deeper understanding of dynamic hazard processes.

Keywords: segmented distributions; proportional hazard models; survival analysis; structural change; maximum likelihood; Laplace distribution; right censoring

Subject: Computer Science and Mathematics - Probability and Statistics

1. Introduction

Proportional hazard models have become essential tools in survival analysis, particularly in fields like medicine and engineering, where time-to-event data often play a critical role in understanding treatment effects, risk factors, and patient outcomes [21]. These models are widely used to analyze survival data by adjusting the hazard function and incorporating time-dependent covariates, which are highly prevalent in real-world medical studies. However, traditional models often assume constant hazard rates over time, failing to account for abrupt changes in failure or risk rates that are frequently observed in clinical and environmental studies.

In medical research, for instance, sudden changes in failure rates may occur as a result of treatment interventions, disease progression, or other factors. A patient undergoing a new medication regimen may experience different risk levels before, during, and after treatment, with the hazard rate shifting accordingly. Similarly, environmental studies often exhibit abrupt changes in variables like pollution levels or temperature due to external factors, leading to significant variations in failure rates over time [5,10].

Traditional models struggle to capture these dynamic changes. For this reason, there is a growing need for more flexible statistical models that can accommodate multiple change points in the hazard function. Recent approaches, such as the exponential piecewise model [9], power piecewise exponential model [11], and exponentiated Weibull hazard function [16], have made progress in modeling these complexities. However, these methods are often limited to specific distributions and lack the versatility needed for more complex data structures.

In this paper, we introduce a family of segmented proportional hazard models that extend beyond current methodologies by offering greater flexibility in modeling data with multiple change points in hazard rates. Our proposed models, including segmented versions of the Rayleigh, log-logistic, Lindley, and Laplace proportional hazard distributions, provide a powerful tool for analyzing survival data across various fields, particularly in medicine and engineering, where abrupt changes are common. By incorporating a censoring mechanism and employing maximum likelihood estimation, our models offer a robust alternative for analyzing complex survival data with high degrees of variability over time or space.

The remainder of this paper is organized as follows. Section 2 introduces the Proportional Hazard Distribution (PHD), which serves as the cornerstone for constructing the subsequent segmented models. Section 3 presents the proposed family of segmented proportional hazard distributions, including the segmented Rayleigh, log-logistic, Lindley, and Laplace variants (SLPH). In Section 4, we describe the maximum likelihood estimation procedure for these models, including the treatment of right-censored data, following the approach of [6]. Section 5 illustrates the practical application of the proposed methodology using real data, comparing the performance of the SLPH model against various alternative models such as the segmented Laplace (LS), segmented half-normal (HNS), segmented half-normal proportional hazard (HNSHP), standard Laplace, skew-normal (SN), Gamma, normal, and mixture of normals. Finally, Section 6 concludes the paper with a summary of findings and directions for future research.

2. Proportional Hazard Distribution

The Proportional Hazard Distribution (PHD) arises as an extension of the distribution of the sample minimum, in which the positive integer sample size n is replaced by a real parameter α \in R^{+}, for a random variable Z with Probability Density Function (PDF) f and Cumulative Distribution Function (CDF) F. The resulting distribution can therefore be regarded as that of a fractional order statistic. We now define the PHD, following [15].

Let F be a continuous Cumulative Distribution Function (CDF) with Probability Density Function (PDF) f and hazard function h = f / (1 - F). A random variable Z follows a proportional hazard distribution associated with F (f) and the parameter α > 0 if its probability density function is given by

φ_{F} (z; α) = α f (z) {[1 - F (z)]}^{α - 1}, z \in R,

(1)

where α is a positive real number. We use the notation Z \sim P H D (α). The distribution function of the PHD model is

G_{F} (z) = 1 - {[1 - F (z)]}^{α}, z \in R .

(2)

The hazard (risk) function of this model is h_{G} (z; α) = α h (z), where h (\cdot) is the hazard function associated with the PDF f (\cdot). In other words, the hazard function is proportional to that of f (\cdot), which explains the name “PDF with proportional hazard.”
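As a minimal numerical sketch of equations (1) and (2), the Python snippet below builds the PHD density and distribution function from a base pair (f, F) and checks that the resulting hazard is α times the base hazard. The Rayleigh base with σ = 1, the helper names, and the values α = 2.5 and z = 1.3 are illustrative assumptions, not part of the original development.

```python
import math


def rayleigh_pdf(z, sigma=1.0):
    """Base density f (assumed Rayleigh, for illustration only)."""
    return (z / sigma**2) * math.exp(-z**2 / (2 * sigma**2)) if z >= 0 else 0.0


def rayleigh_cdf(z, sigma=1.0):
    """Base distribution function F."""
    return 1.0 - math.exp(-z**2 / (2 * sigma**2)) if z >= 0 else 0.0


def phd_pdf(z, alpha, pdf, cdf):
    """Equation (1): alpha * f(z) * [1 - F(z)]^(alpha - 1)."""
    return alpha * pdf(z) * (1.0 - cdf(z)) ** (alpha - 1.0)


def phd_cdf(z, alpha, cdf):
    """Equation (2): 1 - [1 - F(z)]^alpha."""
    return 1.0 - (1.0 - cdf(z)) ** alpha


def hazard(z, pdf, cdf):
    """Base hazard h = f / (1 - F)."""
    return pdf(z) / (1.0 - cdf(z))


alpha = 2.5  # hypothetical value
z = 1.3      # hypothetical evaluation point

base_h = hazard(z, rayleigh_pdf, rayleigh_cdf)
phd_h = phd_pdf(z, alpha, rayleigh_pdf, rayleigh_cdf) / (
    1.0 - phd_cdf(z, alpha, rayleigh_cdf)
)
hazard_ratio = phd_h / base_h  # should reproduce alpha up to rounding
print(hazard_ratio)
```

Setting `alpha = 1.0` in `phd_cdf` returns the base CDF, which is the special case in which the PHD reduces to the base distribution.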

Within the proportional hazard family of distributions, when f (\cdot) belongs to a family indexed by a parameter vector θ, it suffices for h (x; θ) to be fully determined. This ensures the identifiability of the parameters within the PHD model.

Indeed, much as in [2], consider two PDFs belonging to the proportional hazard family, characterized by the vectors (θ_{1}, α_{1}) and (θ_{2}, α_{2}). Since both distributions belong to the proportional hazard family, we have h_{i} (x; θ_{i}, α_{i}) = α_{i} h (x; θ_{i}). Now, if for (θ_{1}, α_{1}) \neq (θ_{2}, α_{2}) we find α_{1} h (x; θ_{1}) = α_{2} h (x; θ_{2}), then α_{1} / α_{2} = h (x; θ_{2}) / h (x; θ_{1}). Notably, if θ_{1} = θ_{2}, this yields h (x; θ_{2}) / h (x; θ_{1}) = 1, implying α_{1} = α_{2}, which contradicts the assumption of distinct parameters for the two distributions.

Now, when θ_{1} \neq θ_{2} and the ratio of base hazards is not constant in x, at least one pair x_{i} \neq x_{i^{'}} exists such that \frac{h (x_{i}; θ_{2})}{h (x_{i}; θ_{1})} \neq \frac{h (x_{i^{'}}; θ_{2})}{h (x_{i^{'}}; θ_{1})}. Since α_{1} / α_{2} is constant, the equality α_{1} h (x; θ_{1}) = α_{2} h (x; θ_{2}) cannot hold at both points. These findings show that distinct parameter sets cannot lead to the same risk function. Consequently, it suffices for the risk function to be determined to ensure the identifiability of the model parameters.

Now, we consider the scenario in which the density function f (\cdot; θ) takes on a segmented form, which yields a more flexible model for abrupt changes in the failure rate caused by experimental factors acting within specific intervals of time or space.

3. Segmented Proportional Hazard Distribution

If the risk function h (x; θ) of the base PDF f (\cdot; θ) behaves differently over different parts of its range, the domain of h (\cdot; θ) can be partitioned into L intervals delimited by points l_{1} < l_{2} < \dots < l_{L - 1} < l_{L} < \infty, with the convention l_{L + 1} = \infty. The failure rate in the interval [l_{i}, l_{i + 1}) is then written h_{i} (x). Consequently, the segmented hazard function h (x), with h_{i} (x) > 0, can be defined as follows:

h (x) = \{\begin{matrix} 0, & if x < l_{1}, \\ h_{1} (x), & if l_{1} \leq x < l_{2}, \\ h_{2} (x), & if l_{2} \leq x < l_{3}, \\ ⋮ \\ h_{i} (x), & if l_{i} \leq x < l_{i + 1}, \\ ⋮ \\ h_{L} (x), & if l_{L} \leq x . \end{matrix}

(3)

In a similar way, we have that

S (x) = 1 - F (x) = \{\begin{matrix} S_{1} (x), & if l_{1} \leq x < l_{2}, \\ S_{2} (x), & if l_{2} \leq x < l_{3}, \\ ⋮ \\ S_{i} (x), & if l_{i} \leq x < l_{i + 1}, \\ ⋮ \\ S_{L} (x), & if l_{L} \leq x, \end{matrix}

(4)

Here, F_{i} (x) = 1 - S_{i} (x) must be a monotonically non-decreasing and continuous function, while F (x) must be continuous at the common points; specifically, F_{i} (l_{i + 1}) = F_{i + 1} (l_{i + 1}) for i = 1, 2, 3, \dots, L - 1. Similarly, f_{i} (\cdot), the PDF of F_{i} (x), should also be continuous at these common points, i.e., f_{i} (l_{i + 1}) = f_{i + 1} (l_{i + 1}). These conditions ensure that F (\cdot) is a continuous and differentiable (smooth) function, and that f (\cdot) is continuous at the shared points. As a result, S_{i} (l_{i + 1}) = S_{i + 1} (l_{i + 1}), f_{i} (l_{i + 1}) = f_{i + 1} (l_{i + 1}), and h_{i} (l_{i + 1}) = h_{i + 1} (l_{i + 1}), so that S, f, and h are all continuous at the change points.

From the definition of the risk function, we have that

f_{i} (x) = h_{i} (x) S_{i} (x), x \in R, i = 1, 2, 3, \dots, L .

(5)

Now, according to the definition of the cumulative risk function, and taking the lower limit of the support to be l_{1}, it follows that

H_{i} (x) = - \log [S_{i} (x)] = \int_{l_{1}}^{x} h (y) d y,

that is,

\begin{matrix} S_{i} (x) & = & exp [- H_{i} (x)] \\ & = & exp [- \int_{l_{1}}^{x} h (y) d y] \\ & = & exp [- \int_{l_{1}}^{l_{2}} h_{1} (y) d y - \int_{l_{2}}^{l_{3}} h_{2} (y) d y - \dots - \int_{l_{i - 1}}^{l_{i}} h_{i - 1} (y) d y - \int_{l_{i}}^{x} h_{i} (y) d y] \\ & = & exp [- \sum_{j = 1}^{i - 1} \int_{l_{j}}^{l_{j + 1}} h_{j} (y) d y - \int_{l_{i}}^{x} h_{i} (y) d y] \end{matrix}

(6)

So, defining

ψ_{i} = \int_{l_{i}}^{l_{i + 1}} h_{i} (y) d y and Λ_{i} (x) = \int_{l_{i}}^{x} h_{i} (y) d y

for l_{i} \leq x < l_{i + 1}, the i-th component of the survival function is given by:

\begin{matrix} S_{1} (x) = exp (- Λ_{1} (x)), for i = 1, \\ S_{i} (x) = exp [- \sum_{j = 1}^{i - 1} ψ_{j} - Λ_{i} (x)], for i = 2, 3, \dots, L . \end{matrix}

(7)

Substituting (7) into (5) yields the i-th component of the segmented PDF built from the risk function of f (\cdot), which we denote HSM. A random variable with this segmented PDF is written X \sim H S M_{f}.

The cumulative distribution function of the proportional hazard family can be expressed in terms of the survival function S_{F} (\cdot) as

G_{F} (x) = 1 - {[1 - F (x)]}^{α} = 1 - {[S (x)]}^{α} .

(8)

Consequently, the proportional hazard extension of the segmented model H S M_{f}, denoted HPSM, is derived by substituting (4) into (8). The PDF of this extension is given by

g_{F} (x) = α f (x) {[S (x)]}^{α - 1} .

(9)

Since f (x) = h (x) S (x) by equation (5), the density (9) can be expressed as

g_{F} (x) = α h (x) {[S (x)]}^{α} .

(10)

The component of the density of X on the i-th segment is given by:

\begin{matrix} g_{1} (x) & = & α h_{1} (x) exp (- α Λ_{1} (x)), for i = 1, \\ g_{i} (x) & = & α h_{i} (x) exp [- α \sum_{j = 1}^{i - 1} ψ_{j} - α Λ_{i} (x)], l_{i} \leq x < l_{i + 1}, for i = 2, 3, \dots, L . \end{matrix}

(11)

A random variable with this Probability Density Function (PDF) will be denoted X \sim H P S M_{f} (α). In cases where f (\cdot) is parameterized by a vector of parameters θ, we will write X \sim H P S M_{f} (θ, α).

Note that when α = 1, the segmented form of the density f (x) is obtained. Similarly, if L = 1, the PDF of the proportional hazard distribution is recovered. When α = L = 1, we retrieve the PDF f (x).

Based on the preceding equation, the survival function of the i-th group is given by:

S_{1 - G} (x) = exp [- α Λ_{1} (x)], for i = 1

S_{i - G} (x) = exp [- α \sum_{j = 1}^{i - 1} ψ_{j} - α Λ_{i} (x)], for i = 2, 3, \dots, L

Similarly, the Cumulative Distribution Function (CDF) can be expressed as:

G_{1} (x) = 1 - exp [- α Λ_{1} (x)], for i = 1

G_{i} (x) = 1 - S_{i - G} (x) = 1 - exp [- α \sum_{j = 1}^{i - 1} ψ_{j} - α Λ_{i} (x)], for i = 2, 3, \dots, L

Additionally, the hazard function of the i-th group is given by:

h_{i - G} (x) = α h_{i} (x) .
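The construction above can be sketched numerically. The snippet below evaluates the segmented survival function and density from a list of segment hazards, computing ψ_j and Λ_i(x) by Simpson's rule, and then checks continuity of the survival function at a change point and the relation h_{i-G}(x) = α h_i(x). The three hazards, the change points, the value of α, and all function names are hypothetical choices made only for illustration.

```python
import math
from bisect import bisect_right


def integrate(fn, a, b, n=2000):
    """Composite Simpson rule on [a, b]; n must be even."""
    if b <= a:
        return 0.0
    step = (b - a) / n
    total = fn(a) + fn(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * fn(a + k * step)
    return total * step / 3.0


def segment_index(x, cuts):
    """0-based index i with cuts[i] <= x < cuts[i + 1]; last segment is unbounded."""
    return bisect_right(cuts, x) - 1


def survival(x, alpha, cuts, hazards):
    """exp(-alpha * (sum of psi_j over completed segments + Lambda_i(x)))."""
    if x < cuts[0]:
        return 1.0
    i = segment_index(x, cuts)
    psi_sum = sum(integrate(hazards[j], cuts[j], cuts[j + 1]) for j in range(i))
    lam = integrate(hazards[i], cuts[i], x)
    return math.exp(-alpha * (psi_sum + lam))


def density(x, alpha, cuts, hazards):
    """alpha * h_i(x) * survival, as in equation (11)."""
    if x < cuts[0]:
        return 0.0
    i = segment_index(x, cuts)
    return alpha * hazards[i](x) * survival(x, alpha, cuts, hazards)


# Hypothetical example: three segments whose hazards agree at the change points.
cuts = [0.0, 1.0, 2.5]
hazards = [
    lambda t: 0.2,                    # constant risk on [0, 1)
    lambda t: 0.2 + 0.5 * (t - 1.0),  # linearly increasing risk on [1, 2.5)
    lambda t: 0.95,                   # constant risk on [2.5, inf)
]
alpha = 1.7

left = survival(1.0 - 1e-9, alpha, cuts, hazards)   # just below the change point
right = survival(1.0, alpha, cuts, hazards)         # at the change point
hazard_at_2 = density(2.0, alpha, cuts, hazards) / survival(2.0, alpha, cuts, hazards)
print(left, right, hazard_at_2)
```

Numerical integration keeps the sketch generic; when Λ_i has a closed form, as in the specific families discussed later, it can replace the call to `integrate`.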

Now, since G (X) \sim U (0, 1), we can express (0, 1) as the union \cup_{i = 1}^{L} (u_{i}, u_{i + 1}), where u_{i} = G_{i} (l_{i}) and u_{L + 1} = 1. Thus, for p \in (u_{i}, u_{i + 1}), solving G_{i} (t_{p}) = p shows that the p-th percentile of X is given by

t_{p} = \{\begin{matrix} Λ_{1}^{- 1} (- \frac{1}{α} \log (1 - p)), & if l_{1} \leq t_{p} < l_{2}, \\ Λ_{i}^{- 1} (- \frac{1}{α} \log (1 - p) - \sum_{j = 1}^{i - 1} ψ_{j}), & if l_{i} \leq t_{p} < l_{i + 1}, i = 2, 3, \dots, L . \end{matrix}

(12)

Here, Λ_{i}^{- 1} (\cdot) represents the inverse function of Λ_{i} (\cdot).
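A small sketch of the quantile computation follows, writing the target cumulative hazard as -log(1 - p)/α for a CDF level p. It assumes piecewise-constant base hazards (rates r_i), so that Λ_i(x) = r_i (x - l_i) and its inverse is available in closed form; the rates, change points, α, and function names are hypothetical. Feeding uniform random numbers in place of p would give inverse-transform samples of X.

```python
import math


def cumulative_hazard(x, cuts, rates):
    """Base cumulative hazard: psi_j for completed segments plus Lambda_i(x)."""
    total = 0.0
    for i, rate in enumerate(rates):
        if x <= cuts[i]:
            break
        upper = cuts[i + 1] if i + 1 < len(cuts) else math.inf
        total += rate * (min(x, upper) - cuts[i])
    return total


def cdf(x, alpha, cuts, rates):
    """G(x) = 1 - exp(-alpha * cumulative base hazard)."""
    return 1.0 - math.exp(-alpha * cumulative_hazard(x, cuts, rates))


def quantile(p, alpha, cuts, rates):
    """Solve G(t) = p by locating the segment and inverting Lambda_i."""
    target = -math.log(1.0 - p) / alpha  # required cumulative base hazard
    passed = 0.0                         # running sum of psi_j
    for i, rate in enumerate(rates):
        is_last = i == len(rates) - 1
        psi = math.inf if is_last else rate * (cuts[i + 1] - cuts[i])
        if target < passed + psi:
            # Lambda_i^{-1}(y) = l_i + y / r_i for a constant rate r_i
            return cuts[i] + (target - passed) / rate
        passed += psi
    raise ValueError("rates must be non-empty")


# Hypothetical configuration
cuts = [0.0, 1.0, 3.0]
rates = [0.3, 0.8, 0.5]
alpha = 1.8

levels = (0.1, 0.5, 0.9, 0.99)
round_trip = [cdf(quantile(p, alpha, cuts, rates), alpha, cuts, rates) for p in levels]
print(round_trip)
```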

The moments of the segmented random variable X will depend on each distribution family under discussion. In general, with l_{L + 1} = \infty, the r-th moment of X can be expressed as

\begin{matrix} E (X^{r}) & = & \int_{- \infty}^{\infty} x^{r} g (x) d x \\ & = & \sum_{i = 1}^{L} \int_{l_{i}}^{l_{i + 1}} x^{r} g_{i} (x) d x \\ & = & α \int_{l_{1}}^{l_{2}} x^{r} h_{1} (x) exp [- α Λ_{1} (x)] d x \\ & & + α \sum_{i = 2}^{L} exp [- α \sum_{j = 1}^{i - 1} ψ_{j}] \int_{l_{i}}^{l_{i + 1}} x^{r} h_{i} (x) exp [- α Λ_{i} (x)] d x \\ & = & q_{1} + \sum_{i = 2}^{L} exp [- α \sum_{j = 1}^{i - 1} ψ_{j}] q_{i}, \end{matrix}

(13)

where

q_{i} = α \int_{l_{i}}^{l_{i + 1}} x^{r} h_{i} (x) exp [- α Λ_{i} (x)] d x

for i = 1, 2, 3, \dots, L .

Similarly, the moment-generating function of X is given by

\begin{matrix} M_{X} (t) & = & \int_{- \infty}^{\infty} exp (t x) g (x) d x \\ & = & \sum_{i = 1}^{L} \int_{l_{i}}^{l_{i + 1}} exp (t x) g_{i} (x) d x \\ & = & α \int_{l_{1}}^{l_{2}} h_{1} (x) exp [- α Λ_{1} (x) + t x] d x \\ & & + α \sum_{i = 2}^{L} exp [- α \sum_{j = 1}^{i - 1} ψ_{j}] \int_{l_{i}}^{l_{i + 1}} h_{i} (x) exp [- α Λ_{i} (x) + t x] d x \\ & = & p_{1} + \sum_{i = 2}^{L} exp [- α \sum_{j = 1}^{i - 1} ψ_{j}] p_{i}, \end{matrix}

(14)

where

p_{i} = α \int_{l_{i}}^{l_{i + 1}} h_{i} (x) exp [- α Λ_{i} (x) + t x] d x

for i = 1, 2, 3, \dots, L .

Next, we present the PDF, CDF, survival, and hazard functions of the segmented Rayleigh, Log-logistic, Lindley, and Laplace distributions.

3.1. Segmented Rayleigh Proportional Hazard Distribution

For the Rayleigh distribution, the hazard function is h_{i} (t; σ_{i}) = \frac{t}{σ_{i}^{2}}, which leads to the PDF of the segmented proportional hazard represented by:

g_{i} (t) = \{\begin{matrix} \frac{α t}{σ_{1}^{2}} exp (- \frac{α t^{2}}{2 σ_{1}^{2}}), & if i = 1, \\ \frac{α t}{σ_{i}^{2}} k_{i R}^{α} exp (- \frac{α}{2 σ_{i}^{2}} (t^{2} - t_{i}^{2})), & if t_{i} \leq t < t_{i + 1}, i = 2, 3, \dots, L, \end{matrix}

(15)

where t_{1} = 0 and, from (11) with ψ_{j} = \frac{t_{j + 1}^{2} - t_{j}^{2}}{2 σ_{j}^{2}},

k_{i R} = exp (- \sum_{j = 1}^{i - 1} ψ_{j}) = exp (- \frac{1}{2} \sum_{j = 1}^{i - 1} \frac{t_{j + 1}^{2} - t_{j}^{2}}{σ_{j}^{2}}) .

Thus, the distribution, survival, and hazard functions of this model are given by:

G_{i} (t) = \{\begin{matrix} 1 - exp (- \frac{α t^{2}}{2 σ_{1}^{2}}), & if i = 1, \\ 1 - k_{i R}^{α} exp (- \frac{α}{2 σ_{i}^{2}} (t^{2} - t_{i}^{2})), & if t_{i} \leq t < t_{i + 1}, i = 2, 3, \dots, L . \end{matrix}

(16)

S_{i} (t) = \{\begin{matrix} exp (- \frac{α t^{2}}{2 σ_{1}^{2}}), & if i = 1, \\ k_{i R}^{α} exp (- \frac{α}{2 σ_{i}^{2}} (t^{2} - t_{i}^{2})), & if t_{i} \leq t < t_{i + 1}, i = 2, 3, \dots, L . \end{matrix}

(17)

h_{i G} (t; σ_{i}) = \frac{α t}{σ_{i}^{2}}, i = 1, 2, 3, \dots, L .
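As a quick sanity check of the segmented Rayleigh formulas, the sketch below codes the survival function and density in closed form, taking k_{iR} = exp(-Σ_{j<i} ψ_j) with ψ_j = (t_{j+1}² - t_j²)/(2σ_j²) as implied by the general expression (11), and verifies that the survival function is continuous at a change point and that the density integrates to approximately one. The change points, scales, α, and function names are hypothetical.

```python
import math


def find_segment(t, cuts):
    """0-based index i with cuts[i] <= t < cuts[i + 1]; last segment is unbounded."""
    i = 0
    while i + 1 < len(cuts) and t >= cuts[i + 1]:
        i += 1
    return i


def k_factor(i, cuts, sigmas):
    """exp(-sum of psi_j over the i completed segments), psi_j in closed form."""
    return math.exp(
        -sum((cuts[j + 1] ** 2 - cuts[j] ** 2) / (2.0 * sigmas[j] ** 2) for j in range(i))
    )


def survival(t, alpha, cuts, sigmas):
    """Segmented Rayleigh PH survival function, as in equation (17)."""
    i = find_segment(t, cuts)
    tail = math.exp(-alpha * (t**2 - cuts[i] ** 2) / (2.0 * sigmas[i] ** 2))
    return k_factor(i, cuts, sigmas) ** alpha * tail


def density(t, alpha, cuts, sigmas):
    """Segmented Rayleigh PH density: alpha * t / sigma_i^2 times the survival."""
    i = find_segment(t, cuts)
    return alpha * t / sigmas[i] ** 2 * survival(t, alpha, cuts, sigmas)


# Hypothetical configuration with t_1 = 0
cuts = [0.0, 1.0, 2.0]
sigmas = [1.0, 0.6, 1.4]
alpha = 1.5

# Continuity of the survival function at the first change point
jump = abs(survival(1.0 - 1e-12, alpha, cuts, sigmas) - survival(1.0, alpha, cuts, sigmas))

# Midpoint rule on [0, 10]; the mass beyond 10 is negligible for these values
steps, upper = 100_000, 10.0
width = upper / steps
mass = sum(density((k + 0.5) * width, alpha, cuts, sigmas) for k in range(steps)) * width
print(jump, mass)
```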

3.2. Segmented Log-Logistic Proportional Hazard Distribution

The hazard function of the log-logistic distribution for the i-th group is given by h_{i} (t) = \frac{β_{i} λ_{i} t^{β_{i} - 1}}{1 + λ_{i} t^{β_{i}}}, which leads to the following PDF for the segmented proportional hazard:

g_{i} (t) = \{\begin{matrix} \frac{α β_{1} λ_{1} t^{β_{1} - 1}}{{(1 + λ_{1} t^{β_{1}})}^{α + 1}}, & if i = 1, \\ \frac{α β_{i} λ_{i} t^{β_{i} - 1}}{{(1 + λ_{i} t^{β_{i}})}^{α + 1}} \frac{\prod_{j = 1}^{i} {(1 + λ_{j} t_{j}^{β_{j}})}^{α}}{\prod_{j = 1}^{i - 1} {(1 + λ_{j} t_{j + 1}^{β_{j}})}^{α}}, & if t_{i} \leq t < t_{i + 1}, i = 2, 3, \dots, L . \end{matrix}

(18)

The survival function is expressed by

S_{i} (t) = \{\begin{matrix} \frac{1}{{(1 + λ_{1} t^{β_{1}})}^{α}}, & if i = 1, \\ \frac{1}{{(1 + λ_{i} t^{β_{i}})}^{α}} \frac{\prod_{j = 1}^{i} {(1 + λ_{j} t_{j}^{β_{j}})}^{α}}{\prod_{j = 1}^{i - 1} {(1 + λ_{j} t_{j + 1}^{β_{j}})}^{α}}, & if t_{i} \leq t < t_{i + 1}, i = 2, 3, \dots, L, \end{matrix}

(19)

while the hazard function is given by

h_{i L L} (t) = \frac{α β_{i} λ_{i} t^{β_{i} - 1}}{1 + λ_{i} t^{β_{i}}} .

3.3. Segmented Lindley Proportional Hazard Distribution

The hazard function of the Lindley distribution is given by

h_{i} (t) = \frac{λ_{i}^{2} (1 + t)}{1 + λ_{i} (t + 1)} .

After some algebra, the segmented proportional hazard PDF is given by

g_{i} (t) = \{\begin{matrix} \frac{α λ_{1}^{2} (1 + t) {(1 + λ_{1} (1 + t))}^{α - 1}}{{(1 + λ_{1})}^{α}} exp (- α λ_{1} t), & if i = 1, \\ \frac{α λ_{i}^{2} (1 + t) {(1 + λ_{i} (1 + t))}^{α - 1}}{{(1 + λ_{i} (t_{i} + 1))}^{α}} \frac{\prod_{j = 1}^{i - 1} {(1 + λ_{j} (t_{j + 1} + 1))}^{α}}{\prod_{j = 1}^{i - 1} {(1 + λ_{j} (t_{j} + 1))}^{α}} exp (- α λ_{i} (t - t_{i}) - α \sum_{j = 1}^{i - 1} λ_{j} (t_{j + 1} - t_{j})), & if t_{i} \leq t < t_{i + 1}, i = 2, 3, \dots, L . \end{matrix}

(20)

Therefore, the survival function is given by

S_{i} (t) = \{\begin{matrix} \frac{{(1 + λ_{1} (1 + t))}^{α}}{{(1 + λ_{1})}^{α}} exp (- α λ_{1} t), & if i = 1, \\ \frac{{(1 + λ_{i} (1 + t))}^{α}}{{(1 + λ_{i} (t_{i} + 1))}^{α}} \frac{\prod_{j = 1}^{i - 1} {(1 + λ_{j} (t_{j + 1} + 1))}^{α}}{\prod_{j = 1}^{i - 1} {(1 + λ_{j} (t_{j} + 1))}^{α}} exp (- α λ_{i} (t - t_{i}) - α \sum_{j = 1}^{i - 1} λ_{j} (t_{j + 1} - t_{j})), & if t_{i} \leq t < t_{i + 1}, i = 2, 3, \dots, L, \end{matrix}

(21)

and the segmented proportional hazard function is

h_{i H} (t) = \frac{α λ_{i}^{2} (1 + t)}{1 + λ_{i} (t + 1)} .
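Because the Lindley expressions involve several products, a numerical cross-check can be reassuring. The sketch below compares the closed-form survival function (21), taking t_1 = 0, with exp(-α ∫ h), where the integral of the segmented Lindley hazard is computed by Simpson's rule. The change points, the λ_i, α, the evaluation point, and the function names are hypothetical.

```python
import math


def lindley_hazard(t, lam):
    """Base Lindley hazard: lam^2 (1 + t) / (1 + lam (1 + t))."""
    return lam**2 * (1.0 + t) / (1.0 + lam * (1.0 + t))


def find_segment(t, cuts):
    """0-based index i with cuts[i] <= t < cuts[i + 1]; last segment is unbounded."""
    i = 0
    while i + 1 < len(cuts) and t >= cuts[i + 1]:
        i += 1
    return i


def simpson(fn, a, b, n=2000):
    """Composite Simpson rule on [a, b]; n must be even."""
    if b <= a:
        return 0.0
    step = (b - a) / n
    total = fn(a) + fn(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * fn(a + k * step)
    return total * step / 3.0


def survival_closed_form(t, alpha, cuts, lams):
    """Equation (21): factors for completed segments times the current segment."""
    i = find_segment(t, cuts)
    value = 1.0
    for j in range(i + 1):
        end = t if j == i else cuts[j + 1]
        ratio = (1.0 + lams[j] * (1.0 + end)) / (1.0 + lams[j] * (1.0 + cuts[j]))
        value *= ratio**alpha * math.exp(-alpha * lams[j] * (end - cuts[j]))
    return value


def survival_numeric(t, alpha, cuts, lams):
    """exp(-alpha * integral of the segmented hazard from cuts[0] to t)."""
    i = find_segment(t, cuts)
    total = 0.0
    for j in range(i + 1):
        end = t if j == i else cuts[j + 1]
        total += simpson(lambda y, lam=lams[j]: lindley_hazard(y, lam), cuts[j], end)
    return math.exp(-alpha * total)


# Hypothetical configuration with t_1 = 0
cuts = [0.0, 2.0, 5.0]
lams = [0.5, 1.2, 0.8]
alpha = 1.4

gap = abs(
    survival_closed_form(6.3, alpha, cuts, lams) - survival_numeric(6.3, alpha, cuts, lams)
)
print(gap)
```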

3.4. Segmented Laplace Proportional Hazard Distribution

The PDF of a Laplace random variable with parameters μ and λ is given by:

f_{L} (x) = \{\begin{matrix} \frac{λ}{2} exp (λ (x - μ)), & if x < μ, \\ \frac{λ}{2} exp (- λ (x - μ)), & if x \geq μ, \end{matrix}

(22)

where the survival and hazard functions are expressed, respectively, by

S_{L} (x) = \{\begin{matrix} 1 - \frac{1}{2} exp (λ (x - μ)), & if x < μ, \\ \frac{1}{2} exp (- λ (x - μ)), & if x \geq μ, \end{matrix}

(23)

and

h_{L} (x) = \{\begin{matrix} \frac{\frac{λ}{2} exp (λ (x - μ))}{1 - \frac{1}{2} exp (λ (x - μ))}, & if x < μ, \\ λ, & if x \geq μ . \end{matrix}

(24)

Then, for i = 1, the segmented proportional hazard density based on the Laplace distribution is

g_{1} (x) = \{\begin{matrix} \frac{α λ_{1}}{2} \frac{exp (λ_{1} (x - μ_{1})) {(1 - \frac{1}{2} exp (λ_{1} (x - μ_{1})))}^{α - 1}}{{(1 - \frac{1}{2} exp (λ_{1} (x_{1} - μ_{1})))}^{α}}, & if x < μ_{1}, \\ \frac{α λ_{1}}{2} exp (- \frac{α λ_{1}}{2} (x - x_{1})), & if x \geq μ_{1}, \end{matrix}

(25)

while if

x_{i} \leq x < x_{i + 1}, i = 2, 3, \dots, L

g_{i} (x) = \{\begin{matrix} \frac{α λ_{i}}{2} \frac{exp (λ_{i} (x - μ_{i})) {(1 - \frac{1}{2} exp (λ_{i} (x - μ_{i})))}^{α - 1}}{{(1 - \frac{1}{2} exp (λ_{i} (x_{i} - μ_{i})))}^{α}} \prod_{j = 1}^{i - 1} {(\frac{1 - \frac{1}{2} exp (λ_{j} (x_{j + 1} - μ_{j}))}{1 - \frac{1}{2} exp (λ_{j} (x_{j} - μ_{j}))})}^{α}, & if x < μ_{i}, \\ \frac{α λ_{i}}{2} exp (- \frac{α λ_{i}}{2} (x - x_{i}) - \frac{α}{2} \sum_{j = 1}^{i - 1} λ_{j} (x_{j + 1} - x_{j})), & if x \geq μ_{i} . \end{matrix}

(26)

The survival function of the segmented proportional hazard for i = 1 is given by

S_{1} (x) = \{\begin{matrix} \frac{{(1 - \frac{1}{2} exp (λ_{1} (x - μ_{1})))}^{α}}{{(1 - \frac{1}{2} exp (λ_{1} (x_{1} - μ_{1})))}^{α}}, & if x < μ_{1}, \\ exp (- \frac{α λ_{1}}{2} (x - x_{1})), & if x \geq μ_{1}, \end{matrix}

(27)

while if

x_{i} \leq x < x_{i + 1}, i = 2, 3, \dots, L

S_{i} (x) = \{\begin{matrix} \frac{{(1 - \frac{1}{2} exp (λ_{i} (x - μ_{i})))}^{α}}{{(1 - \frac{1}{2} exp (λ_{i} (x_{i} - μ_{i})))}^{α}} \prod_{j = 1}^{i - 1} {(\frac{1 - \frac{1}{2} exp (λ_{j} (x_{j + 1} - μ_{j}))}{1 - \frac{1}{2} exp (λ_{j} (x_{j} - μ_{j}))})}^{α}, & if x < μ_{i}, \\ exp (- \frac{α λ_{i}}{2} (x - x_{i}) - \frac{α}{2} \sum_{j = 1}^{i - 1} λ_{j} (x_{j + 1} - x_{j})), & if x \geq μ_{i}, \end{matrix}

(28)

Therefore, the hazard function is given by

h_{i} (x) = \{\begin{matrix} \frac{α λ_{i}}{2} \frac{exp (λ_{i} (x - μ_{i}))}{1 - \frac{1}{2} exp (λ_{i} (x - μ_{i}))}, & if x < μ_{i}, \\ α λ_{i}, & if x \geq μ_{i} . \end{matrix}

(29)

4. Estimation

The parameters of the segmented proportional hazard distribution are estimated by the method of maximum likelihood. Suppose that g (x; θ, α) represents the PDF of the segmented proportional hazard distribution indexed by the parameter α and the parameter vector θ = (θ_{1}, θ_{2}, \dots, θ_{i}, \dots, θ_{L}), where for i = 1, 2, \dots, L, we have θ_{i} = (θ_{i 1}, θ_{i 2}, \dots, θ_{i a}, \dots, θ_{i b}). Define the indicator function

I_{i} (x) = \{\begin{matrix} 1, & if l_{i} \leq x < l_{i + 1}, \\ 0, & otherwise . \end{matrix}

(30)

The PDF g (x; θ, α) can then be expressed in the following form:

g (x; θ, α) = α \prod_{i = 1}^{L} h_{i}^{I_{i} (x)} (x, θ_{i}) exp (- α \sum_{i = 1}^{L} I_{i} (x) Λ_{i} (x, θ_{i}) - α \sum_{i = 2}^{L} \sum_{j = 1}^{i - 1} I_{i} (x) ψ_{j} (θ_{j})),

(31)

Assuming that the change points l_{1}, l_{2}, \dots, l_{L} are known, and given observations x_{1}, x_{2}, \dots, x_{n} forming a sample of size n = n_{1} + n_{2} + \dots + n_{i} + \dots + n_{L}, where n_{i} is the number of observations falling in [l_{i}, l_{i + 1}), the likelihood function of (θ, α) is expressed as follows:

\begin{matrix} L (θ, α) & = & α^{n} \prod_{i = 1}^{L} \prod_{l_{i} \leq x_{k} < l_{i + 1}} h_{i} (x_{k}, θ_{i}) exp (- α \sum_{i = 1}^{L} \sum_{l_{i} \leq x_{k} < l_{i + 1}} Λ_{i} (x_{k}, θ_{i}) - \\ α \sum_{j = 1}^{L - 1} (ψ_{j} (θ_{j}) \sum_{i = j + 1}^{L} n_{i})) . \end{matrix}

(32)

Subsequently, the log-likelihood function is given by

\begin{matrix} ℓ (θ, α) & = & n \log (α) + \sum_{i = 1}^{L} \sum_{l_{i} \leq x_{k} < l_{i + 1}} \log (h_{i} (x_{k}, θ_{i})) - α \sum_{i = 1}^{L} \sum_{l_{i} \leq x_{k} < l_{i + 1}} Λ_{i} (x_{k}, θ_{i}) - \\ α \sum_{j = 1}^{L - 1} (ψ_{j} (θ_{j}) \sum_{i = j + 1}^{L} n_{i}) . \end{matrix}

(33)

Consequently, the components of the score function are given by:

U (θ_{i a}) = \sum_{l_{i} \leq x_{k} < l_{i + 1}} \frac{h_{θ_{i a}}^{'}}{h_{i} (x_{k}, θ_{i})} - α \sum_{l_{i} \leq x_{k} < l_{i + 1}} Λ_{θ_{i a}}^{'} - α ψ_{θ_{i a}}^{'} \sum_{m = i + 1}^{L} n_{m},

(34)

where i = 1, 2, \dots, L and a = 1, 2, \dots, b. Here, h_{θ_{i a}}^{'} denotes the first derivative of h_{i} (x_{k}, θ_{i}) with respect to θ_{i a}, and the same notation is used for Λ_{i} and ψ_{i}.

The derivative with respect to α is given by:

U (α) = \frac{n}{α} - \sum_{i = 1}^{L} \sum_{l_{i} \leq x_{k} < l_{i + 1}} Λ_{i} (x_{k}, θ_{i}) - \sum_{j = 1}^{L - 1} (ψ_{j} (θ_{j}) \sum_{i = j + 1}^{L} n_{i}) .

(35)

Equating (34) and (35) to zero gives the score equations, whose solution by iterative numerical methods yields the maximum likelihood estimates of the parameters of the segmented proportional hazard model. General-purpose optimizers, such as the functions optim and nlm in R, can be used to obtain these estimates. From the equation U (α) = 0, we obtain

α (θ) = \frac{n}{\sum_{i = 1}^{L} \sum_{l_{i} \leq x_{k} < l_{i + 1}} Λ_{i} (x_{k}, θ_{i}) + \sum_{j = 1}^{L - 1} (ψ_{j} (θ_{j}) \sum_{i = j + 1}^{L} n_{i})} .

(36)

Subsequently, substituting (36) into (33) yields the profile log-likelihood of θ, in which the last term is a constant that does not depend on θ:

\begin{matrix} ℓ_{p} (θ) & = & - n \log (\sum_{i = 1}^{L} \sum_{l_{i} \leq x_{k} < l_{i + 1}} Λ_{i} (x_{k}, θ_{i}) + \sum_{j = 1}^{L - 1} (ψ_{j} (θ_{j}) \sum_{i = j + 1}^{L} n_{i})) + \\ \sum_{i = 1}^{L} \sum_{l_{i} \leq x_{k} < l_{i + 1}} \log (h_{i} (x_{k}, θ_{i})) + n (\log (n) - 1) . \end{matrix}

(37)
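The closed-form expression (36) is easy to check numerically. The sketch below uses a segmented Rayleigh base with the scales σ_i treated as known, evaluates the log-likelihood (33) observation by observation, and confirms that the value of α given by (36) is not improved by nearby values. The data, change points, scales, and function names are all hypothetical and serve only as an illustration.

```python
import math


def find_segment(x, cuts):
    """0-based index i with cuts[i] <= x < cuts[i + 1]; last segment is unbounded."""
    i = 0
    while i + 1 < len(cuts) and x >= cuts[i + 1]:
        i += 1
    return i


def base_hazard(x, cuts, sigmas):
    """Rayleigh hazard of the segment containing x."""
    return x / sigmas[find_segment(x, cuts)] ** 2


def base_cumulative_hazard(x, cuts, sigmas):
    """Sum of psi_j over completed segments plus Lambda_i(x)."""
    i = find_segment(x, cuts)
    total = (x**2 - cuts[i] ** 2) / (2.0 * sigmas[i] ** 2)
    for j in range(i):
        total += (cuts[j + 1] ** 2 - cuts[j] ** 2) / (2.0 * sigmas[j] ** 2)
    return total


def log_likelihood(alpha, data, cuts, sigmas):
    """Equation (33), accumulated one observation at a time."""
    return sum(
        math.log(alpha)
        + math.log(base_hazard(x, cuts, sigmas))
        - alpha * base_cumulative_hazard(x, cuts, sigmas)
        for x in data
    )


def alpha_hat(data, cuts, sigmas):
    """Equation (36): n divided by the total base cumulative hazard."""
    return len(data) / sum(base_cumulative_hazard(x, cuts, sigmas) for x in data)


# Hypothetical data and configuration
data = [0.4, 0.9, 1.1, 1.3, 1.8, 2.2, 2.6, 3.1]
cuts = [0.0, 1.0, 2.0]
sigmas = [1.0, 0.8, 1.5]

a_hat = alpha_hat(data, cuts, sigmas)
at_hat = log_likelihood(a_hat, data, cuts, sigmas)
below = log_likelihood(0.9 * a_hat, data, cuts, sigmas)
above = log_likelihood(1.1 * a_hat, data, cuts, sigmas)
print(a_hat, at_hat >= below, at_hat >= above)
```

In a full fit, `alpha_hat` would be called inside the objective passed to a numerical optimizer over θ, which is the profile likelihood idea of (37).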

We now proceed to determine the observed and expected information matrices in general terms. These matrices are of significant interest because, for large samples, their inverses evaluated at the maximum likelihood estimates (MLE) of the parameters yield the asymptotic covariance matrix. This matrix, in turn, is used to determine the standard errors of the parameter estimates. Under the assumption of asymptotic normality and certain regularity conditions, this allows for the calculation of confidence intervals for the parameters and the execution of hypothesis tests. Furthermore, certain optimization methods use the information matrix to compute the MLEs of the parameters. The existence of these matrices depends on the specific configuration of f and F in each case.

Since the elements of the observed information matrix are the negative second derivatives of the log-likelihood function, they depend on the form of f and F in the particular model, and so do the conditions and properties of the observed and expected information matrices. Assuming that the second derivatives of h_{i} (x_{k}, θ_{i}), Λ_{i} (x_{k}, θ_{i}), and ψ_{j} (θ_{j}) with respect to the parameters exist, and noting that the log-likelihood (33) is linear in Λ_{i} and ψ_{j}, the elements of the observed information matrix K (θ, α) are given by

κ_{θ_{i a} θ_{i a^{'}}} = - \sum_{l_{i} \leq x_{k} < l_{i + 1}} \frac{h_{i} (x_{k}, θ_{i}) h_{θ_{i a} θ_{i a^{'}}}^{″} - h_{θ_{i a}}^{'} h_{θ_{i a^{'}}}^{'}}{h_{i}^{2} (x_{k}, θ_{i})} + α \sum_{l_{i} \leq x_{k} < l_{i + 1}} Λ_{θ_{i a} θ_{i a^{'}}}^{″} + α ψ_{θ_{i a} θ_{i a^{'}}}^{″} \sum_{m = i + 1}^{L} n_{m},

(38)

where h_{θ_{i a} θ_{i a^{'}}}^{″} represents the second derivative of h_{i} (x_{k}, θ_{i}) with respect to θ_{i a} and θ_{i a^{'}}, and the same notation is used for Λ_{i} and ψ_{i}. Furthermore,

κ_{θ_{i a} α} = \sum_{l_{i} \leq x_{k} < l_{i + 1}} Λ_{θ_{i a}}^{'} + ψ_{θ_{i a}}^{'} \sum_{m = i + 1}^{L} n_{m},

(39)

and

κ_{α α} = \frac{n}{α^{2}} .

(40)

The elements of the expected information matrix I (θ, α) are obtained by taking the expectation of the elements of the observed information matrix. They are denoted I_{γ_{l} γ_{l^{'}}}, where γ_{l} ranges over the parameters θ_{i a}, for i = 1, 2, \dots, L and a = 1, 2, \dots, b, as well as the parameter α.

Under regularity conditions, asymptotic theory for the maximum likelihood estimator of the parameter vector (θ, α) gives, as n \to \infty, K (θ, α) \to I (θ, α). Using the same argument of stochastic convergence, it follows that (\hat{θ}, \hat{α}) is asymptotically normal, that is,

\sqrt{n} ((\hat{θ}, \hat{α}) - (θ, α)) \overset{D}{\to} N_{L b + 1} (0, I^{- 1} (θ, α)),

where L b + 1 is the total number of parameters (b for each of the L segments, plus α).

4.1. Censoring

We now introduce a censoring point, denoted as C, in the segmented proportional hazard distribution. In studies of survival data, it is common to encounter various types of censoring [18]. For the k-th observation, the censored variable in the segmented proportional hazard model is defined as Y_{k} = min (X_{k}, C_{k}), where k = 1, 2, \dots, n. It is assumed that the censoring time C_{k} for the k-th individual does not depend on X_{k}. We define the indicator variable for censoring as

δ_{k} = \{\begin{matrix} 0, & if the individual is censored, \\ 1, & otherwise . \end{matrix}

(41)

The density function for the case of right-censoring (δ_{k} = 0 for x_{k} \geq C) for the i-th group is given by:

\begin{matrix} g_{C i} (x_{k}) & = & g_{i}^{δ_{k}} (x_{k}) S_{i - G}^{1 - δ_{k}} (C) \\ & = & α^{δ_{k}} h_{i}^{δ_{k}} (x_{k}) exp [- α δ_{k} \sum_{j = 1}^{i - 1} ψ_{j} - α δ_{k} Λ_{i} (x_{k}) + (1 - δ_{k}) (- α \sum_{j = 1}^{i - 1} ψ_{j} - α Λ_{i} (C))] \\ & = & α^{δ_{k}} h_{i}^{δ_{k}} (x_{k}) exp [- α \sum_{j = 1}^{i - 1} ψ_{j} - α δ_{k} Λ_{i} (x_{k}) - α (1 - δ_{k}) Λ_{i} (C)] . \end{matrix}

(42)

Note that if α = 1, the segmented censored density of f (x) is obtained. Similarly, if L = 1, the censored PDF of the proportional hazard distribution is obtained, and if α = L = 1, the censored PDF of f (x) is obtained.

For a random sample x_{1}, x_{2}, \dots, x_{n}, the log-likelihood function is given by

\begin{matrix} ℓ_{C} (θ, α) & = & \log (α) \sum_{k = 1}^{n} δ_{k} + \sum_{i = 1}^{L} \sum_{l_{i} \leq x_{k} < l_{i + 1}} δ_{k} \log (h_{i} (x_{k}, θ_{i})) \\ - α \sum_{i = 1}^{L} \sum_{l_{i} \leq x_{k} < l_{i + 1}} δ_{k} Λ_{i} (x_{k}, θ_{i}) \\ - α \sum_{i = 1}^{L} \sum_{l_{i} \leq x_{k} < l_{i + 1}} (1 - δ_{k}) Λ_{i} (C, θ_{i}) \\ - α \sum_{j = 1}^{L - 1} (ψ_{j} (θ_{j}) \sum_{i = j + 1}^{L} n_{i}), \end{matrix}

(43)

where δ_{k} = 0 for x_{k} \geq C. Estimation again proceeds iteratively through the score equations, as in the uncensored case. The profile likelihood can also be employed since, for the parameter α, it follows that:

\begin{matrix} α (θ) = \frac{\sum_{k = 1}^{n} δ_{k}}{\sum_{i = 1}^{L} \sum_{l_{i} \leq x_{k} < l_{i + 1}} (δ_{k} Λ_{i} (x_{k}, θ_{i}) + (1 - δ_{k}) Λ_{i} (C, θ_{i})) + \sum_{j = 1}^{L - 1} (ψ_{j} (θ_{j}) \sum_{i = j + 1}^{L} n_{i})} . \end{matrix}

(44)
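Expression (44) can be read as the number of uncensored observations divided by the total base cumulative hazard evaluated at the observed times min(x_k, C). The simulation sketch below illustrates this under assumed piecewise-constant base hazards with known rates and a single fixed censoring time; the rates, change points, true α, seed, sample size, and function names are hypothetical.

```python
import math
import random


def cumulative_hazard(x, cuts, rates):
    """Base cumulative hazard: psi_j for completed segments plus Lambda_i(x)."""
    total = 0.0
    for i, rate in enumerate(rates):
        if x <= cuts[i]:
            break
        upper = cuts[i + 1] if i + 1 < len(cuts) else math.inf
        total += rate * (min(x, upper) - cuts[i])
    return total


def draw_time(alpha, cuts, rates, rng):
    """Inverse-transform draw from the segmented PH model."""
    target = -math.log(1.0 - rng.random()) / alpha
    passed = 0.0
    for i, rate in enumerate(rates):
        is_last = i == len(rates) - 1
        psi = math.inf if is_last else rate * (cuts[i + 1] - cuts[i])
        if target < passed + psi:
            return cuts[i] + (target - passed) / rate
        passed += psi
    raise ValueError("rates must be non-empty")


def alpha_hat_censored(times, deltas, cuts, rates):
    """Equation (44): events divided by total cumulative hazard at observed times."""
    return sum(deltas) / sum(cumulative_hazard(y, cuts, rates) for y in times)


# Hypothetical simulation settings
rng = random.Random(2025)
alpha_true = 1.8
cuts = [0.0, 1.0, 3.0]
rates = [0.3, 0.8, 0.5]
censor_at = 2.0  # common right-censoring time C

latent = [draw_time(alpha_true, cuts, rates, rng) for _ in range(20000)]
times = [min(x, censor_at) for x in latent]            # y_k = min(x_k, C)
deltas = [1 if x < censor_at else 0 for x in latent]   # 0 marks a censored record

estimate = alpha_hat_censored(times, deltas, cuts, rates)
print(estimate, sum(deltas))
```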

Until now, it has been assumed that the number of segments, L, and the change points l_{1}, l_{2}, \dots, l_{L} are known. In practice this assumption rarely holds, so alternatives are needed to identify these points or to estimate them. A first approach is to look for potential change points in the empirical cumulative hazard function derived from the data and to treat them as the actual change points of the distribution.

Another alternative is to estimate the change points through an iterative process. This process entails creating candidate partitions for i = 1, 2, \dots, L, each with potential change points l_{1}, l_{2}, \dots, l_{L}. The log-likelihood function is then maximized for each partition to determine the most suitable one. Model comparison criteria, such as AIC, BIC, and CAIC, can also be employed to assist in partition selection.
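The partition search described above can be sketched for the simplest case. The example below assumes two segments with constant hazards and α fixed at 1 (with constant segment hazards, only the products of α and the rates enter the likelihood), so the rates have closed-form estimates for each candidate change point. Candidates that leave very few observations in a segment are skipped, since the likelihood can otherwise become arbitrarily large near the largest observation. The simulated data, the grid, the threshold `min_events`, and the parameter count used in the AIC are hypothetical choices, not the procedure used in the illustration section.

```python
import math
import random


def fit_two_segments(data, cut):
    """Closed-form MLE of a two-rate exponential model with change point `cut`."""
    d1 = sum(1 for x in data if x < cut)          # events before the change point
    d2 = len(data) - d1                           # events after it
    e1 = sum(min(x, cut) for x in data)           # exposure in segment 1
    e2 = sum(x - cut for x in data if x >= cut)   # exposure in segment 2
    r1, r2 = d1 / e1, d2 / e2
    loglik = d1 * math.log(r1) - r1 * e1 + d2 * math.log(r2) - r2 * e2
    return loglik, (r1, r2)


def select_change_point(data, candidates, min_events=20):
    """Return (loglik, cut, rates) for the candidate with the largest log-likelihood."""
    best = None
    for cut in candidates:
        d1 = sum(1 for x in data if x < cut)
        if d1 < min_events or len(data) - d1 < min_events:
            continue  # skip partitions with a nearly empty segment
        loglik, rates = fit_two_segments(data, cut)
        if best is None or loglik > best[0]:
            best = (loglik, cut, rates)
    return best


# Simulated data: rate 0.5 before t = 2, rate 2.0 afterwards (hypothetical values)
rng = random.Random(7)
data = []
for _ in range(2000):
    x = rng.expovariate(0.5)
    if x >= 2.0:
        x = 2.0 + rng.expovariate(2.0)  # memoryless restart at the change point
    data.append(x)

candidates = [0.5 + 0.1 * k for k in range(36)]  # grid from 0.5 to 4.0
loglik_two, best_cut, best_rates = select_change_point(data, candidates)

# One-segment (plain exponential) fit for comparison
n, total = len(data), sum(data)
loglik_one = n * math.log(n / total) - n

aic_one = -2.0 * loglik_one + 2 * 1
aic_two = -2.0 * loglik_two + 2 * 3  # two rates plus the change point (assumed count)
print(best_cut, best_rates, aic_one, aic_two)
```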

In reality, it is generally not expected to have a partition with numerous change points. Instead, real-world data tends to exhibit multiple instances of subtle changes in the cumulative hazard distribution. Therefore, the partition with a limited number of change points is often more representative of the underlying distribution.

5. Illustration

Here, we illustrate the segmented proportional hazard model using a real dataset. We consider the IQ scores of 87 white males hired by a large insurance company in 1971, known as the Otis IQ Scores dataset. These data were previously studied by [19] (see also [12]). The empirical cumulative hazard function for this dataset is shown in Figure 1(c).

We fit several families of segmented distributions to the data: segmented Laplace, segmented Laplace proportional hazard, segmented half-normal, and segmented half-normal proportional hazard. For comparison, we also fit the Laplace, skew-normal, Gamma, and normal distributions and a mixture of normals (MN).

To select the best model among these candidates, we use the Akaike Information Criterion (AIC) [1], the corrected Akaike Information Criterion (CAIC) proposed by [4], and the Hannan-Quinn information criterion (HQIC) introduced by [13].

These criteria are defined as follows, where ℓ denotes the maximized log-likelihood, p the number of estimated parameters, and n the sample size:

\mathrm{AIC} = - 2 ℓ (\hat{θ}) + 2 p, \quad \mathrm{CAIC} = - 2 ℓ (\hat{θ}) + \frac{2 n (p + 1)}{n - p - 2}, \quad \text{and} \quad \mathrm{HQIC} = - 2 ℓ (\hat{θ}) + 2 p \log (\log (n)) .
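As a quick check of these definitions, the sketch below evaluates the three criteria for a single model. The inputs are assumptions rather than values stated in the text: the log-likelihood is backed out of the Laplace AIC reported in Table 1 with p = 2, and n = 87 is used because it is the sample size at which the penalties match that row. The function name `information_criteria` is a hypothetical helper.

```python
import math


def information_criteria(loglik, p, n):
    """Return (AIC, CAIC, HQIC) for a maximized log-likelihood.

    loglik -- maximized log-likelihood
    p      -- number of estimated parameters
    n      -- sample size
    """
    deviance = -2.0 * loglik
    aic = deviance + 2.0 * p
    caic = deviance + 2.0 * n * (p + 1) / (n - p - 2)
    hqic = deviance + 2.0 * p * math.log(math.log(n))
    return aic, caic, hqic


# Assumed inputs: AIC = 641.708 with p = 2 implies loglik = -(641.708 - 4) / 2.
loglik_laplace = -(641.708 - 2 * 2) / 2.0
aic, caic, hqic = information_criteria(loglik_laplace, p=2, n=87)
print(round(aic, 3), round(caic, 3), round(hqic, 3))
```

Under these assumptions the computed CAIC and HQIC agree with the Laplace row of Table 1 to about three decimal places.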

Table 1 displays the AIC, CAIC, and HQIC values for the various fitted models, along with the estimated partition values for the case of

L = 2

.

Tables 2 and 3 contain the maximum likelihood estimates of the fitted models.

First, the Laplace model is compared with the SLPH model by testing

H_{0} : (L, α) = (1, 1) versus H_{1} : (L, α) \neq (1, 1) .

Using the likelihood ratio statistic,

Λ = \exp \{ ℓ_{L} (\hat{θ}) - ℓ_{\mathrm{SLPH}} (\hat{θ}) \}

we obtain

- 2 \log (Λ) = 11.857,

which is greater than the critical value

χ_{2, 95 %}^{2} = 5.99

. The null hypothesis is therefore rejected at the 5% level, and the SLPH model stands as a suitable alternative for fitting the dataset. Additionally, the LS model is compared with the SLPH model, and the Laplace model is compared with the LS model, using the hypothesis tests

H_{01} : α = 1 versus H_{11} : α \neq 1, and H_{02} : L = 1 versus H_{12} : L \neq 1,

respectively, using the likelihood ratio statistics

Λ_{1} = \exp \{ ℓ_{\mathrm{LS}} (\hat{θ}) - ℓ_{\mathrm{SLPH}} (\hat{θ}) \} and Λ_{2} = \exp \{ ℓ_{L} (\hat{θ}) - ℓ_{\mathrm{LS}} (\hat{θ}) \} .

After numerical evaluation, we obtain

- 2 \log (Λ_{1}) = 5.8406 and - 2 \log (Λ_{2}) = 6.0164,

both of which are greater than the critical value

χ_{1, 95 %}^{2} = 3.84

. Both null hypotheses are therefore rejected at the 5% level, and the SLPH model exhibits the best fit among the fitted models.
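The likelihood ratio statistics above can be recovered, up to rounding, from the AIC values in Table 1. The sketch below does this under assumptions that are not stated explicitly in the text: the parameter counts are taken as p = 2 (Laplace), p = 5 (LS), and p = 6 (SLPH), with the change point counted as a parameter, and the degrees of freedom are those used in the tests above. The helper names `lr_statistic` and `chi2_sf` are hypothetical; the chi-square tail probabilities use the closed forms available for 1 and 2 degrees of freedom.

```python
import math

# AIC values from Table 1; the parameter counts are assumptions (see lead-in).
aic = {"Laplace": 641.708, "LS": 641.6922, "SLPH": 637.8516}
n_params = {"Laplace": 2, "LS": 5, "SLPH": 6}

# Recover -2 * log-likelihood from AIC = -2 * loglik + 2 * p.
neg2ll = {m: aic[m] - 2 * n_params[m] for m in aic}


def lr_statistic(null_model, alt_model):
    """-2 log(Lambda) for a null model nested in an alternative model."""
    return neg2ll[null_model] - neg2ll[alt_model]


def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variable for df = 1 or 2."""
    if df == 1:
        return math.erfc(math.sqrt(x / 2.0))
    if df == 2:
        return math.exp(-x / 2.0)
    raise ValueError("only df = 1 or 2 are implemented in this sketch")


lr_laplace_vs_slph = lr_statistic("Laplace", "SLPH")
lr_ls_vs_slph = lr_statistic("LS", "SLPH")
lr_laplace_vs_ls = lr_statistic("Laplace", "LS")

p_laplace_vs_slph = chi2_sf(lr_laplace_vs_slph, df=2)
p_ls_vs_slph = chi2_sf(lr_ls_vs_slph, df=1)
p_laplace_vs_ls = chi2_sf(lr_laplace_vs_ls, df=1)

print(lr_laplace_vs_slph, lr_ls_vs_slph, lr_laplace_vs_ls)
print(p_laplace_vs_slph, p_ls_vs_slph, p_laplace_vs_ls)
```

Small discrepancies in the last decimals are expected because the tabulated AIC values are rounded.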

Figure 1(a), (b), and (c) show the CDFs of the fitted distributions and the cumulative hazard functions computed from the parameter estimates. These figures demonstrate the good fit of the segmented Laplace proportional hazard (SLPH) model.

6. Concluding Remarks

This study addresses the growing need for advanced statistical models in survival analysis, particularly when dealing with data that may exhibit abrupt changes in the hazard rates during the observation period. Traditional models often fail to capture these complexities, especially when handling skewed distributions and heavy-tailed data. The segmented proportional hazard models we propose in this paper contribute to filling this gap by offering enhanced flexibility for accurately modeling such behaviors.

Our introduction of segmented proportional hazard models, including the Rayleigh, log-logistic, Lindley, and Laplace distributions, provides a new framework for capturing the nuances in survival data, particularly in medical research where sudden shifts in failure rates are common. By incorporating a censoring mechanism, our approach also addresses a critical aspect of real-world survival data that has previously been overlooked in segmented models. This makes our model well-suited for medical applications, such as the analysis of time-to-event data or survival times in clinical trials.

Furthermore, our development of the maximum likelihood estimation procedure for these models, coupled with its proven asymptotic properties, strengthens the theoretical foundation of segmented hazard models in survival analysis. The ability of our method to adapt to varying data patterns is supported by the real-data application to IQ scores, in which the segmented Laplace proportional hazard model outperformed the competing models. This finding underscores the model’s practical utility in medical research, particularly for analyzing datasets that exhibit non-normality, censoring, and abrupt changes in hazard rates.

Author Contributions

Conceptualization, G.M., R.B.A.F., and C.B.; data curation, G.M., R.B.A.F., and C.B.; formal analysis, G.M. and R.B.A.F.; funding acquisition, C.B.; investigation, G.M., R.B.A.F., and C.B.; methodology, G.M.; project administration, G.M. and C.B.; resources, G.M., R.B.A.F., and C.B.; supervision, G.M., R.B.A.F., and C.B.; visualization, C.B.; writing—original draft, G.M., R.B.A.F., and C.B.; writing—review and editing, C.B. All authors have read and agreed to the published version of the manuscript.

Funding

This project was supported in a collaborative and non-monetary way by the Universidad de Córdoba through the SFCB-07-23 project (Modelos de regresión segmentados con respuestas distribuidas Hazard proporcional como herramienta para el análisis estadístico de datos), and by the Institución Universitaria ITM, which contributed time and resources for the researchers.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

G.M. acknowledges the support provided by Universidad de Córdoba, Montería, Colombia. R.B.A.F. thanks the Universidade Federal do Ceará-Brazil. C.B. expresses his sincere gratitude to the Institución Universitaria ITM.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

PHD	Proportional hazard distribution
HPSM	Proportional hazard segmented model
MN	Mixture of normals
HNS	Segmented half-normal
HNSHP	Segmented half-normal proportional hazard
LS	Segmented Laplace
SLPH	Segmented Laplace proportional hazard
SM	Segmented model of hazard function
SN	Skew normal

References

1. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723.
2. Agamez-Montalvo, G.S. Modelos de mistura finita usando a classe de distribuições alpha potência. Doctoral thesis, Universidade de São Paulo, Brasil.
3. Bathiany, S.; Dijkstra, H.; Crucifix, M.; Dakos, V.; Brovkin, V.; Williamson, M.S.; Lenton, T.M.; Scheffer, M. Beyond bifurcation: using complex models to understand and predict abrupt climate change. Dynamics and Statistics of the Climate System 2016, 1(1), 2059-6987.
4. Cavanaugh, J.E. Unifying the derivations for the Akaike and corrected Akaike information criteria. Statistics & Probability Letters 1997, 33(2), 201-208.
5. Chen, X.; Baron, M. Change-point analysis of survival data with application in clinical trials. Open Journal of Statistics 2014, 4(09), 663.
6. Chen, Wx.; Long, Cx.; Yang, R.; Dong-sen Y. Maximum Likelihood Estimator of the Location Parameter under Moving Extremes Ranked Set Sampling Design. Acta Math. Appl. Sin. Engl. 2021, 37, 101–108.
7. Coelho-Barros, E.A.; Achcar, J.A.; Martinez, E.Z.; Davarzani, N.; Grabsch, H.I. Bayesian Inference for the Segmented Weibull Distribution. Colombian Journal of Statistics 2019, 42(2), 225–243.
8. Conners, T.E. Segmented models for stress-strain diagrams. Wood Sci. Technol. 1989, 23, 65–73.
9. Feigl, P.; Zelen, M. Estimation of exponential survival probabilities with concomitant information. Biometrics 1965, 21, 826-838.
10. Friedman, M. Piecewise exponential models for survival data with covariates. Ann Statist 1982, 10, 101-113.
11. Gómez, Y.M.; Gallardo, D.I.; Arnold, B.C. The power piecewise exponential model. Journal of Statistical Computation and Simulation 2018, 88(5), 825-840.
12. Gupta, R.C.; Brown, N. Reliability studies of skew normal distribution and its application to a strength-stress model. Commun Stat Theory Method 2001, 30(11), 2427-2445.
13. Hannan, E.; Quinn, B. The determination of the order of an autoregression. Journal of the Royal Statistical Society. Series B 1979, 41(2), 190-195.
14. Li, H.; Zuo, H.; Su, Y.; Xu, J.; Yin, Y. Study on segmented distribution for reliability evaluation. Chin J Aeronaut 2016, 30(1), 310-329.
15. Martínez-Flórez, G.; Moreno-Arenas, G.; Vergara-Cardoso, S. Properties and Inference for Proportional Hazard Models. Colombian Journal of Statistics 2013, 36, 95–114.
16. Mazucheli, J.; Coelho-Barros, E.A.; Achcar, J.A. Inferences for the change-point of the exponentiated Weibull hazard function. REVSTAT 2012, 10(3), 309-322.
17. Perevaryukha, A.Y. Modeling Abrupt Changes in Population Dynamics with Two Threshold States. Cybern Syst Anal 2016, 52, 623-630.
18. Reich, B.J.; Smith, L.B. Bayesian quantile regression for censored data. Biometrics 2013, 69(3), 651-660.
19. Roberts, H.V. Data Analysis for Managers with Minitab. Scientific Press, Redwood City, 1988.
20. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2022.
21. Samawi, H.; Yu, L.; Yin, J. On Cox proportional hazards model performance under different sampling schemes. PLOS ONE 2023, 18(4), e0278700.
22. Zhang, S.; Zhang, H.; Li, J.; Li, J. AGCT: a hybrid model for identifying abrupt and gradual change in hydrological time series. Environ Earth Sci 2019, 78(43).

Figure 1. (a) Empirical CDF (solid line), SLPH (dashed line), LS (dotted line), and Laplace (dotted and dashed line). (b) Empirical CDF (solid line), SLPH (dashed line), Gamma (dotted line), and SN (dotted and dashed line). (c) Cumulative hazard function: empirical (solid line), SLPH (dashed line), LS (dotted line), and Laplace (dotted and dashed line).

Table 1. AIC, CAIC and HQIC criteria for the fitted models.

Distribution	${\hat{l}}_{1}$	AIC	CAIC	HQIC
Laplace		641.708	643.9977	643.6944
LS	108.0222	641.6922	644.7422	646.6569
SLPH	111.9693	637.8516	641.2693	643.8092
HNS	99.9687	682.7748	685.2626	685.7536
HNSHP	99.9687	683.4016	686.1423	687.3734
SN		644.612	647.0998	647.5908
Gamma		642.9468	645.2360	644.9327
Normal		646.3490	648.6382	648.3349
MN		646.4074	649.4574	651.3721

Table 2. Estimation of the parameters, along with their standard errors, for the Laplace, LS, SLPH, HNS, and HNSHP models.

Estimate	Laplace	LS	SLPH	HNS	HNSHP
${\hat{μ}}_{1}$	112.0004	110.6122	105.9556
	(0.0308)	(1.1075)	(0.4240)
${\hat{μ}}_{2}$		113.3408	111.9993
		(0.9016)	(0.1693)
${\hat{σ}}_{1}$	7.1859	5.6885	5.9306	1382.2189	272.5887
	(0.7706)	(1.3427)	(1.1603)	(598.4787)	(108.9069)
${\hat{σ}}_{2}$		7.3183	2.1697	6.2681	17.7197
		(1.0452)	(0.5466)	(2.5036)	(0.1300)
$\hat{α}$			0.3093		0.1867
			(0.0638)		(0.0214)

Table 3. Estimation of the parameters, along with their standard errors, for the SN, Gamma, Normal, and MN models.

Estimate	SN	Gamma	Normal	MN
${\hat{μ}}_{1}$	105.724		112.862	111.6556
	(3.119)		(1.028)	(2.1347)
${\hat{μ}}_{2}$				113.7674
				(2.5908)
${\hat{σ}}_{1}$	11.910	1.2396	9.589	35.9973
	(2.076)	(0.1880)	(0.2349)	(42.1964)
${\hat{σ}}_{2}$				130.2105
				(81.5536)
$\hat{α}$	1.143	139.9127		0.4278
	(0.684)	(21.1856)		(0.6189)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the authors and the preprint are cited in any reuse.
