Asymmetric Kernel Density Estimation for Biased Data

Yoshihide Kakizawa

doi:10.20944/preprints202306.1944.v1

Submitted:

27 June 2023

Posted:

28 June 2023

You are already at the latest version

Abstract

Nonparametric density estimation for nonnegative data is considered in a situation where a random sample is not directly available but the data are instead observed from the length-biased sampling. Due to the boundary bias problem of the location-scale kernel, the approach in this paper is an application of asymmetric kernel. Two nonparametric density estimators are proposed. The mean integrated squared error, strong consistency, and asymptotic normality of the estimators are investigated. Some simulations illustrate the finite sample performance of the estimators.

Keywords:

Biased data

;

nonparametric density estimation

;

boundary bias

;

asymmetric kernel

Subject:

Computer Science and Mathematics - Probability and Statistics

1. Introduction

Nonparametric density estimation is a major issue of statistics and econometrics. Although the kernel density estimator (KDE) of location-scale type, originally proposed by Rosenblatt (1956) and Parzen (1962), is perhaps the most popular in the literature, its boundary bias problem is an important matter of concern when the support of the density to be estimated is not the whole real line

R

. Various remedies for avoiding the boundary bias problem have been discussed, on the basis of renormalization, reflection, and generalized jackknifing (Jones (1993)), transformation (Marron and Ruppert (1994)), advanced reflection (Zhang et al. (1999)), and so on. Recently, there has been a vast literature on this subject, especially, with the renewed interest in revisiting an asymmetric kernel method after Chen (1999, 2000).

For the standard statistical inference problem of nonnegative data, it is assumed that a random sample

X_{1}, \dots, X_{n}}

of a size n is drawn from a population with density

f (x)

,

x \geq 0

. Practically, one encounters many situations where the available data are observed only under a certain biased sampling scheme (see, e.g., Patil and Rao (1978)). Then, it may be reasonable to regard a functional of the biased distribution as the inferential target. But, some analyses need the estimation referring to the original distribution. Cox (1969) considered estimating the population mean

μ = \int_{0}^{\infty} t f (t) d t

(

> 0

) and cumulative distribution function

F (x) = \int_{0}^{x} f (t) d t

,

x > 0

. An overview of nonparametric functional estimation under the biased sampling scheme is found in Cristóbal and Alcalá (2001).

Nonparametric density estimation from the biased data started in the late 1980s. Two important papers often quoted are Bhattacharyya et al. (1988) and Jones (1991) (see also Richardson et al. (1991) and Guillamón et al. (1998)). Jones (1991) studied a kernel smoothing on the basis of Cox’s (1969) distribution estimator, without cares about the boundary bias problem of the classical Rosenblatt–Parzen location-scale KDE. The main contribution of this paper is to revisit the nonparametric density estimation under the biased sampling scheme, using asymmetric kernel method which enables us to avoid the boundary bias problem and then have desirable asymptotic properties. Our approach is different from Mnatsakanov and Ruymgaart (2003,2006) on moment-type density estimation motivated by the so-called moment problem, and Chaubey et al. (2010) using Hill’s lemma (Feller (1971; (1.5) of page 220)).

The rest of this paper is organized as follows. Section 2 describes the length-biased (LB) distribution and illustrates, in detail, the boundary behavior of a convolution integral. After a brief introduction of the asymmetric kernel method, two density estimators from the LB data are proposed, in parallel with those of Bhattacharyya et al. (1988) and Jones (1991). Section 3 and Section 4 state the required assumptions and main results of this paper. All proofs are given in the appendix. In Section 5, some simulations illustrate the finite sample performance of the density estimators.

As usual, we use the notation

{| | h | |}_{S} = {sup}_{x \in S} | h (x) |

for any bounded function h on

S

. We write, for

j \in N

,

h^{(j)} (x) = {(d / d x)}^{j} h (x)

(if it exists), and

h^{(0)} (x) = h (x)

. For an estimator

\hat{f} (x)

of

f (x)

, where

x \geq 0

, the mean squared error (MSE) is defined by

M S E [\hat{f} (x)] = E [{\hat{f} (x) - f (x)}^{2}] = B i a s^{2} [\hat{f} (x)] + V [\hat{f} (x)],

and the mean integrated squared error (MISE);

M I S E [\hat{f}] = \int_{0}^{\infty} M S E [\hat{f} (x)] d x

is a global measure of discrepancy of

\hat{f}

from f.

2. Preliminaries

2.1. LB Density

Nonparametrically, we wish to estimate the density f with nonnegative support, in a situation where a random sample

{X_{1}, \dots, X_{n}}

is not directly available but a sample

{Y_{1}, \dots, Y_{n}} = Y_{n}

(say) is instead observed from the LB distribution having the density

f_{LB} (x) = \frac{x f (x)}{μ}, x \geq 0 .

Throughout this paper, if there is no confusion, the

X_{i^{'}}

s are iid copies of the random variable X having the density f, whereas the

Y_{i}

’s are iid copies of the random variable Y having the LB density

f_{LB}

. We repeatedly use the fact that, for a measurable function G and a real number r,

E [Y^{r} G (Y)] = \int_{0}^{\infty} G (t) t^{r} f_{LB} (t) d t = \frac{1}{μ} \int_{0}^{\infty} G (t) t^{r + 1} f (t) d t = \frac{1}{μ} E [X^{r + 1} G (X)] (i f i t e x i s t s) .

2.2. Boundary Bias Problem and Asymmetric Kernel Method

When

supp (g) = [0, \infty)

, it is well known that a usual approximation (near the origin) of a certain smooth nonnegative function g as the convolution integral does not hold when

g (0) > 0

. We emphasize that the following fact is a starting point in the present paper: Even for the case

g (0) = 0

, the convergence rate near the origin

x = 0

is slower when

g^{'} (0) \neq 0

; a typical example is

g (x) = x e^{- x}

.

More precisely, if k is a symmetric density on

[- 1, 1]

(say) and

h = h_{n} > 0

is a bandwidth which tends to zero as

n \to \infty

(hereafter, we will omit the phrase “as the sample size n tends to the infinity” unless otherwise stated, and we denote by x the location where the density estimation is made), then, as shown in, e.g., Jones (1993),

\begin{matrix} \int_{0}^{\infty} \frac{1}{h} k (\frac{x - s}{h}) g (s) d s - g (x) \\ = & \int_{- 1}^{min (x / h, 1)} k (t) g (x - h t) d t - g (x) \\ = & \{\begin{matrix} \int_{- 1}^{1} k (t) g (x - h t) d t - g (x) \approx h^{2} \frac{g^{''} (x)}{2} \int_{- 1}^{1} t^{2} k (t) d t, \\ \int_{- 1}^{p} k (t) g (x - h t) d t - g (x) \approx - g (0) \int_{p}^{1} k (t) d t + h g^{'} (0) \int_{p}^{1} (- p + t) k (t) d t, \end{matrix} \end{matrix}

according to

x \geq h

or

x = h p

(

0 \leq p \leq 1

), because the kernel

k ((x - \cdot) / h) / h

creates a mass outside

[0, \infty)

when the location x is at or near the origin. This is the motivation that, instead of the location-scale kernel

k ((x - \cdot) / h) / h

, we focus on an application of an asymmetric kernel

k (\cdot; β, x)

, whose support matches the support of the target function, where

β = β_{n} > 0

is a smoothing parameter, with

β \to 0

. It should be remarked that, after Chen (2000), the notation

β

(rather than h) is common in the asymmetric kernel method; indeed, the parameter

β

for the asymmetric kernel under consideration has no meaning of the bandwidth h as in the classical Rosenblatt–Parzen location-scale kernel with a compact support, although

β

corresponds to

h^{2}

and controls the bias-variance trade-off. This is why we refer to

β

as a smoothing parameter (rather then a bandwidth).

We formally require that

\int_{0}^{\infty} k (s; β, x) g (s) d s = g (x) + O (β) f o r a n y x \geq 0,

if

supp (g) = [0, \infty)

. To the best of our knowledge, Silverman (1986; page 28) first mentioned a fairly simple idea of using gamma and log-normal (LN) kernels for the nonnegative data, where the kernel shape varies according to

(β, x)

. Perhaps, Chen’s (2000) gamma kernel, defined by1

k^{(G)} (s; β, x) = \frac{1}{β} {(\frac{s}{β})}^{x / β} \frac{e^{- s / β}}{Γ (x / β)}, s, x \geq 0,

plays a central role in this area; however, some simulations reveal that when the the target density at the origin is zero, the gamma kernel is disadvantage compared to the LN kernel (e.g., Igarashi (2016)). Subsequent authors have discussed various kernels like LN, inverse Gaussian (IG), reciprocal IG (RIG), Birnbaum–Saunders (BS), and inverse gamma. The analysts can now choose what they like, among many options available for the kernel with support

[0, \infty)

. According to Igarashi and Kakizawa (2020) (see also Kakizawa (2021)), let us choose

k (\cdot; β, x)

in the following form:

Definition 1

(Igarashi and Kakizawa (2020)). Given a baselined density

p (\cdot; \cdot)

, we set

k (s; β, x) = \frac{1}{β} p (\frac{s}{β}; \frac{x}{β}), s, x \geq 0,

where the functional form of p, with nonnegative support, is independent of

β

and x.

To make this formulation clear, let

f_{g}^{(q B S)} (s; θ_{1}, θ_{2}, θ_{3}) = C_{g} g ({\{\frac{a^{〈 q 〉} (s / θ_{2})}{θ_{1}} - θ_{3}\}}^{2}) \frac{A^{〈 q 〉} (s / θ_{2})}{θ_{1} θ_{2}}, s \geq 0,

be a symmetrical-based qBS density, associated with a density generator g, where

θ_{1}, θ_{2} > 0

,

θ_{3} \in R

,

a^{〈 q 〉} (t) = \{\begin{matrix} \frac{1}{2 q} (t^{q} - t^{- q}), & q \neq 0, \\ log t, & q = 0, \end{matrix} a n d A^{〈 q 〉} (t) = \frac{1}{2} (t^{q - 1} + t^{- (q + 1)})

(due to the fact

(t^{q} - t^{- q}) / q = (t^{| q |} - t^{- | q |}) / | q |

, it is enough to take

q \geq 0

). Note that the density

C_{g} g (u^{2})

on

R

is symmetric about the origin, where

1 / C_{g} = \int_{0}^{\infty} y^{- 1 / 2} g (y) d y

. Here, it is common to standardize g so that

\int_{- \infty}^{\infty} u^{2} C_{g} g (u^{2}) d u = 1

, without loss of generality. Indeed, as long as

\int_{- \infty}^{\infty} u^{2} C_{g} g (u^{2}) d u = J

for some constant

J = J_{g} > 0

, this standardization can always be imposed with the replacement of

g (y)

by

J^{1 / 2} g (J y)

; the nomalizing constant

C_{g}

is then invariant, i.e.,

\int_{- \infty}^{\infty} J^{1 / 2} g (J u^{2}) d u = \int_{- \infty}^{\infty} g (t^{2}) d t = 1 / C_{g}

.

Given constants

q \geq 0

and

c > 0

, a family of symmetrical-based non-central qBS kernels (Kakizawa (2021))2 is defined by

k_{g}^{(q B S)} (s; β, x) = f_{g}^{(q B S)} (s; \sqrt{1 / (x / β + c)}, β (x / β + c), θ \sqrt{1 / (x / β + c)}), s, x \geq 0,

where

θ \in R

. Kakizawa (2018) considered the central case

θ = 0

. Such a family

k_{g}^{(q B S)}

is flexible via the (infinite-dimensional) density generator g, as well as the parameter

q \geq 0

. In some numerical studies of Section 5, we will put

q = 1 / 2

(symmetrical-based BS kernel) or

q = 0

(log-symmetrical (LS) kernel), with

(θ, c) = (0, 1)

, for simplicity, and use the power exponential (PE) generator;

g_{PE [p]} (y) = exp (- λ_{p} y^{p})

,

p \geq 1 / 2

, where the particular choice

λ_{p} = {Γ (3 / (2 p)) / Γ (1 / (2 p))}^{p}

ensures that the PE density has the variance 1, like the standard normal density. Other generators (Kotz-type, generalized Pearson-type VII, and generalized logistic-type III) are found in Kakizawa (2018,2021). A symmetrical-based central qBS kernel belongs to a family of symmetrical-based qMIG kernels (MIG is an abbreviation of a mixture of IG and RIG), which is enlarged, linking to a class of skew-BS type kernels. See Kakizawa (2018,2021).

Remark 1.

Chaubey and Li (2013) applied Scaillet’s (2004) RIG kernel. As pointed out by Igarashi and Kakizawa (2014), an RIG KDE, however, also suffered from the boundary bias problem, so that the re-formulated RIG kernel should be applied.

2.3. Two density estimators under LB sampling scheme

Using

E [Y^{- 1}] = \int_{0}^{\infty} t^{- 1} f_{LB} (t) d t = 1 / μ

, as well as the relation

f (x) = \frac{μ}{x} f_{LB} (x) = \frac{x^{- 1} f_{LB} (x)}{E [Y^{- 1}]}, x > 0,

our first estimator is defined in parallel with that of Bhattacharyya et al. (1988), as follows:

{\tilde{f}}_{β, ϵ} (x) = \frac{{(n x)}^{- 1} \sum_{i = 1}^{n} k (Y_{i}; β, x)}{n^{- 1} \sum_{i = 1}^{n} {(Y_{i} + ϵ)}^{- 1} + ϵ}, x > 0 .

Except for a technical issue that needs to take a small

ϵ \propto n^{- 1 / 2}

, this estimator may be natural in the sense that

f_{LB} (x)

,

x > 0

, is consistently estimable by the asymmetric KDE

n^{- 1} \sum_{i = 1}^{n} k (Y_{i}; β, x)

(Bhattacharyya et al. (1988) used the classical Rosenblatt–Parzen KDE

n^{- 1} \sum_{i = 1}^{n} (1 / h) k ((x - Y_{i}) / h)

). On the other hand, Jones’s (1991) idea;

\frac{E [Y^{- 1} (1 / h) k ((x - Y) / h)]}{E [Y^{- 1}]} = \int_{0}^{\infty} \frac{1}{h} k (\frac{x - s}{h}) f (s) d s

(see Cox (1969)) is also reasonable. However, in order to solve its boundary bias problem unless

f (0) = f^{'} (0) = 0

(see Introduction), our second estimator

{\hat{f}}_{β, ϵ} (x) = \frac{n^{- 1} \sum_{i = 1}^{n} Y_{i}^{- 1} k (Y_{i}; β, x)}{n^{- 1} \sum_{i = 1}^{n} {(Y_{i} + ϵ)}^{- 1} + ϵ}, x \geq 0

is proposed, for which some asymptotic properties will be studied in Subsection 4.1.

Before we proceed with description of our required assumptions (Section 3) and specific asymptotic results (Section 4), we highlight a novelty, compared with Jones (1991). Suppose that

f (0) = 0

but

f^{'} (0) \neq 0

(we have an example

f (x) = x e^{- x}

,

x \geq 0

). Then,

Jones’s (1991) estimator ${\hat{f}}_{h, Jones}$ , based on the location-scale kernel with the support $[- 1, 1]$ (say), suffers from the boundary bias problem, i.e., $B i a s [{\hat{f}}_{h, Jones} (x)] = O (h^{2})$ for $x \geq h$ and $B i a s [{\hat{f}}_{h, Jones} (x)] = O (h)$ for $0 \leq x \leq h$ , and, as a result, it is shown that

$M I S E [{\hat{f}}_{h, Jones}] = O (h^{3} + {(n h)}^{- 1}) (= O (n^{- 3 / 4}));$
our estimator ${\hat{f}}_{β, ϵ}$ achieves the convergence rate $n^{- 4 / 5}$ of the MISE (see Theorem 4).

We notice that

{\hat{f}}_{β, ϵ}

is more preferable, since

{\tilde{f}}_{β, ϵ}

has the factor

x^{- 1}

(it is numerically unstable near the origin); besides, a rigorous error analysis for the (unweighted) MISE of

{\tilde{f}}_{β, ϵ}

seems to be hard technically (or it might be impossible), although the pointwise MSE (for

x > 0

) and weighted MISE of

{\tilde{f}}_{β, ϵ}

are tractable (see Subsection 4.2).

3. Assumptions

We use the notation

f_{- j} (\cdot) = f (\cdot) / {(\cdot)}^{j}

. In order to prove asymptotic properties of

{\hat{f}}_{β, ϵ}

(Subsection 4.1), the following set of assumptions, labeled as F, is imposed for the density f to be estimated:

F.: (i) 1. f is a twice continuously differentiable function on $[0, \infty)$ , where $f, f^{'}$ , and $f^{''}$ are bounded;

2. $f^{''}$ is a Hölder-continuous function (with exponent $0 < η \leq 1$ ) on $[0, \infty)$ , i.e., there exists a constant $L > 0$ , such that $| f^{''} (u) - f^{''} {(v) | \leq L | u - v |}^{η}$ for any $u, v \geq 0$ .

(ii) 1. $f_{- 1}$ is a bounded function on $[0, \infty)$ ;

2. $f_{- 1}$ is a Hölder-continuous function (with exponent $0 < η^{'} \leq 1$ ) on $[0, \infty)$ .

(iii) the inverse moment of X; $E [X^{- 1}] = \int_{0}^{\infty} t^{- 1} f (t) d t = \int_{0}^{\infty} f_{- 1} (t) d t = μ_{- 1}^{'}$ (say) exists (note that $E [Y^{- 2}] = μ^{- 1} E [X^{- 1}]$ ).

(iii $^{'}$ ) $E [X^{- (1 + q)}] = \int_{0}^{\infty} f_{- (1 + q)} (t) d t = μ_{- (1 + q)}^{'}$ (say) exists for some constant $q > 0$ .

(iv) $\int_{0}^{\infty} {f^{'} (t)}^{2} d t, \int_{0}^{\infty} {t f^{''} (t)}^{2} d t$ , and $\int_{0}^{\infty} f_{- 3 / 2} (t) d t$ exist.

Remark 2.

Under the boundedness of

f_{- 1}

, given in F(ii.1), the density f to be estimated must have a constraint

f (0) = 0

(note that F(iii

^{'}

) for some

q > 1

implies

f_{- 1} (0) = 0

). However, as illustrated earlier, even in the case

f (0) = 0

, Jones’s (1991) estimator suffers from the boundary bias problem when

f^{'} (0) \neq 0

(we have an example

f (x) = x e^{- x}

,

x \geq 0

).

On the other hand, for

{\tilde{f}}_{β, ϵ}

(Subsection 4.2), we additionally make some assumptions on the corresponding LB density, labeled as F

^{†}

:

F $^{†}$ .: 1. In addition to F(i.1), $f_{LB}, f_{LB}^{'}$ , and $f_{LB}^{''}$ are bounded functions3 on $[0, \infty)$ ;

2. $f_{LB}^{''}$ is a Hölder-continuous function (with exponent $0 < η^{''} \leq 1$ ) on $[0, \infty)$ .

Lastly, high-level conditions on the kernel

k (\cdot; β, x)

, labeled as A, are needed, whose details will be given in the top of Appendix. For notational simplicity, we write

B_{g} (x) = ζ_{1, 1} g^{'} (x) + \frac{ζ_{2, 1}}{2} x g^{''} (x), V_{g} (x) = ζ \frac{g (x)}{x^{1 / 2}},

where the constants

ζ_{1, 1}, ζ_{2, 1}

, and

ζ

(

ζ_{2, 1}, ζ > 0

) appear in Assumption A1.2–3 of Appendix. It is convenient for us to define

J_{β}^{〈 k, g 〉} (x) = \int_{0}^{\infty} k (s; β, x) g (s) d s

,

B_{β}^{〈 k, g 〉} (x) = J_{β}^{〈 k, g 〉} (x) - g (x)

, and

J_{β}^{〈 k^{2}, g 〉} (x) = \int_{0}^{\infty} k^{2} (s; β, x) g (s) d s

. Obviously, the following inequalities hold:

J_{β}^{〈 k, g 〉} (x) \leq {| | g | |}_{[0, \infty)}, | B_{β}^{〈 k, g 〉} {(x) | \leq 2 | | g | |}_{[0, \infty)}, J_{β}^{〈 k^{2}, g 〉} (x) \leq \{sup_{s \geq 0} k (s; β, x)\} J_{β}^{〈 k, g 〉} (x) .

It should be remarked that most of the items in Assumption F (or F

^{†}

) are needed to approximate

J_{β}^{〈 \cdot, \cdot 〉} (x)

, whose error analyse is found in Appendix. Roughly speaking, we obtain

J_{β}^{〈 k, f 〉} (x) \approx f (x) + β B_{f} (x)

under F(i) and

μ J_{β}^{〈 k^{2}, f_{- 1} 〉} (x) \approx β^{- 1 / 2} μ V_{f_{- 1}} (x)

under F(ii), for the estimator

{\hat{f}}_{β, ϵ}

. Also, for the estimator

{\tilde{f}}_{β, ϵ}

, it is shown that, under F

^{†}

,

\begin{matrix} \frac{μ}{x} J_{β}^{〈 k, f_{LB} 〉} (x) & \approx & \frac{μ}{x} [f_{LB} (x) + β B_{f_{LB}} (x)] = f (x) + β B (x), \\ {(\frac{μ}{x})}^{2} J_{β}^{〈 k^{2}, f_{LB} 〉} (x) & \approx & {(\frac{μ}{x})}^{2} \frac{V_{f_{LB}} (x)}{β^{1 / 2}} = \frac{μ V_{f_{- 1}} (x)}{β^{1 / 2}}, \end{matrix}

where

B (x) = (μ / x) B_{f_{LB}} (x)

. Note that

B (x) = ζ_{1, 1} {x^{- 1} f (x) + f^{'} (x)} + ζ_{2, 1} \{f^{'} (x) + \frac{x}{2} f^{''} (x)\} = B_{f} (x) + ζ_{1, 1} f_{- 1} (x) + ζ_{2, 1} f^{'} (x) .

We can see that, under F(i.1 and iv),

\int_{0}^{\infty} B_{f}^{2} (x) d x = I^{B_{f}^{2}}

(say) and

\int_{0}^{\infty} V_{f_{- 1}} (x) d x = I^{V_{f_{- 1}}}

(say) are well-defined; besides4,

\int_{0}^{\infty} B^{2} (x) d x = I^{B^{2}}

(say) is well-defined (we assume F(ii.1)).

4. Main Results

We assume that

B( $ι$ ).: $β = n^{- ι} ℓ (n)$ , where ℓ is a (positive) slowly varying function.

Note that all powers of

log y

and a function

L (y)

approaching a positive limit vary slowly. For the achievement of the optimal rate of the M(I)SE,

β = C n^{- 2 / 5}

must be feasible, at least, where

C > 0

is a constant, independent of n.

In what follows, let

ϵ = C^{'} n^{- 1 / 2}

, where

C^{'} > 0

is a constant, independent of n. We write

\begin{matrix} ω_{β, η} (x) & = & β^{3 / 2} x^{- 1 / 2} + β^{2} + {(β x)}^{1 + η / 2}, \\ ω_{β, η} (x; g) & = & β^{- 1 / 2} V_{g} (x) {(β x^{- 1})}^{1 / 2} + χ_{{0 < η < 1}} {(β x)}^{(η - 1) / 2}, \end{matrix}

where

χ_{S}

is the indicator function of the set S.

4.1. Asymptotic Properties of ${\hat{f}}_{β, ϵ}$

Theorem 1.

Suppose that AssumptionsA1, A2.1(

ν = 0, 1

); see Appendix, and F(i–iii)hold. UnderB(

0 < ι < 1)

, given constants

c_{L} > 0

and

0 < τ < 1

, we have

\begin{matrix} 1 . sup_{0 \leq x \leq c_{L} β^{τ}} | B i a s [{\hat{f}}_{β, ϵ} (x)] | = O (β^{τ} + n^{- 1 / 2}), \\ sup_{0 \leq x \leq c_{L} β^{τ}} V [{\hat{f}}_{β, ϵ} (x)] = O (n^{- 1} β^{- 1} + n^{- 1 / 2} β^{2 τ}); \\ 2 . f o r x \geq c_{L} β^{τ}, \\ B i a s [{\hat{f}}_{β, ϵ} (x)] = β B_{f} (x) + R_{β}^{Bias} (x), V [{\hat{f}}_{β, ϵ} (x)] = n^{- 1} β^{- 1 / 2} μ V_{f_{- 1}} (x) + R_{β}^{V} (x), \end{matrix}

with

\begin{matrix} | R_{β}^{Bias} (x) | & \leq & M [ω_{β, η} (x) + n^{- 1 / 2} + n^{- 1 / 2} β | B_{f} (x) |], \\ | R_{β}^{V} (x) | & \leq & M^{'} [n^{- 1} {ω_{β, η^{'}} (x; f_{- 1}) + 1 + V_{f_{- 1}} (x)} + n^{- 1 / 2} {β^{2} B_{f}^{2} (x) + ω_{β, η}^{2} (x)}], \end{matrix}

where

M, M^{'} > 0

are constants, independent of

n, β

, and x. Also, we have

\begin{matrix} B i a s [{\hat{f}}_{β, ϵ} (0)] & = & β f^{'} (0) \int_{0}^{\infty} u p (u; 0) d u + O (n^{- 1 / 2} + β^{2}), \\ V [{\hat{f}}_{β, ϵ} (0)] & = & n^{- 1} β^{- 1} μ f_{- 1} (0) \int_{0}^{\infty} p^{2} (u; 0) d u \\ + O (n^{- 1} β^{η^{'} - 1} + n^{- 1} + n^{- 3 / 2} β^{- 1} + n^{- 1 / 2} β^{2}) . \end{matrix}

Remark 3.

(i) The asymptotic bias and variance of

{\hat{f}}_{β, ϵ} (x)

when x is near the origin;

x / β \to κ

, where

κ \geq 0

is finite, can be obtained, as in Kakizawa (2018,2021). The details are omitted.

(ii) The pointwise MSE of

{\hat{f}}_{β, ϵ}

is a corollary of Theorem 1(2), as follows: For fixed

x > 0

,

M S E [{\hat{f}}_{β, ϵ} (x)] = A M S E_{x} [β] + o (β^{2} + n^{- 1} β^{- 1 / 2}),

where

\begin{matrix} A M S E_{x} [β] & = & β^{2} B_{f}^{2} (x) + n^{- 1} β^{- 1 / 2} μ V_{f_{- 1}} (x) \\ \geq & \frac{5}{4^{4 / 5}} {B_{f}^{2} (x)}^{1 / 5} {μ V_{f_{- 1}} (x)}^{4 / 5} n^{- 4 / 5} i f B_{f} (x) \neq 0 \end{matrix}

(the equality holds iff

β = {[{μ V_{f_{- 1}} (x)} / {4 B_{f}^{2} (x)}]}^{2 / 5} n^{- 2 / 5}

).

(iii) We have

M S E [{\hat{f}}_{β, ϵ} (0)] = O (β^{2} + n^{- 1} β^{- 1})

(

= O (n^{- 2 / 3})

if

f^{'} (0) f_{- 1} (0) \neq 0

).

Theorem 2.

Suppose that AssumptionsA1, A2.1(

ν = 0, 1

); see Appendix, andF(i–iii)hold. UnderB(

0 < ι < 1 / 2)

, we have

{\hat{f}}_{β, ϵ} (x) \overset{a . s .}{⟶} f (x)

for fixed

x > 0

(note that

{\hat{f}}_{β, ϵ} (0) \overset{a . s .}{⟶} f (0) = 0

).

Theorem 3.

Suppose that AssumptionsA1, A2.2(

ν = 0, 1

); see Appendix, andF(i–iii

^{'}

)hold.

(i)UnderB(

0 < ι < min {2 q / (2 + q), 1}

), we have

{(n β^{1 / 2})}^{1 / 2} {{\hat{f}}_{β, ϵ} (x) - E [{\hat{f}}_{β, ϵ} (x)]} \overset{d}{⟶} N (0, μ V_{f_{- 1}} (x)) f o r f i x e d x > 0 .

(ii)UnderB(

0 < ι < q / (2 + q)

), we have

{(n β)}^{1 / 2} {{\hat{f}}_{β, ϵ} (0) - E [{\hat{f}}_{β, ϵ} (0)]} \overset{d}{⟶} N (0, μ f_{- 1} (0) \int_{0}^{\infty} p^{2} (u; 0) d u) .

For fixed

x > 0

, the replacement of

E [{\hat{f}}_{β, ϵ} (x)]

by

f (x)

is a routine by combining Theorem 3(i) with a bias of Theorem 1(2): If Assumption F(iii

^{'}

) holds for some

q > 1 / 2

, and if

2 / 5 \leq ι < min {2 q / (2 + q), 1}

(for the extreme case

ι = 2 / 5

, assume

n β^{5 / 2} \to 0

), then,

{(n β^{1 / 2})}^{1 / 2} {{\hat{f}}_{β, ϵ} (x) - f (x)} \overset{d}{⟶} N (0, μ V_{f_{- 1}} (x)) f o r f i x e d x > 0 .

Theorem 4.

Suppose that Assumptions A1, A2.1(

ν = 0, 1

), A3(

H = 6 / η + 1 + δ_{0})

; see Appendix, and F hold, where

\int_{0}^{\infty} t^{2 (3 / η + 1) + δ_{0}} f (t) d t

exists for some constant

δ_{0} > 0

. UnderB(

0 < ι < 1)

, we have

M I S E [{\hat{f}}_{β, ϵ}] = A M I S E [β] + o (β^{2} + n^{- 1} β^{- 1 / 2}),

where

\begin{matrix} A M I S E [β] & = & β^{2} I^{B_{f}^{2}} + n^{- 1} β^{- 1 / 2} μ I^{V_{f_{- 1}}} \\ \geq & \frac{5}{4^{4 / 5}} {(I^{B_{f}^{2}})}^{1 / 5} {(μ I^{V_{f_{- 1}}})}^{4 / 5} n^{- 4 / 5} i f B_{f} (x) ≢ 0 \end{matrix}

(the equality holds iff

β = {(μ I^{V_{f_{- 1}}}) / (4 I^{B_{f}^{2}})}^{2 / 5} n^{- 2 / 5} = β_{opt}

(say)).

4.2. Asymptotic Properties of ${\tilde{f}}_{β, ϵ}$

Due to the presence of the factor

x^{- 1}

in the estimator

{\tilde{f}}_{β, ϵ}

, the case

x = 0

is excluded throughout this subsection; besides, in Theorem 8 (see also Remark 5), we will consider a truncated MISE of

{\tilde{f}}_{β, ϵ}

, for the global performance.

Theorem 5.

Suppose that AssumptionsA1, A2.1(

ν = 0

); see Appendix,F(ii.1 and iii), andF

^{†}

hold. UnderB(

0 < ι < 1)

, given constants

c_{L} > 0

and

0 < τ < 1

, we have, for

x \geq c_{L} β^{τ}

,

B i a s [{\tilde{f}}_{β, ϵ} (x)] = β B (x) + R_{β}^{† Bias} (x), V [{\tilde{f}}_{β, ϵ} (x)] = n^{- 1} β^{- 1 / 2} μ V_{f_{- 1}} (x) + R_{β}^{† V} (x),

with

\begin{matrix} | R_{β}^{† Bias} (x) | & \leq & M^{†} [x^{- 1} ω_{β, η^{''}} (x) + n^{- 1 / 2} (1 + x^{- 1}) + n^{- 1 / 2} β | B (x) |], \\ | R_{β}^{† V} (x) | & \leq & M^{†'} [n^{- 1} {ω_{β, η^{'}} (x; f_{- 1}) + 1 + x^{- 2} + V_{f_{- 1}} (x)} \\ + n^{- 1 / 2} {β^{2} B^{2} (x) + x^{- 2} ω_{β, η^{''}}^{2} (x)}], \end{matrix}

where

M^{†}, M^{†'} > 0

are constants, independent of

n, β

, and x.

Remark 4.

As a corollary of Theorem 5, we have, for fixed

x > 0

,

M S E [{\tilde{f}}_{β, ϵ} (x)] = A M S E_{x}^{†} [β] + o (β^{2} + n^{- 1} β^{- 1 / 2}),

where

\begin{matrix} A M S E_{x}^{†} [β] & = & β^{2} B^{2} (x) + n^{- 1} β^{- 1 / 2} μ V_{f_{- 1}} (x) \\ \geq & \frac{5}{4^{4 / 5}} {B^{2} (x)}^{1 / 5} {μ V_{f_{- 1}} (x)}^{4 / 5} n^{- 4 / 5} i f B (x) \neq 0 \end{matrix}

(the equality holds iff

β = {[{μ V_{f_{- 1}} (x)} / {4 B^{2} (x)}]}^{2 / 5} n^{- 2 / 5}

).

Theorem 6.

Suppose that AssumptionsA1, A2.1(

ν = 0

); see Appendix,F(ii.1 and iii), andF

^{†}

hold. UnderB(

0 < ι < 1)

, we have

{\tilde{f}}_{β, ϵ} (x) \overset{a . s .}{⟶} f (x)

for fixed

x > 0

.

Theorem 7.

Suppose that AssumptionsA1, A2.2(

ν = 0

); see Appendix,F(ii.1 and iii), andF

^{†}

hold. UnderB(

0 < ι < 1

), we have

{(n β^{1 / 2})}^{1 / 2} {{\tilde{f}}_{β, ϵ} (x) - E [{\tilde{f}}_{β, ϵ} (x)]} \overset{d}{⟶} N (0, μ V_{f_{- 1}} (x)) f o r f i x e d x > 0 .

For fixed

x > 0

, the replacement of

E [{\tilde{f}}_{β, ϵ} (x)]

by

f (x)

is a routine by combining Theorem 7 with a bias of Theorem 6: If

2 / 5 \leq ι < 1

(for the extreme case

ι = 2 / 5

, assume

n β^{5 / 2} \to 0

), then,

{(n β^{1 / 2})}^{1 / 2} {{\tilde{f}}_{β, ϵ} (x) - f (x)} \overset{d}{⟶} N (0, μ V_{f_{- 1}} (x)) f o r f i x e d x > 0 .

Theorem 8.

Suppose that AssumptionsA1, A2.1(

ν = 0

), A3(

H = 2 / η^{''} + 1 + δ_{0})

; see Appendix,F(ii.1, iii, and iv), andF

^{†}

hold, where

\int_{0}^{\infty} t^{2 (1 / η^{''} + 1) + δ_{0}} f_{LB} (t) d t

exists for some constant

δ_{0} > 0

. UnderB(

0 < ι < 1)

, we have, for every

0 < τ < 1 / 2

,

\int_{β^{τ}}^{\infty} M S E [{\tilde{f}}_{β, ϵ} (x)] d x = A M I S E^{†} [β] + o (β^{2} + n^{- 1} β^{- 1 / 2}),

where

\begin{matrix} A M I S E^{†} [β] & = & β^{2} I^{B^{2}} + n^{- 1} β^{- 1 / 2} μ I^{V_{f_{- 1}}} \\ \geq & \frac{5}{4^{4 / 5}} {(I^{B^{2}})}^{1 / 5} {(μ I^{V_{f_{- 1}}})}^{4 / 5} n^{- 4 / 5} i f B (x) \neg \equiv 0 \end{matrix}

(the equality holds iff

β = {(μ I^{V_{f_{- 1}}}) / (4 I^{B^{2}})}^{2 / 5} n^{- 2 / 5}

).

Remark 5.

Whether or not there exists a

0 < τ < 1 / 2

, such that

\int_{0}^{β^{τ}} M S E [{\tilde{f}}_{β, ϵ} (x)] d x

is order

o (β^{2} + n^{- 1} β^{- 1 / 2})

(under some additional unnecessary stronger conditions) would be rather technical. We do not pursue the issue any more.

5. Simulation Studies

To demonstrate the finite sample performance of the proposed density estimator

{\hat{f}}_{β, ϵ}

, we generated 1000 random samples of size

n = 200, 300, 500

from the LB density

f_{LB} (x) = x^{2} e^{- x} / 2

, and computed the PE

[p]

-based BS/log KDEs (

p = 1, 3 / 2

) and gamma KDE for the original density

f (x) = x e^{- x}

. In the simulation, we used the least squared cross-validated (LSCV) smoothing parameter, for each sample. Then, the average integrated squared errors (ISEs),

(1 / 1000) \sum_{ℓ = 1}^{1000} \int_{0}^{\infty} {{\hat{f}}_{β, ϵ, [ℓ]} (x) - f (x)}^{2} d x

, were reported in Table 1, where

{\hat{f}}_{β, ϵ, [ℓ]}

is computed from the ℓth sample.

As expected, all average ISEs decreased as the sample size n increased, which is in agreement with the MISE result. The BS/LN KDEs (

p = 1

) were, overall, improved by the estimators with

p = 3 / 2

; such a tendency can be illustrated via the AMISE relative efficiency index

\frac{{AMISE}_{opt} (p)}{{AMISE}_{opt} (1)} = {\{2^{1 - 1 / (2 p)} p \sqrt{π} \frac{Γ^{1 / 2} (3 / (2 p))}{Γ^{3 / 2} (1 / (2 p))}\}}^{4 / 5}

(see Kakizawa (2018,2021)), since

{AMISE}_{opt} (p) = \frac{5}{4} {(I^{B_{f}^{2}})}^{1 / 5} {[(\frac{C_{g_{PE [p]}}^{2}}{C_{g_{PE [p]}^{2}}}) μ \int_{0}^{\infty} f_{- 3 / 2} (x) d x]}^{4 / 5} n^{- 4 / 5} .

Needless to say, the best implemented smoothing parameter

β_{opt}

, given in Theorem 4, depends on the unknown f, so that the data-driven procedure is crucial. We tried to conduct the LSCV smoothing parameter selection5. Unlike the direct sample (Kakizawa (2018,2021)), the present LB setting, for small sample size

n = 100

(not being reported here), produced multiple local minima for the LSCV score (in many cases, it was rather unstable numerically), whereas such an undesirable behavior seemed to be fixed when

n = 300

. A further issue of considering a plug-in selection with a pilot estimator is left in future.

6. Discussion

Our asymptotic results under the LB sampling can be extended to more general biased sampling, i.e., a weighted distribution for a known (positive) weight function

ω

, given by

f_{ω} (x) \propto ω (x) f (x)

. The LB density is a special case of

ω (x) = x

, and another example is

ω (x) = x^{2}

(the area-biased density). Also, the d-variate weighted density is defined by

f_{ω} (x) \propto ω (x) f (x)

,

x = {(x_{1}, \dots, x_{d})}^{'}

. Ahmad (1995) extended Jones’s (1991) estimator to the d-variate case. Note that the product-kernel-method, using the product asymmetric kernel

\prod_{j = 1}^{d} k (X_{i j}; β_{j}, x_{j})

,

x \in {[0, \infty)}^{d}

, instead of

\prod_{j = 1}^{d} k ((x_{j} - X_{i j}) / h_{j}) / h_{j}

, or the non-product kernel (Igarashi (2018) and Kakizawa (2022)), can be straightforwardly applied to solve the boundary bias problem of Ahmad’s (1995) estimator.

Funding

The author has been supported in part by the Japan Society for the Promotion of Science; Grant-in-Aid for Scientific Research (C), 20K11700 and 23K11002.

Data Availability Statement

Not available.

Acknowledgments

Some preliminary results were first announced, without face-to-face talking (due to the pandemic of covid-19), at Japanese Joint Statistical Meeting 2021 (Japanese federation of statistical science associations), autumn meeting 2021 (Mathematical Society of Japan), and the 5th International Conference on Econometrics and Statistics (EcoStat2022).

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

KDE	kernel density estimator
LB	length-biased
MSE	mean squared error
MISE	mean integrated squared error
LN	log-normal
IG	inverse Gaussian
RIG	reciprocal inverse Gaussian
BS	Birnbaum–Saunders
LS	log-symmetrical
PE	power exponential
MIG	mixture of IG and RIG
LSCV	least squared cross-validated
ISE	integrated squared error
ROT	rule of thumb

Appendix A

Appendix A.1. Technical Conditions on k(·;β,x)

There are three indispensable requirements on the kernel

k (s; β, x)

,

s, x \geq 0

:

(I) approximations of

μ_{j} (k (\cdot; β, x)) = \int_{0}^{\infty} {(s - x)}^{j} k (s; β, x) d s

and

\int_{0}^{\infty} k^{2} (s; β, x) d s

;

(II) uniform/nonuniform bounds of

{sup}_{s \geq 0} k (s; β, x)

;

(III) behavior of the tail integral of

k (s; β, x)

with respect to x (the regularity as

x \to \infty

will be required only for dealing with an asymptotic expansion of the MISE rigorously).

More precisely, we assume (e.g., Igarashi and Kakizawa (2020) and Kakizawa (2021)):

A1.

In addition to Definition 1, there exists a density

p (\cdot; \cdot)

, such that

1.: $\int_{0}^{\infty} u^{2} p (u; 0) d u$ exists and, for any $y \geq 0$ , $\int_{0}^{\infty} u p (u; y) d u$ ( $\leq \tilde{C} (1 + y)$ ) exists, where $\tilde{C} > 0$ is a constant, independent of y;
2.: given constants $0 \leq \tilde{η} < 1$ and $c_{L} > 0$ , for all sufficiently small $β > 0$ , $x \geq c_{L} β^{\tilde{η}}$ implies that

$μ_{j} (k (\cdot; β, x)) = \{\begin{matrix} β ζ_{1, 1} + r_{1, β} (x), & j = 1, \\ β ζ_{2, 1} x + r_{2, β} (x), & j = 2, \\ r_{4, β} (x), & j = 4, \end{matrix}$

with $| r_{1, β} (x) | \leq {\tilde{M}}_{1} β^{3 / 2} / x^{1 / 2}$ , $| r_{2, β} (x) | \leq {\tilde{M}}_{2} β^{2}$ , $0 < r_{4, β} (x) \leq {\tilde{M}}_{4} β^{2} {(x + β)}^{2}$ , where $ζ_{1, 1}, ζ_{2, 1}$ ( $ζ_{2, 1} > 0$ ) and ${\tilde{M}}_{1}, {\tilde{M}}_{2}, {\tilde{M}}_{4} > 0$ are constants, independent of $β$ and x;
3.: $\int_{0}^{\infty} p^{2} (u; 0) d u$ exists, and, given constants $0 < \tilde{η} < 1$ and $c_{L} > 0$ , for all sufficiently small $β > 0$ , $x \geq c_{L} β^{\tilde{η}}$ implies that

$|\int_{0}^{\infty} k^{2} (s; β, x) d s - \frac{ζ}{{(β x)}^{1 / 2}}| \leq \frac{\tilde{M}}{{(β x)}^{1 / 2}} {(\frac{β}{x + β})}^{1 / 2},$

where $ζ, \tilde{M} > 0$ are constants, independent of $β$ and x.

A2( $ν$ ).

u_{β, ν} (x) = {sup}_{s \geq 0} {{(β / s)}^{ν} k (s; β, x)}

satisfies:

1.: ${sup}_{x \geq 0} u_{β, ν} (x) \leq L_{K, ν} β^{- 1}$ , where $L_{K, ν} > 0$ is a constant, independent of $β$ ;
2.: for $x > 0$ , $u_{β, ν} (x) \leq L_{K, ν}^{'} {(β x)}^{- 1 / 2}$ , where $L_{K, ν}^{'} > 0$ is a constant, independent of $β$ and x.

A3(H).

Given a constant

τ > 0

, and for all sufficiently small

β > 0

,

\int_{0}^{\infty} ((\int_{β^{- τ}}^{\infty} k (s; β, x) g (s) d x) d s) = O (β^{τ (H + 1)})

(assume that

\int_{0}^{\infty} s^{H + 1} g (s) d s

exists).

Most of the existing asymmetric kernels satisfy Assumptions A1, A2

(0)

, and A3(H); see Igarashi and Kakizawa (2020) and Kakizawa (2021). For instance, the constants

ζ_{1, 1}, ζ_{2, 1}

, and

ζ

(given in A1.2–3), associated with

k_{g}^{(q B S)} (\cdot; β, x)

, are given by

ζ_{1, 1} = c + θ + J_{g} / 2

,

ζ_{2, 2} = J_{g}

, and

ζ = C_{g}^{2} / C_{g^{2}}

, independent of

q \geq 0

, where

J_{g} = \int_{- \infty}^{\infty} u^{2} C_{g} g (u^{2}) d u

. Of course, we need to impose a set of requirements on the density generator g, under which A1, A2(0) and A3(H) hold for the asymmetric kernel

k_{g}^{(q B S)} (\cdot; β, x)

; see Kakizawa (2018,2021). For simplicity, we assume that there exist constants

M_{g}, B > 0

such that

g (y) \leq M_{g} e^{- B y}

for every

y \geq 0

. It remains to discuss A2

(ν > 0)

. Note that A2(

ν = 0, 1

) is technically required to prove Theorems 1–4 (i.e., (A8) and (A11) under A2(

ν = 1

)); indeed, (A11) is crucial for Lemma A3. On the other hand, A2(

ν = 0

) is enough for the proofs of Theorems 5–8.

Property A1.

The kernel

k_{g}^{(q B S)} (\cdot; β, x)

satisfies Assumption A2(ν) for any

ν \geq 0

, with

u_{β, ν}^{(q B S)} (x) \leq \{\begin{matrix} \frac{C_{g} exp \{\frac{ν + 1}{c} max (- θ, 0)\}}{c^{ν} {β (x + β c)}^{1 / 2}} sup_{u \in R} \{exp (\frac{ν + 1}{c^{1 / 2}} | u |) g (u^{2})\}, & q = 0, \\ \frac{C_{g} {\{\frac{8 q^{2}}{c} (1 + \frac{θ^{2}}{c}) + 2\}}^{(q + ν + 1) / (2 q)}}{c^{ν} {β (x + β c)}^{1 / 2}} M_{g, ν} (q), & q > 0, \end{matrix}

where

M_{g, ν} (q) = {sup}_{y \geq 0} [{(y + 1)}^{(q + ν + 1) / (2 q)} g (y)]

.

Proof.

Recall that, with

α (y) = 1 / {(y + c)}^{1 / 2}

(note that

{sup}_{y \geq 0} α (y) = 1 / c^{1 / 2}

),

\begin{matrix} k_{g}^{(q B S)} (s; β, x) \\ = & \frac{C_{g}}{{β (x + β c)}^{1 / 2}} g ({\{\frac{a^{〈 q 〉} (α^{2} (x / β) (s / β))}{α (x / β)} - θ α (x / β)\}}^{2}) A^{〈 q 〉} (α^{2} (x / β) (s / β)) . \end{matrix}

This, together with

β / s \leq (x + β c) / (c s) = β / {c s α^{2} (x / β)}

, yields

u_{β, ν}^{(q B S)} (x) \leq \frac{C_{g}}{c^{ν} {β (x + β c)}^{1 / 2}} sup_{t \geq 0} [g ({\{\frac{a^{〈 q 〉} (t)}{α (x / β)} - θ α (x / β)\}}^{2}) \frac{A^{〈 q 〉} (t)}{t^{ν}}], ν \geq 0 .

It suffices to bound

t^{- ν} A^{〈 q 〉} (t) = (1 / 2) t^{- ν} (t^{q - 1} + t^{- (q + 1)})

,

q \geq 0

, in the same manner as Kakizawa (2018). □

Appendix A.2. Auxiliary Lemmas

We mention (without proof) the following basic lemma; (ii) is a slight modification of Kakizawa (2021):

Lemma A1.

(i)Let g be a twice continuously differentiable function on

[0, \infty)

, where

g, g^{'}

, and

g^{''}

are bounded; besides,

g^{''}

is Hölder-continuous with exponent

0 < η \leq 1

. Under AssumptionA1.1–2, given constants

c_{L} > 0

and

0 < τ < 1

, we have, for all sufficiently small

β > 0

,

\begin{matrix} 1 . sup_{0 \leq x \leq c_{L} β^{τ}} | B_{β}^{〈 k, g 〉} (x) | = O (β^{τ}); \\ 2 . B_{β}^{〈 k, g 〉} (0) = β g^{'} (0) \int_{0}^{\infty} u p (u; 0) d u + O (β^{2}); \\ 3 . f o r x \geq c_{L} β^{τ}, B_{β}^{〈 k, g 〉} (x) = β B_{g} (x) + E_{β} (x), \end{matrix}

with

| E_{β} (x) | \leq M_{g} ω_{β, η} (x)

, where

M_{g} > 0

is a constant, independent of β and x.

(ii)Let g be a bounded and Hölder-continuous function (with exponent

0 < η^{'} \leq 1

) on

[0, \infty)

. Under AssumptionsA1andA2(

ν = 0

), given constants

c_{L} > 0

and

0 < τ < 1

, we have, for all sufficiently small

β > 0

,

\begin{matrix} 1 . J_{β}^{〈 k^{2}, g 〉} (0) = β^{- 1} g (0) \int_{0}^{\infty} p^{2} (u; 0) d u + O (β^{η^{'} - 1}); \\ 2 . f o r x \geq c_{L} β^{τ}, J_{β}^{〈 k^{2}, g 〉} (x) = β^{- 1 / 2} V_{g} (x) + E_{β}^{'} (x), \end{matrix}

with

| E_{β}^{'} (x) | \leq M_{g}^{'} {ω_{β, η^{'}} (x; g) + 1}

, where

M_{g}^{'} > 0

is a constant, independent of β and x.

We can verify that, according to, e.g., Igarashi and Kakizawa (2020) and Kakizawa (2021),

\int_{0}^{\infty} {B_{β}^{〈 k, g 〉} (x)}^{2} d x = β^{2} I^{B_{g}^{2}} + o (β^{2}) a n d \int_{0}^{\infty} J_{β}^{〈 k^{2}, g 〉} (x) d x = β^{- 1 / 2} I^{V_{g}} + o (β^{- 1 / 2})

if

\int_{0}^{\infty} t^{2 {3 / min (η, η^{'}) + 1} + δ_{0}} g (t) d t

exists for some constant

δ_{0} > 0

, as follows:

1.: (i) For $2 / 3 < τ < 1$ , $\int_{0}^{β^{τ}} {B_{β}^{〈 k, g 〉} (x)}^{2} d x = O (β^{3 τ}) = o (β^{2})$ using Lemma A1(i.1).

(ii) Take a constant $0 < τ^{'} < η / (3 + η)$ ( $\leq 1 / 4$ ). Using Lemma A1(i.3), we have, for any $0 < τ < 1$ ,

$\begin{matrix} |\int_{β^{τ}}^{β^{- τ^{'}}} {B_{β}^{〈 k, g 〉} (x)}^{2} d x - β^{2} I^{B_{g}^{2}}| \\ \leq β^{2} (\int_{0}^{β^{τ}} + \int_{β^{- τ^{'}}}^{\infty}) B_{g}^{2} (x) d x + 2 β {[I^{B_{g}^{2}} \int_{β^{τ}}^{β^{- τ^{'}}} E_{β}^{2} (x) d x]}^{1 / 2} + \int_{β^{τ}}^{β^{- τ^{'}}} E_{β}^{2} (x) d x \\ = o (β^{2}) . \end{matrix}$

(iii) Under A3( $H = 6 / η + 1 + δ_{0}$ ), we have, for $2 / (H + 1) < τ^{'} < η / (3 + η)$ ,

$\begin{matrix} \int_{β^{- τ^{'}}}^{\infty} {B_{β}^{〈 k, g 〉} (x)}^{2} d x & \leq & {2 | | g | |}_{[0, \infty)} [\int_{0}^{\infty} (\int_{β^{- τ^{'}}}^{\infty} k (s; x, β) g (s) d x) d s + \int_{β^{- τ^{'}}}^{\infty} g (x) d x] \\ = & O (β^{τ^{'} (H + 1)}) = o (β^{2}) . \end{matrix}$
2.: (i) For $τ > 1 / 2$ , $\int_{0}^{β^{τ}} J_{β}^{〈 k^{2}, g 〉} (x) d x \leq L_{K, 0} β^{τ - 1} {| | g | |}_{[0, \infty)} = o (β^{- 1 / 2})$ by A2.1( $ν = 0$ ).

(ii) Take a constant $0 < τ^{'} < η^{'} / (1 + η^{'})$ ( $\leq 1 / 2$ ). Using Lemma A1(ii.2), we have, for any $0 < τ < 1$ ,

$\begin{matrix} |\int_{β^{τ}}^{β^{- τ^{'}}} J_{β}^{〈 k^{2}, g 〉} (x) d x - β^{- 1 / 2} I^{V_{g}}| & \leq & β^{- 1 / 2} (\int_{0}^{β^{τ}} + \int_{β^{- τ^{'}}}^{\infty}) V_{g} (x) d x + \int_{β^{τ}}^{β^{- τ^{'}}} | E_{β}^{'} (x) | d x \\ = & o (β^{- 1 / 2}) . \end{matrix}$

(iii) Under A3( $H = 6 / η^{'} + 1 + δ_{0}$ ), we have, for $1 / {2 (H + 1)} < τ^{'} < η^{'} / (1 + η^{'})$ ,

$\begin{matrix} \int_{β^{- τ^{'}}}^{\infty} J_{β}^{〈 k^{2}, g 〉} (x) d x & \leq & L_{K, 0} β^{- 1} \int_{0}^{\infty} (\int_{β^{- τ^{'}}}^{\infty} k (s; x, β) g (s) d x) d s (b y A 2.1 (ν = 0)) \\ = & O (β^{{τ^{'} (H + 1) - 1 / 2} - 1 / 2}) = o (β^{- 1 / 2}) . \end{matrix}$

Appendix B

Define

ζ_{ϵ} = \frac{1}{n} \sum_{i = 1}^{n} μ {(Y_{i} + ϵ)}^{- 1} + μ ϵ - 1 .

Then,

{\hat{f}}_{β, ϵ} (x) = \frac{f (x) + ζ_{β} (x)}{1 + ζ_{ϵ}}, w i t h ζ_{β} (x) = \frac{1}{n} \sum_{i = 1}^{n} μ Y_{i}^{- 1} k (Y_{i}; β, x) - f (x) .

(A1)

Also,

{\tilde{f}}_{β, ϵ} (x) = \frac{f (x) + (μ / x) ζ_{β}^{†} (x)}{1 + ζ_{ϵ}}, w i t h ζ_{β}^{†} (x) = \frac{1}{n} \sum_{i = 1}^{n} k (Y_{i}; β, x) - f_{LB} (x) .

(A2)

Appendix B.1. Some basic results for ζ ϵ

For ease of reference, we first mention the tail probability/moment bounds of

ζ_{ϵ}

. Rewrite

ζ_{ϵ} = {\bar{Δ}}_{ϵ} + δ_{ϵ},

(A3)

where

δ_{ϵ} = μ E [{(Y + ϵ)}^{- 1}] + μ ϵ - 1

, and

{\bar{Δ}}_{ϵ}

is the average of zero-mean independent random variables

Δ_{i, ϵ} = μ {(Y_{i} + ϵ)}^{- 1} - μ E [{(Y_{i} + ϵ)}^{- 1}], i = 1, \dots, n,

with

| Δ_{i, ϵ} | \leq μ ϵ^{- 1}

,

V [Δ_{i, ϵ}] \leq μ^{2} E [{(Y + ϵ)}^{- 2}] \leq μ ϵ^{- 1}

(we also have

V [Δ_{i, ϵ}] \leq μ^{2} E [Y^{- 2}]

). Then, Bernstein’s inequality yields the exponential bound of the tail probability

P [| {\bar{Δ}}_{ϵ} | \geq t] \leq 2 exp \{- \frac{n ϵ t^{2}}{2 (1 + t / 3) μ}\} f o r a l l t > 0

(A4)

(hence,

{\bar{Δ}}_{ϵ} \overset{a . s .}{⟶} 0

if

(n ϵ) / log n \to \infty

).

Suppose that

E [Y^{- 2}]

exists. Using

μ {(Y + ϵ)}^{- 1} - μ Y^{- 1} = - μ ϵ Y^{- 1} {(Y + ϵ)}^{- 1}

, we have

| δ_{ϵ} | \leq μ ϵ (1 + E [Y^{- 2}]) .

(A5)

Furthermore, it is easy to see that

\begin{matrix} V [{\bar{Δ}}_{ϵ}] & = & n^{- 1} μ^{2} V [{(Y + ϵ)}^{- 1}] \leq n^{- 1} μ^{2} E [Y^{- 2}], \\ E [{\bar{Δ}}_{ϵ}^{4}] & = & \frac{1}{n^{3}} E [Δ_{1, ϵ}^{4}] + \frac{3 (n - 1)}{n^{3}} {(V [Δ_{1, ϵ}])}^{2} \leq μ^{4} \{{(n ϵ^{2})}^{- 1} + 3 E [Y^{- 2}]\} n^{- 2} E [Y^{- 2}] . \end{matrix}

It follows that

E [ζ_{ϵ}^{2}] = δ_{ϵ}^{2} + V [{\bar{Δ}}_{ϵ}] = O (n^{- 1}), E [ζ_{ϵ}^{4}] \leq 8 (δ_{ϵ}^{4} + E [{\bar{Δ}}_{ϵ}^{4}]) = O (n^{- 2}) .

(A6)

Appendix B.2. Some preliminary results for ζ β (x)

We next list some facts about

ζ_{β} (x)

, including the tail probability/moment bounds and asymptotic normality. Rewrite

ζ_{β} (x) = {\bar{Δ}}_{β} (x) + B_{β}^{〈 k, f 〉} (x),

(A7)

where

{\bar{Δ}}_{β} (x)

is the average of zero-mean independent random variables

Δ_{i, β} (x) = μ Y_{i}^{- 1} k (Y_{i}; β, x) - μ E [Y_{i}^{- 1} k (Y_{i}; β, x)], i = 1, \dots, n,

with

| Δ_{i, β} (x) | \leq β^{- 1} u_{β, 1} (x) μ

,

V [Δ_{i, β} (x)] \leq μ^{2} E [{Y^{- 1} k (Y; β, x)}^{2}] \leq β^{- 1} u_{β, 1} (x) μ J_{β}^{〈 k, f 〉} (x)

(we also have

V [Δ_{i, β} (x)] \leq u_{β, 0} (x) μ J_{β}^{〈 k, f_{- 1} 〉} (x)

). Assumption A2.1(

ν = 1

) and Bernstein’s inequality yield the exponential bound of the tail probability

P [| {\bar{Δ}}_{β} (x) | \geq t] \leq 2 exp \{- \frac{n β^{2} t^{2}}{{2 (| | f | |}_{[0, \infty)} + t / 3) L_{K, 1} μ}\} f o r a l l t > 0

(A8)

(hence,

{\bar{Δ}}_{β} (x) \overset{a . s .}{⟶} 0

if

(n β^{2}) / log n \to \infty

, which is implied by, e.g.,

β \propto n^{- ι}

for some constant

0 < ι < 1 / 2

). On the other hand, F(iii

^{'}

), A1.3, and A2.2(

ν = 0

) imply that, for fixed

x > 0

(assume

f (x) > 0

), we have, under B(

0 < ι < min {2 q / (2 + q), 1}

) (note that

ι = 2 / 5

is feasible when F(iii

^{'}

) holds for some

q > 1 / 2

),

\begin{matrix} \frac{1}{n^{2 + q}} \sum_{i = 1}^{n} \frac{E [| Δ_{i, β} (x) |^{2 + q}]}{{(V [{\bar{Δ}}_{β} (x)])}^{1 + q / 2}} & \leq & μ^{1 + q} μ_{- (1 + q)}^{'} [\frac{n {2 L_{K, 0}^{'} {(β x)}^{- 1 / 2}}^{2 + q}}{{(n^{2} V [{\bar{Δ}}_{β} (x)])}^{1 + q / 2}}] \\ = & O (n^{- q / 2} β^{- (1 + q / 2) / 2}) = o (1), \end{matrix}

hence,

\frac{{\bar{Δ}}_{β} (x)}{{(V [{\bar{Δ}}_{β} (x)])}^{1 / 2}} \overset{d}{⟶} N (0, 1), i . e ., {(n β^{1 / 2})}^{1 / 2} {\bar{Δ}}_{β} (x) \overset{d}{⟶} N (0, μ V_{f_{- 1}} (x))

(A9)

using Lyapunov’s theorem (for triangular arrays), together with

n β^{1 / 2} V [{\bar{Δ}}_{β} (x)] \to μ V_{f_{- 1}} (x)

for fixed

x > 0

, which will be shown in (A18) below. Similarly, F(iii

^{'}

), A1.3, and A2.1(

ν = 0

) imply that, for the case

f_{- 1} (0) > 0

, we have, under B(

0 < ι < q / (2 + q)

) (note that

ι = 1 / 3

is feasible when F(iii

^{'}

) holds for some

q > 1

(in this case,

f_{- 1} (0) = 0

), i.e.,

ι = 1 / 3

is unfortunately infeasible for

0 < q \leq 1

),

\begin{matrix} \frac{1}{n^{2 + q}} \sum_{i = 1}^{n} \frac{E [| Δ_{i, β} (0) |^{2 + q}]}{{(V [{\bar{Δ}}_{β} (0)])}^{1 + q / 2}} & \leq & μ^{1 + q} μ_{- (1 + q)}^{'} [\frac{n {(2 L_{K, 0} β^{- 1})}^{2 + q}}{{(n^{2} V [{\bar{Δ}}_{β} (0)])}^{1 + q / 2}}] \\ = & O (n^{- q / 2} β^{- (1 + q / 2)}) = o (1), \end{matrix}

hence,

\frac{{\bar{Δ}}_{β} (0)}{{(V [{\bar{Δ}}_{β} (0)])}^{1 / 2}} \overset{d}{⟶} N (0, 1), i . e ., {(n β)}^{1 / 2} {\bar{Δ}}_{β} (0) \overset{d}{⟶} N (0, μ f_{- 1} (0) \int_{0}^{\infty} p^{2} (u; 0) d u)

(A10)

using Lyapunov’s theorem, together with

n β V [{\bar{Δ}}_{β} (0)] \to μ f_{- 1} (0) \int_{0}^{\infty} p^{2} (u; 0) d u

, which will be shown in (A14) below.

It is easy to see that

\begin{matrix} V [{\bar{Δ}}_{β} (x)] & = & n^{- 1} [μ^{2} E [{Y^{- 1} k (Y; β, x)}^{2}] - {μ E [Y^{- 1} k (Y; β, x)]}^{2}] \\ = & n^{- 1} [μ J_{β}^{〈 k^{2}, f_{- 1} 〉} (x) - {J_{β}^{〈 k, f 〉} (x)}^{2}] (\leq n^{- 1} μ J_{β}^{〈 k^{2}, f_{- 1} 〉} (x)), \\ E [ζ_{β}^{2} (x)] & = & {B_{β}^{〈 k, f 〉} (x)}^{2} + V [{\bar{Δ}}_{β} (x)] \\ \leq & {B_{β}^{〈 k, f 〉} (x)}^{2} + n^{- 1} μ J_{β}^{〈 k^{2}, f_{- 1} 〉} (x) = D_{β} (x) (s a y), \\ E [{\bar{Δ}}_{β}^{4} (x)] & = & \frac{1}{n^{3}} E [Δ_{1, β}^{4} (x)] + \frac{3 (n - 1)}{n^{3}} {(V [Δ_{1, β} (x)])}^{2} \\ \leq & {{(n β)}^{- 1} L_{K, 1} β^{- 1} μ}^{2} D_{β} (x) + 3 D_{β}^{2} (x) (b y A 2.1 (ν = 1)) . \end{matrix}

(A11)

In addition to

{sup}_{x \geq 0} | B_{β}^{〈 k, f 〉} {(x) | \leq 2 | | f | |}_{[0, \infty)}

,

\begin{matrix} sup_{x \geq 0} V [{\bar{Δ}}_{β} (x)] & \leq & n^{- 1} L_{K, 0} β^{- 1} μ | | f_{- 1} {| |}_{[0, \infty)} (b y A 2.1 (ν = 0)), \end{matrix}

(A12)

\begin{matrix} B_{β}^{〈 k, f 〉} (0) & = & β f^{'} (0) \int_{0}^{\infty} u p (u; 0) d u + O (β^{2}) (b y L e m m a A 1 (i . 2)), \\ V [{\bar{Δ}}_{β} (0)] & = & n^{- 1} β^{- 1} μ f_{- 1} (0) \int_{0}^{\infty} p^{2} (u; 0) d u \end{matrix}

(A13)

\begin{matrix} + O (n^{- 1} β^{η^{'} - 1} + n^{- 1}) (b y L e m m a A 1 (i i . 1)), \end{matrix}

(A14)

Lemma A1 implies that, given a constant

0 < τ < 1

,

sup_{0 \leq x \leq β^{τ}} | B_{β}^{〈 k, f 〉} (x) | = O (β^{τ})

(A15)

(obviously,

\int_{0}^{β^{τ}} {B_{β}^{〈 k, f 〉} (x)}^{2} d x = O (β^{3 τ})

and

\int_{0}^{β^{τ}} V [{\bar{Δ}}_{β} (x)] d x = O (n^{- 1} β^{τ - 1})

), and that, for

x \geq β^{τ}

,

| B_{β}^{〈 k, f 〉} (x) - β B_{f} (x) | \leq M_{f} ω_{β, η} (x),

(A16)

| J_{β}^{〈 k^{2}, f_{- 1} 〉} (x) - β^{- 1 / 2} V_{f_{- 1}} (x) | \leq M_{f_{- 1}}^{'} {ω_{β, η^{'}} (x; f_{- 1}) + 1},

(A17)

| V [{\bar{Δ}}_{β} (x)] - n^{- 1} β^{- 1 / 2} μ V_{f_{- 1}} (x) | \leq n^{- 1} [M_{f_{- 1}}^{'} μ {ω_{β, η^{'}} (x; f_{- 1}) + 1} + {| | f | |}_{[0, \infty)}^{2}] .

(A18)

Remark A1.

If

\int_{0}^{\infty} t^{2 {3 / min (η, η^{'}) + 1} + δ_{0}} f (t) d t

exists for some constant

δ_{0} > 0

(this ensures

\int_{0}^{\infty} t^{2 {3 / min (η, η^{'}) + 1} + δ_{0}} f_{- 1} (t) d t \leq {\int_{0}^{\infty} t^{2 {3 / min (η, η^{'}) + 1} + δ_{0}} f (t) d t}^{\frac{6 / min (η, η^{'}) + 1 + δ_{0}}{2 {3 / min (η, η^{'}) + 1} + δ_{0}}}

exists), in line with, e.g., Kakizawa (2021), Assumption A3(

H = 6 / min (η, η^{'}) + 1 + δ_{0}

) about the behavior of the tail integral of

k (s; β, x)

with respect to x is crucial for proving the negligibility of the integral

\int_{β^{- τ^{'}}}^{\infty} E [ζ_{β}^{2} (x)] d x

(

\leq \int_{β^{- τ^{'}}}^{\infty} D_{β} (x) d x

), as follows: We have, for any constant

τ^{'} > 2 / (H + 1)

,

\begin{matrix} \int_{β^{- τ^{'}}}^{\infty} D_{β} (x) d x & {\leq 2 | | f | |}_{[0, \infty)} \{\int_{0}^{\infty} (\int_{β^{- τ^{'}}}^{\infty} k (s; x, β) f (s) d x) d s + \int_{β^{- τ^{'}}}^{\infty} f (x) d x\} \\ + n^{- 1} L_{K, 0} β^{- 1} μ \int_{0}^{\infty} (\int_{β^{- τ^{'}}}^{\infty} k (s; x, β) f_{- 1} (s) d x) d s (b y A 2.1 (ν = 0)) \\ = O (β^{τ^{'} (H + 1)} (1 + n^{- 1} β^{- 1})) = o (β^{2} + n^{- 1} β^{- 1 / 2}) . \end{matrix}

Then, the approximation

\int_{0}^{\infty} E [ζ_{β}^{2} (x)] d x = β^{2} I^{B_{f}^{2}} + n^{- 1} β^{- 1 / 2} μ I^{V_{f_{- 1}}} + o (β^{2} + n^{- 1} β^{- 1 / 2})

can be verified, since, by taking

2 / (H + 1) < τ^{'} < min {η / (3 + η), η^{'} / (1 + η^{'})}

,

\begin{matrix} \int_{0}^{β^{- τ^{'}}} E [ζ_{β}^{2} (x)] d x & = O (β^{3 τ} + n^{- 1} β^{τ - 1}) + \int_{β^{τ}}^{β^{- τ^{'}}} [D_{β} (x) - n^{- 1} {J_{β}^{〈 k, f 〉} (x)}^{2}] d x \\ = β^{2} I^{B_{f}^{2}} + n^{- 1} β^{- 1 / 2} μ I^{V_{f_{- 1}}} + o (β^{2} + n^{- 1} β^{- 1 / 2}) \end{matrix}

(we also take

2 / 3 < τ < 1

; see the argument after Lemma A1).

Appendix B.3. Some Preliminary Results for ζ β † (x)

We here list some facts about

ζ_{β}^{†} (x)

, including the tail probability/moment bounds and asymptotic normality. Rewrite

ζ_{β}^{†} (x) = {\bar{Δ^{†}}}_{β} (x) + B_{β}^{〈 k, f_{LB} 〉} (x),

(A19)

where

{\bar{Δ^{†}}}_{β} (x)

is the average of zero-mean independent random variables

Δ_{i, β}^{†} (x) = k (Y_{i}; β, x) - E [k (Y_{i}; β, x)], i = 1, \dots, n,

with

| Δ_{i, β}^{†} (x) | \leq u_{β, 0} (x)

,

V [Δ_{i, β}^{†} (x)] \leq E [k^{2} (Y; β, x)] \leq u_{β, 0} (x) J_{β}^{〈 k, f_{LB} 〉} (x)

. As in, e.g., Igarashi and Kakizawa (2020) and Kakizawa (2021), by Assumption A2.1(

ν = 0

), an application of Bernstein’s inequality yields the exponential bound of the tail probability

P [| {\bar{Δ^{†}}}_{β} (x) | \geq t] \leq 2 exp \{- \frac{n β t^{2}}{2 (| | f_{LB} {| |}_{[0, \infty)} + t / 3) L_{K, 0}}\} f o r a l l t > 0

(A20)

(hence,

{\bar{Δ^{†}}}_{β} (x) \overset{a . s .}{⟶} 0

if

(n β) / log n \to \infty

), whereas, if

β \to 0

and

n β^{1 / 2} \to \infty

, then, A2.2(

ν = 0

) implies that, for fixed

x > 0

(assume

f_{LB} (x) > 0

),

\frac{1}{n^{2 + p}} \sum_{i = 1}^{n} \frac{E [| Δ_{i, β}^{†} (x) |^{2 + p}]}{{(V [{\bar{Δ^{†}}}_{β} (x)])}^{1 + p / 2}} = {\{\frac{{(L_{K, 0}^{'})}^{2} {(β x)}^{- 1}}{n (n V [{\bar{Δ^{†}}}_{β} (x)])}\}}^{p / 2} = O ({(n β^{1 / 2})}^{- p / 2}) f o r a n y p > 0,

hence,

\frac{{\bar{Δ^{†}}}_{β} (x)}{{(V [{\bar{Δ^{†}}}_{β} (x)])}^{1 / 2}} \overset{d}{⟶} N (0, 1), i . e ., {(n β^{1 / 2})}^{1 / 2} \frac{μ}{x} {\bar{Δ^{†}}}_{β} (x) \overset{d}{⟶} N (0, μ V_{f_{- 1}} (x))

(A21)

using Lyapunov’s theorem, together with

n β^{1 / 2} {(μ / x)}^{2} V [{\bar{Δ^{†}}}_{β} (x)] \to μ V_{f_{- 1}} (x)

for fixed

x > 0

, which will be shown in (A26) below.

It is easy to see that

\begin{matrix} V [{\bar{Δ}}_{β}^{†} (x)] & = n^{- 1} [J_{β}^{〈 k^{2}, f_{LB} 〉} (x) - {J_{β}^{〈 k, f_{LB} 〉} (x)}^{2}] (\leq n^{- 1} J_{β}^{〈 k^{2}, f_{LB} 〉} (x)), \\ E [{ζ_{β}^{†} (x)}^{2}] & = {B_{β}^{〈 k, f_{LB} 〉} (x)}^{2} + V [{\bar{Δ^{†}}}_{β} (x)] \\ \leq {B_{β}^{〈 k, f_{LB} 〉} (x)}^{2} + n^{- 1} J_{β}^{〈 k^{2}, f_{LB} 〉} (x) = D_{β}^{†} (x) (s a y), \end{matrix}

E [{{\bar{Δ^{†}}}_{β} (x)}^{4}] \leq \frac{1}{n^{3}} E [{Δ_{1, β}^{†} (x)}^{4}] + \frac{3 (n - 1)}{n^{3}} {(V [Δ_{1, β}^{†} (x)])}^{2} \leq {(n^{- 1} L_{K, 0} β^{- 1})}^{2} D_{β}^{†} (x) + 3 {D_{β}^{†} (x)}^{2} (b y A 2.1 (ν = 0)) .

(A22)

In addition to

{sup}_{x \geq 0} | B_{β}^{〈 k, f_{LB} 〉} (x) | \leq 2 | | f_{LB} {| |}_{[0, \infty)}

and

V [{\bar{Δ}}_{β}^{†} (x)] \leq n^{- 1} L_{K, 0} β^{- 1} | | f_{LB} {| |}_{[0, \infty)} (b y A 2.1 (ν = 0)),

(A23)

Lemma A1 implies that, given a constant

0 < τ < 1

, we have, for

x \geq β^{τ}

,

\begin{matrix} |\frac{μ}{x} B_{β}^{〈 k, f_{LB} 〉} (x) - β B (x)| & \leq & M_{f_{LB}} \frac{μ}{x} ω_{β, η^{''}} (x), \end{matrix}

(A24)

|{(\frac{μ}{x})}^{2} J_{β}^{〈 k^{2}, f_{LB} 〉} (x) - β^{- 1 / 2} μ V_{f_{- 1}} (x)| \leq M_{f_{LB}}^{'} \{μ ω_{β, 1} (x; f_{- 1}) + {(\frac{μ}{x})}^{2}\},

(A25)

\begin{matrix} |{(\frac{μ}{x})}^{2} V [{\bar{Δ^{†}}}_{β} (x)] - n^{- 1} β^{- 1 / 2} μ V_{f_{- 1}} (x)| \leq n^{- 1} [M_{f_{LB}}^{'} μ ω_{β, 1} (x; f_{- 1}) \\ + {(\frac{μ}{x})}^{2} {M_{f_{LB}}^{'} + | | f_{LB} {| |}_{[0, \infty)}^{2}}] \\ \leq n^{- 1} M_{f_{LB}}^{''} {ω_{β, 1} (x; f_{- 1}) + x^{- 2}}, \end{matrix}

(A26)

where

M_{f_{LB}}^{''} = M_{f_{LB}}^{'} μ + (M_{f_{LB}}^{'} + | | f_{LB} {| |}_{[0, \infty)}^{2}) μ^{2}

.

Remark A2.

As mentioned in Theorem 8, we take an arbitrary constant

0 < τ < 1 / 2

and consider a weighted MISE, with a weight function

w (t) = χ_{[β^{τ}, \infty)} (t)

(say). In line with, e.g., Kakizawa (2021), if

\int_{0}^{\infty} t^{2 (1 + η^{''}) / η^{''} + δ_{0}} f_{LB} (t) d t

exists for some constant

δ_{0} > 0

, Assumption A3(

H = 2 / η^{''} + 1 + δ_{0}

), together with

{sup}_{x \geq β^{- τ^{'}}} (μ / x) \leq 1

(say) for any constant

τ^{'} > 0

, is crucial for proving the negligibility of

\int_{β^{- τ^{'}}}^{\infty} {(μ / x)}^{2} E [{ζ_{β}^{†} (x)}^{2}] d x

(

\leq \int_{β^{- τ^{'}}}^{\infty} D_{β}^{†} (x) d x

), as follows: We have, for any constant

2 / (H + 1) < τ^{'} < η^{''} / (1 + η^{''})

,

\begin{matrix} \int_{β^{- τ^{'}}}^{\infty} D_{β}^{†} (x) d x & \leq 2 | | f_{LB} {| |}_{[0, \infty)} \{\int_{0}^{\infty} (\int_{β^{- τ^{'}}}^{\infty} k (s; x, β) f_{LB} (s) d x) d s + \int_{β^{- τ^{'}}}^{\infty} f_{LB} (x) d x\} \\ + n^{- 1} L_{K, 0} β^{- 1} \int_{0}^{\infty} (\int_{β^{- τ^{'}}}^{\infty} k (s; x, β) f_{LB} (s) d x) d s (b y A 2.1 (ν = 0)) \\ = O (β^{τ^{'} (H + 1)} (1 + n^{- 1} β^{- 1})) = o (β^{2} + n^{- 1} β^{- 1 / 2}) . \end{matrix}

We can verify that

\int_{β^{τ}}^{\infty} {(\frac{μ}{x})}^{2} E [{ζ_{β}^{†} (x)}^{2}] d x = β^{2} I^{B^{2}} + n^{- 1} β^{- 1 / 2} μ I^{V_{f_{- 1}}} + o (β^{2} + n^{- 1} β^{- 1 / 2})

for every

0 < τ < 1 / 2

, since

\begin{matrix} |\int_{β^{τ}}^{β^{- τ^{'}}} {\{\frac{μ}{x} B_{β}^{〈 k, f_{LB} 〉} (x)\}}^{2} d x - β^{2} I^{B^{2}}| \\ \leq β^{2} (\int_{0}^{β^{τ}} + \int_{β^{- τ^{'}}}^{\infty}) B^{2} (x) d x \\ + 2 β M_{f_{LB}} {[I^{B^{2}} \int_{β^{τ}}^{β^{- τ^{'}}} {(\frac{μ}{x})}^{2} ω_{β, η^{''}}^{2} (x) d x]}^{1 / 2} + M_{f_{LB}}^{2} \int_{β^{τ}}^{β^{- τ^{'}}} {(\frac{μ}{x})}^{2} ω_{β, η^{''}}^{2} (x) d x \\ = o (β^{2}), \\ |\int_{β^{τ}}^{β^{- τ^{'}}} {(\frac{μ}{x})}^{2} V [{\bar{Δ^{†}}}_{β} (x)] d x - n^{- 1} β^{- 1 / 2} μ I^{V_{f_{- 1}}}| \\ \leq n^{- 1} β^{- 1 / 2} μ (\int_{0}^{β^{τ}} + \int_{β^{- τ^{'}}}^{\infty}) V_{f_{- 1}} (x) d x + M_{f_{LB}}^{''} \int_{β^{τ}}^{β^{- τ^{'}}} {ω_{β, 1} (x; f_{- 1}) + x^{- 2}} d x \\ = o (n^{- 1} β^{- 1 / 2}) . \end{matrix}

Appendix B.4. Proofs of Main Results

Before proving Theorems 1–4, we prepare two lemmas (Lemmas A2 and A3):

Lemma A2.

Suppose that

E [Y^{- 2}]

exists. Then,

E [ζ_{β}^{2} (x) ζ_{ϵ}^{2}] \leq M_{1} n^{- 1} D_{β} (x),

where

M_{1} > 0

is a constant, independent of

n, β

, and x.

Proof.

It is easy to see that

\begin{matrix} E [{\bar{Δ}}_{β}^{2} (x) {\bar{Δ}}_{ϵ}^{2}] & = \frac{1}{n^{3}} E [Δ_{1, β}^{2} (x) Δ_{1, ϵ}^{2}] + \frac{n - 1}{n^{3}} [V [Δ_{1, β} (x)] V [Δ_{1, ϵ}] + 2 {C o v [Δ_{1, β} (x), Δ_{1, ϵ}]}^{2}] \\ \leq \frac{1}{n^{3}} E [Δ_{1, β}^{2} (x) Δ_{1, ϵ}^{2}] + \frac{3}{n^{2}} μ J_{β}^{〈 k^{2}, f_{- 1} 〉} (x) μ^{2} E [Y^{- 2}], \end{matrix}

where

\begin{matrix} E [Δ_{1, β}^{2} (x) Δ_{1, ϵ}^{2}] & \leq 4 [μ^{4} E [{(Y + ϵ)}^{- 2} Y^{- 2} k^{2} (Y; β, x)] + 3 μ^{2} E [Y^{- 2} k^{2} (Y; β, x)] μ^{2} E [{(Y + ϵ)}^{- 2}]] \\ \leq 4 μ^{3} (ϵ^{- 2} + 3 E [Y^{- 2}]) J_{β}^{〈 k^{2}, f_{- 1} 〉} (x) . \end{matrix}

The result follows from

E [ζ_{β}^{2} (x) ζ_{ϵ}^{2}] \leq 4 {E [{\bar{Δ}}_{β}^{2} (x) {\bar{Δ}}_{ϵ}^{2}] + V [{\bar{Δ}}_{β} (x)] δ_{ϵ}^{2}} + 2 {B_{β}^{〈 k, f 〉} (x)}^{2} E [ζ_{ϵ}^{2}] .

□

Now, we rewrite (A1) as

{\hat{f}}_{β, ϵ} (x) = f (x) (1 - ζ_{ϵ} + \frac{ζ_{ϵ}^{2}}{1 + ζ_{ϵ}}) + ζ_{β} (x) (1 - \frac{ζ_{ϵ}}{1 + ζ_{ϵ}}) = f (x) + L_{β, ϵ} (x) + R_{β, ϵ} (x),

(A27)

where

L_{β, ϵ} (x) = ζ_{β} (x) - f (x) ζ_{ϵ}, R_{β, ϵ} (x) = f (x) \frac{ζ_{ϵ}^{2}}{1 + ζ_{ϵ}} - ζ_{β} (x) \frac{ζ_{ϵ}}{1 + ζ_{ϵ}} .

We need to evaluate

E [R_{β, ϵ}^{2} (x)]

.

Lemma A3.

Suppose that AssumptionsA2.1(

ν = 1

)andBhold, and that

E [Y^{- 2}]

exists. Under the boundedness of f, we have

E [R_{β, ϵ}^{2} (x)] \leq M [4 n^{- 2} + n^{- 1} D_{β} (x)],

where

M > 0

is a constant, independent of

n, β

, and x

Proof.

Considering an event

S_{n} = {Y_{n} : | ζ_{ϵ} | < 1 / 2}

(say), we have

\begin{matrix} | R_{β, ϵ} (x) | χ_{S_{n}} & \leq 2 {f (x) ζ_{ϵ}^{2} + | ζ_{β} (x) ζ_{ϵ} |} χ_{S_{n}}, \\ | R_{β, ϵ} (x) | (1 - χ_{S_{n}}) & \leq [{(μ ϵ)}^{- 1} {f (x) + | ζ_{β} (x) {} + f (x) + | f (x) | ζ_{ϵ} | + | ζ_{β} (x) |}] (1 - χ_{S_{n}}) \\ \leq [{(μ ϵ)}^{- 1} {f (x) + | ζ_{β} (x) |} + 2 {3 f (x) ζ_{ϵ}^{2} + | ζ_{β} (x) ζ_{ϵ} |}] (1 - χ_{S_{n}}) . \end{matrix}

Using

{Y_{n} : | ζ_{ϵ} | \geq 1 / 2} \subset {Y_{n} : | {\bar{Δ}}_{ϵ} | \geq 1 / 4}

(say) for all sufficiently large n, it can be shown that

\begin{matrix} E [R_{β, ϵ}^{2} (x)] \\ \leq 16 [9 f^{2} (x) E [ζ_{ϵ}^{4}] + E [ζ_{β}^{2} (x) ζ_{ϵ}^{2}]] \\ + 4 {(μ ϵ)}^{- 2} [{f^{2} (x) + D_{β} (x)} P [| {\bar{Δ}}_{ϵ} | \geq 1 / 4] + {E [{\bar{Δ}}_{β}^{4} (x)] P [| {\bar{Δ}}_{ϵ} {| \geq 1 / 4]}}^{1 / 2}] . \end{matrix}

Then, the result follows from (A4), (A6), (A12), and Lemma A2, under Assumption B (recall that

ϵ = C^{'} n^{- 1 / 2}

; in this case,

P [| {\bar{Δ}}_{ϵ} | \geq 1 / 4] \leq 2 e^{- ϱ n^{1 / 2}}

, where

ϱ > 0

is a constant, independent of n). □

We are ready to prove Theorems 1–4.

Proof (Proof of Theorem 1)

We start with

\begin{matrix} E [{\hat{f}}_{β, ϵ} (x)] & = & f (x) + B_{β}^{〈 k, f 〉} (x) - f (x) δ_{ϵ} + E [R_{β, ϵ} (x)], \end{matrix}

(A28)

{\hat{f}}_{β, ϵ} (x) - E [{\hat{f}}_{β, ϵ} (x)] = {\bar{Δ}}_{β} (x) - f (x) {\bar{Δ}}_{ϵ} + R_{β, ϵ} (x) - E [R_{β, ϵ} (x)],

(A29)

where

V [{\hat{f}}_{β, ϵ} (x)] = V [{\bar{Δ}}_{β} (x)] + 2 C o v [{\bar{Δ}}_{β} (x), - f (x) {\bar{Δ}}_{ϵ} + R_{β, ϵ} (x)] + V [- f (x) {\bar{Δ}}_{ϵ} + R_{β, ϵ} (x)] .

(A30)

Also,

C o v [{\bar{Δ}}_{β} (x), {\bar{Δ}}_{ϵ}] = n^{- 1} [μ^{2} E [{(Y + ϵ)}^{- 1} Y^{- 1} k (Y; β, x)] - μ E [Y^{- 1} k (Y; β, x)] μ E [{(Y + ϵ)}^{- 1}]],

hence,

| C o v [{\bar{Δ}}_{β} (x), {\bar{Δ}}_{ϵ}] | \leq n^{- 1} [μ J_{β}^{〈 k, f_{- 1} 〉} (x) + J_{β}^{〈 k, f 〉} (x)] \leq n^{- 1} [μ | | f_{- 1} {| |}_{[0, \infty)} + {| | f | |}_{[0, \infty)}] .

It is shown from (A28) and (A30) that

\begin{matrix} |B i a s [{\hat{f}}_{β, ϵ} (x)] - B_{β}^{〈 k, f 〉} (x)| & \leq {| | f | |}_{[0, \infty)} δ_{ϵ} + {E [R_{β, ϵ}^{2} (x)]}^{1 / 2}, \\ |V [{\hat{f}}_{β, ϵ} (x)] - V [{\bar{Δ}}_{β} (x)]| & \leq M_{2} [n^{- 1} + {n^{- 1} μ J_{β}^{〈 k^{2}, f_{- 1} 〉} (x) E [R_{β, ϵ}^{2} (x)]}^{1 / 2} + E [R_{β, ϵ}^{2} (x)]], \end{matrix}

where

M_{2} > 0

is a constant, independent of

n, β

, and x. Using Lemma A3 and

\begin{matrix} {n^{- 1} μ J_{β}^{〈 k^{2}, f_{- 1} 〉} (x) E [R_{β, ϵ}^{2} (x)]}^{1 / 2} & \leq M^{1 / 2} [2 n^{- 1} D_{β}^{1 / 2} (x) + n^{- 1 / 2} D_{β} (x)] \\ \leq M^{1 / 2} [n^{- 1} + (n^{- 1} + n^{- 1 / 2}) D_{β} (x)], \end{matrix}

we have

\begin{matrix} |B i a s [{\hat{f}}_{β, ϵ} (x)] - B_{β}^{〈 k, f 〉} (x)| & \leq & M_{3} [n^{- 1 / 2} + n^{- 1 / 2} | B_{β}^{〈 k, f 〉} (x) |] (a s s u m e n^{- 1} β^{- 1} = o (1)), \\ |V [{\hat{f}}_{β, ϵ} (x)] - V [{\bar{Δ}}_{β} (x)]| & \leq & M_{4} [n^{- 1} + n^{- 1 / 2} D_{β} (x)], \end{matrix}

where

M_{3}, M_{4} > 0

are constants, independent of

n, β

, and x. Then, using (A12)–(A18), the proof is completed. □

Proof (Proof of Theorem 2)

Recall (A1) (also (A3) and (A7)). The strong consistency follows from (A4), (A5), and (A8), together with

B_{β}^{〈 k, f 〉} (x) = O (β)

for fixed

x > 0

(we also have

B_{β}^{〈 k, f 〉} (0) = O (β)

). See Lemma A1(i.2–3). □

Proof (Proof of Theorem 3)

By Lemma A3, we notice that

R_{β, ϵ} (x) - E [R_{β, ϵ} (x)] = o_{p} ({(n β^{1 / 2})}^{- 1 / 2})

for fixed

x > 0

, and that

R_{β, ϵ} (0) - E [R_{β, ϵ} (0)] = o_{p} ({(n β)}^{- 1 / 2})

. Recalling (A29), where

{\bar{Δ}}_{ϵ} = O_{p} (n^{- 1 / 2})

, we have

\begin{matrix} {(n β^{1 / 2})}^{1 / 2} ({\hat{f}}_{β, ϵ} (x) - E [{\hat{f}}_{β, ϵ} (x)]) & = {(n β^{1 / 2})}^{1 / 2} {\bar{Δ}}_{β} (x) + o_{p} (1) f o r f i x e d x > 0, \\ {(n β)}^{1 / 2} ({\hat{f}}_{β, ϵ} (0) - E [{\hat{f}}_{β, ϵ} (0)]) & = {(n β)}^{1 / 2} {\bar{Δ}}_{β} (0) + o_{p} (1) . \end{matrix}

The results (i) and (ii) follow from (A9) and (A10), respectively. □

Proof (Proof of Theorem 4)

In the same way as the argument of Remark , a rigorous derivation of the MISE is made by splitting the integral into three parts (we set

H = 6 / η + 1 + δ_{0}

), as follows:

(i): Theorem 1 yields $\int_{0}^{β^{τ}} M S E [{\hat{f}}_{β, ϵ} (x)] d x = O (β^{3 τ} + n^{- 1} β^{τ - 1}) = o (β^{2} + n^{- 1} β^{- 1 / 2})$ for $2 / 3 < τ < 1$ .
(ii): Take a constant $τ^{'} > 2 / (H + 1)$ . The inequality

$\begin{matrix} ϵ^{- 2} \int_{β^{- τ^{'}}}^{\infty} E [ζ_{β}^{2} (x) ζ_{ϵ}^{2}] d x & \leq & O (n^{- 1}) \int_{0}^{\infty} [\int_{β^{- τ^{'}}}^{\infty} k (s; β, x) {f_{- 1} (s) + f (s)} d x] d s & + O (1) \int_{β^{- τ^{'}}}^{\infty} D_{β} (x) d x (L e m m a A 2), \end{matrix}$

together with (A6), enables us to see that, by (A27),

$\begin{matrix} \int_{β^{- τ^{'}}}^{\infty} E [{{\hat{f}}_{β, ϵ} (x) - f (x)}^{2}] d x & \leq & 4 [\int_{β^{- τ^{'}}}^{\infty} D_{β} (x) d x + {| | f | |}_{[0, \infty)} E [ζ_{ϵ}^{2}] & + {(μ ϵ)}^{- 2} \{{| | f | |}_{[0, \infty)} E [ζ_{ϵ}^{4}] + \int_{β^{- τ^{'}}}^{\infty} E [ζ_{β}^{2} (x) ζ_{ϵ}^{2}] d x\}] & = & o (β^{2} + n^{- 1} β^{- 1 / 2}) (s e e t h e a r g u m e n t i n R e m a r k A 1) . \end{matrix}$
(iii): Taking $2 / 3 < τ < 1$ and $2 / (H + 1) < τ^{'} < min {η / (3 + η), η^{'} / (1 + η^{'})}$ , Theorem 1 yields

$\begin{matrix} |\int_{β^{τ}}^{β^{- τ^{'}}} {B i a s^{2} [{\hat{f}}_{β, ϵ} (x)] + V [{\hat{f}}_{β, ϵ} (x)]} d x - (β^{2} I^{B_{f}^{2}} + n^{- 1} β^{- 1 / 2} μ I^{V_{f_{- 1}}})| \\ \leq β^{2} (\int_{0}^{β^{τ}} + \int_{β^{- τ^{'}}}^{\infty}) B_{f}^{2} (x) d x \\ + 2 β [I^{B_{f}^{2}} \int_{β^{τ}}^{β^{- τ^{'}}} {R_{β}^{Bias} (x)}^{2} d x] 1 / 2 + \int_{β^{τ}}^{β^{- τ^{'}}} {R_{β}^{Bias} (x)}^{2} d x \\ + n^{- 1} β^{- 1 / 2} μ (\int_{0}^{β^{τ}} + \int_{β^{- τ^{'}}}^{\infty}) V_{f_{- 1}} (x) d x + \int_{β^{τ}}^{β^{- τ^{'}}} {R_{β}^{V} (x) | d x \\ = o (β^{2} + n^{- 1} β^{- 1 / 2}) . \end{matrix}$

□

Before proving Theorems 5–8, we prepare two lemmas (Lemmas A4 and A5):

Lemma A4.

Suppose that

E [Y^{- 2}]

exists. Then,

E [{ζ_{β}^{†} (x)}^{2} ζ_{ϵ}^{2}] \leq M_{1}^{†} n^{- 1} D_{β}^{†} (x),

where

M_{1}^{†} > 0

is a constant, independent of

n, β

, and x.

Proof. It is easy to see that

\begin{matrix} E [{{\bar{Δ}}_{β}^{†} (x)}^{2} {\bar{Δ}}_{ϵ}^{2}] & = & \frac{1}{n^{3}} E [{Δ_{1, β}^{†} (x)}^{2} Δ_{1, ϵ}^{2}] + \frac{n - 1}{n^{3}} [V [Δ_{1, β}^{†} (x)] V [Δ_{1, ϵ}] + 2 {C o v [Δ_{1, β}^{†} (x), Δ_{1, ϵ}]}^{2}] & \leq & \frac{1}{n^{3}} E [{Δ_{1, β}^{†} (x)}^{2} Δ_{1, ϵ}^{2}] + \frac{3}{n^{2}} J_{β}^{〈 k^{2}, f_{LB} 〉} (x) μ^{2} E [Y^{- 2}], \end{matrix}

where

\begin{matrix} E [{Δ_{1, β}^{†} (x)}^{2} Δ_{1, ϵ}^{2}] & \leq & 4 [μ^{2} E [{(Y + ϵ)}^{- 2} k^{2} (Y; β, x)] + 3 E [k^{2} (Y; β, x)] μ^{2} E [{(Y + ϵ)}^{- 2}]] & \leq & 4 μ^{2} (ϵ^{- 2} + 3 E [Y^{- 2}]) J_{β}^{〈 k^{2}, f_{LB} 〉} (x) . \end{matrix}

The result follows from

E [{ζ_{β}^{†} (x)}^{2} ζ_{ϵ}^{2}] \leq 4 {E [{{\bar{Δ}}_{β}^{†} (x)}^{2} {\bar{Δ}}_{ϵ}^{2}] + V [{\bar{Δ}}_{β}^{†} (x)] δ_{ϵ}^{2}} + 2 {B_{β}^{〈 k, f_{LB} 〉} (x)}^{2} E [ζ_{ϵ}^{2}] .

□

Now, (A2) can be rewritten as

{\tilde{f}}_{β, ϵ} (x) = f (x) (1 - ζ_{ϵ} + \frac{ζ_{ϵ}^{2}}{1 + ζ_{ϵ}}) + \frac{μ}{x} ζ_{β}^{†} (x) (1 - \frac{ζ_{ϵ}}{1 + ζ_{ϵ}}) = f (x) + \frac{μ}{x} {L_{β, ϵ}^{†} (x) + R_{β, ϵ}^{†} (x)},

(A31)

where

L_{β, ϵ}^{†} (x) = ζ_{β}^{†} (x) - f_{LB} (x) ζ_{ϵ}, R_{β, ϵ}^{†} (x) = f_{LB} (x) \frac{ζ_{ϵ}^{2}}{1 + ζ_{ϵ}} - ζ_{β}^{†} (x) \frac{ζ_{ϵ}}{1 + ζ_{ϵ}} .

We need to evaluate

E [{R_{β, ϵ}^{†} (x)}^{2}]

.

Lemma A5.

Suppose that AssumptionsA2.1(

ν = 0

)andBhold, and that

E [Y^{- 2}]

exists. Under the boundedness of

f_{LB}

, we have

E [{R_{β, ϵ}^{†} (x)}^{2}] \leq M^{†} [n^{- 2} + n^{- 1} D_{β}^{†} (x)],

where

M^{†} > 0

is a constant, independent of

n, β

, and x.

Proof. We have

\begin{matrix} | R_{β, ϵ}^{†} (x) | χ_{S_{n}} & \leq & 2 {f_{LB} (x) ζ_{ϵ}^{2} + | ζ_{β}^{†} (x) ζ_{ϵ} |} χ_{S_{n}}, | R_{β, ϵ}^{†} (x) | (1 - χ_{S_{n}}) & \leq & [{(μ ϵ)}^{- 1} {f_{LB} (x) + | ζ_{β}^{†} (x) |} & + f_{LB} (x) + {f_{LB} (x) | ζ_{ϵ} | + | ζ_{β}^{†} (x) |}] (1 - χ_{S_{n}}) & \leq & [{(μ ϵ)}^{- 1} {f_{LB} (x) + | ζ_{β}^{†} (x) |} + 2 {3 f_{LB} (x) ζ_{ϵ}^{2} + | ζ_{β}^{†} (x) ζ_{ϵ} |}] (1 - χ_{S_{n}}) . \end{matrix}

Using

{Y_{n} : | ζ_{ϵ} | \geq 1 / 2} \subset {Y_{n} : | {\bar{Δ}}_{ϵ} | \geq 1 / 4}

(say) for all sufficiently large n, it can be shown that

\begin{matrix} E [{R_{β, ϵ}^{†} (x)}^{2}] & \leq 16 [9 f_{LB}^{2} (x) E [ζ_{ϵ}^{4}] + E [{ζ_{β}^{†} (x)}^{2} ζ_{ϵ}^{2}]] \\ + 4 {(μ ϵ)}^{- 2} [{f_{LB}^{2} (x) + D_{β}^{†} (x)} P [| {\bar{Δ}}_{ϵ} | \geq 1 / 4] + {E [{{\bar{Δ^{†}}}_{β} (x)}^{4}] P [| {\bar{Δ}}_{ϵ} {| \geq 1 / 4]}}^{1 / 2}] . \end{matrix}

Then, the result follows from (A4), (A6), (A22), and Lemma A4, under Assumption B (recall that

ϵ = C^{'} n^{- 1 / 2}

; in this case,

P [| {\bar{Δ}}_{ϵ} | \geq 1 / 4] \leq 2 e^{- ϱ n^{1 / 2}}

, where

ϱ > 0

is a constant, independent of n). □

We are ready to prove Theorems 5–8.

Proof (Proof of Theorem 5) We start with

\begin{matrix} E [{\hat{f}}_{β, ϵ} (x)] & = & f (x) + B_{β}^{〈 k, f 〉} (x) - f (x) δ_{ϵ} + E [R_{β, ϵ} (x)], \end{matrix}

(A32)

{\hat{f}}_{β, ϵ} (x) - E [{\hat{f}}_{β, ϵ} (x)] = {\bar{Δ}}_{β} (x) - f (x) {\bar{Δ}}_{ϵ} + R_{β, ϵ} (x) - E [R_{β, ϵ} (x)],

(A33)

where

V [{\hat{f}}_{β, ϵ} (x)] = V [{\bar{Δ}}_{β} (x)] + 2 C o v [{\bar{Δ}}_{β} (x), - f (x) {\bar{Δ}}_{ϵ} + R_{β, ϵ} (x)] + V [- f (x) {\bar{Δ}}_{ϵ} + R_{β, ϵ} (x)] .

(A34)

Also,

C o v [{\bar{Δ}}_{β} (x), {\bar{Δ}}_{ϵ}] = n^{- 1} [μ^{2} E [{(Y + ϵ)}^{- 1} Y^{- 1} k (Y; β, x)] - μ E [Y^{- 1} k (Y; β, x)] μ E [{(Y + ϵ)}^{- 1}]],

hence,

| C o v [{\bar{Δ}}_{β} (x), {\bar{Δ}}_{ϵ}] | \leq n^{- 1} [μ J_{β}^{〈 k, f_{- 1} 〉} (x) + J_{β}^{〈 k, f 〉} (x)] \leq n^{- 1} [μ | | f_{- 1} {| |}_{[0, \infty)} + {| | f | |}_{[0, \infty)}] .

It is shown from (A32) and (A34) that

\begin{matrix} |B i a s [{\hat{f}}_{β, ϵ} (x)] - B_{β}^{〈 k, f 〉} (x)| & \leq & {| | f | |}_{[0, \infty)} δ_{ϵ} + {E [R_{β, ϵ}^{2} (x)]}^{1 / 2}, \\ |V [{\hat{f}}_{β, ϵ} (x)] - V [{\bar{Δ}}_{β} (x)]| & \leq & M_{2} n^{- 1} + {n^{- 1} μ J_{β}^{〈 k^{2}, f_{- 1} 〉} (x) E [R_{β, ϵ}^{2} (x)]}^{1 / 2} \\ + E [R_{β, ϵ}^{2} (x)], \end{matrix}

where

M_{2} > 0

is a constant, independent of

n, β

, and x. Using Lemma A5 and

\begin{matrix} {n^{- 1} μ J_{β}^{〈 k^{2}, f_{- 1} 〉} (x) E [R_{β, ϵ}^{2} (x)]}^{1 / 2} & \leq & M^{1 / 2} [2 n^{- 1} D_{β}^{1 / 2} (x) + n^{- 1 / 2} D_{β} (x)] \\ \leq & M^{1 / 2} [n^{- 1} + (n^{- 1} + n^{- 1 / 2}) D_{β} (x)], \end{matrix}

we have

\begin{matrix} |B i a s [{\hat{f}}_{β, ϵ} (x)] - B_{β}^{〈 k, f 〉} (x)| & \leq & M_{3} [n^{- 1 / 2} + n^{- 1 / 2} | B_{β}^{〈 k, f 〉} (x) |] \\ (a s s u m e n^{- 1} β^{- 1} = o (1)), \\ |V [{\hat{f}}_{β, ϵ} (x)] - V [{\bar{Δ}}_{β} (x)]| & \leq & M_{4} [n^{- 1} + n^{- 1 / 2} D_{β} (x)], \end{matrix}

where

M_{3}, M_{4} > 0

are constants, independent of

n, β

, and x. Then, using (A24)–(A26), the proof is completed. □

Proof Proof of Theorem 6)

Recall (A2) (also (A3) and (A19)). The strong consistency follows from (A4), (A5), and (A20), together with

(μ / x) B_{β}^{〈 k, f_{LB} 〉} (x) = O (β)

for fixed

x > 0

(see (A24)). □

Proof (Proof of Theorem 7) Recall (A33), where

{\bar{Δ}}_{ϵ} = O_{p} (n^{- 1 / 2})

. For fixed

x > 0

, Lemma A5 implies that

R_{β, ϵ}^{†} (x) - E [R_{β, ϵ}^{†} (x)] = o_{p} ({(n β^{1 / 2})}^{- 1 / 2})

, i.e.,

{(n β^{1 / 2})}^{1 / 2} ({\tilde{f}}_{β, ϵ} (x) - E [{\tilde{f}}_{β, ϵ} (x)]) = {(n β^{1 / 2})}^{1 / 2} \frac{μ}{x} {\bar{Δ^{†}}}_{β} (x) + o_{p} (1) .

The asymptotic normality follows from (A21)). □

Proof (Proof of Theorem 8)

We assume that

\int_{0}^{\infty} t^{2 (1 / η^{''} + 1) + δ_{0}} f_{LB} (t) d t = I

(say) exists for some constant

δ_{0} > 0

(then,

\int_{0}^{\infty} t^{2 (1 / η^{''} + 1) + δ_{0}} f (t) d t = μ \int_{0}^{\infty} t^{2 / η^{''} + 1 + δ_{0}} f_{LB} (t) d t \leq μ I^{\frac{2 / η^{''} + 1 + δ_{0}}{2 (1 / η^{''} + 1) + δ_{0}}}

exists). In the same way as the argument of Remark , a rigorous derivation is made by splitting the integral into two parts (we set

H = 2 / η^{''} + 1 + δ_{0}

), as follows:

(i): Take a constant $τ^{'} > 2 / (H + 1)$ . The inequality

$\begin{matrix} ϵ^{- 2} \int_{β^{- τ^{'}}}^{\infty} E [{ζ_{β}^{†} (x)}^{2} ζ_{ϵ}^{2}] d x & \leq & O (n^{- 1}) \int_{0}^{\infty} [\int_{β^{- τ^{'}}}^{\infty} k (s; β, x) {f (s) + f_{LB} (s)} d x] d s \\ + O (1) \int_{β^{- τ^{'}}}^{\infty} D_{β}^{†} (x) d x (L e m m a A 4), \end{matrix}$

together with (A6) and ${sup}_{x \geq β^{- τ^{'}}} (μ / x) \leq 1$ (say), enables us to see that, by (A31),

$\begin{matrix} \int_{β^{- τ^{'}}}^{\infty} E [{{\tilde{f}}_{β, ϵ} (x) - f (x)}^{2}] d x \\ \leq 4 [\int_{β^{- τ^{'}}}^{\infty} D_{β}^{†} (x) d x + {| | f | |}_{[0, \infty)} E [ζ_{ϵ}^{2}] \\ + {(μ ϵ)}^{- 2} \{{| | f | |}_{[0, \infty)} E [ζ_{ϵ}^{4}] + \int_{β^{- τ^{'}}}^{\infty} E [{ζ_{β}^{†} (x)}^{2} ζ_{ϵ}^{2}] d x\}] \\ = o (β^{2} + n^{- 1} β^{- 1 / 2}) (r e p e a t t h e s a m e a r g u m e n t i n R e m a r k A 2) . \end{matrix}$
(ii): Taking $2 / (H + 1) < τ^{'} < η^{''} / (1 + η^{''})$ , Theorem 5 enables us to see that, for every $0 < τ < 1 / 2$ ,

$\begin{matrix} |\int_{β^{τ}}^{β^{- τ^{'}}} {B i a s^{2} [{\tilde{f}}_{β, ϵ} (x)] + V [{\tilde{f}}_{β, ϵ} (x)]} d x - (β^{2} I^{B^{2}} + n^{- 1} β^{- 1 / 2} μ I^{V_{f_{- 1}}})| \\ \leq β^{2} (\int_{0}^{β^{τ}} + \int_{β^{- τ^{'}}}^{\infty}) B^{2} (x) d x \\ + 2 β {[I^{B^{2}} \int_{β^{τ}}^{β^{- τ^{'}}} {R_{β}^{† Bias} (x)}^{2} d x]}^{1 / 2} + \int_{β^{τ}}^{β^{- τ^{'}}} {R_{β}^{† Bias} (x)}^{2} d x \\ + n^{- 1} β^{- 1 / 2} μ (\int_{0}^{β^{τ}} + \int_{β^{- τ^{'}}}^{\infty}) V_{f_{- 1}} (x) d x + \int_{β^{τ}}^{β^{- τ^{'}}} | R_{β}^{† V} (x) | d x \\ = o (β^{2} + n^{- 1} β^{- 1 / 2}) . \end{matrix}$

□

References

Ahmad, I.A. On multivariate kernel estimation for samples from weighted distributions. Statist.Probab.Lett. 1995, 22, 121–129. [Google Scholar] [CrossRef]
Bhattacharyya, B.B.; Franklin, L.A.; Richardson, G.D. A comparison of nonparametric unweighted and length-biased density estimation of fibres. Comm.Statist.Theory Methods 1988, 17, 3629–3644. [Google Scholar]
Chaubey, Y.P.; Li, J. Asymmetric kernel density estimator for length-biased data. In Contemporary Topics in Mathematics and Statistics with Applications; Adhikari, A., Adhikari, M.R., Chaubey, Y.P., Eds.; Asian Books Private Ltd: New Delhi, 2013; pp. 1–28. [Google Scholar]
Chaubey, Y.P.; Sen, P.K.; Li, J. Smooth density estimation for length-biased data. J.Indian Soc.Agricultual Statist. 2010, 64, 145–155. [Google Scholar]
Chen, S.X. 1999. Beta kernel estimators for density functions. Comput.Statist.Data.Anal. 1999, 31, 131–145. [Google Scholar] [CrossRef]
Chen, S.X. Probability density function estimation using gamma kernels. Ann.Inst.Stat.Math. 2000, 52, 471–480. [Google Scholar] [CrossRef]
Cox, D.R. Some sampling problems in technology. In New Developments in Survey Sampling; Johnson, N.L., Smith, H.Jr., Eds.; Wiley: New York, 1969; pp. 506–527. [Google Scholar]
Cristóbal, J.A.; Alcalá, J.T. An overview of nonparametric contributions to the problem of functional estimation from biased data. Test 2001, 10, 309–332. [Google Scholar] [CrossRef]
Feller, W. An Introduction to Probability Theory and its Applications, 2nd ed.; Wiley: New York, 1971; Volume II. [Google Scholar]
Guillamón, A. , Navarro, J., and Ruiz, J.M. 1998. Kernel density estimation using weighted data. M. 1998. Kernel density estimation using weighted data. Comm.Statist.Theory Methods 27 2123–2135.
Igarashi, G. Weighted log-normal kernel density estimation. Comm.Statist.Theory Methods 2016, 45, 6670–6687. [Google Scholar] [CrossRef]
Igarashi, G. Multivariate density estimation using a multivariate weighted log-normal kernel. Sankyhā 2018, 80, 247–266. [Google Scholar] [CrossRef]
Igarashi, G.; Kakizawa, Y. Re-formulation of inverse Gaussian, reciprocal inverse Gaussian, and Birnbaum–Saunders kernel estimators. Statist.Probab.Lett. 2014, 84, 235–246. [Google Scholar] [CrossRef]
Igarashi, G.; Kakizawa, Y. Multiplicative bias correction for asymmetric kernel density estimators revisited. Comput.Statist.Data.Anal. 2020, 141, 40–61. [Google Scholar] [CrossRef]
Jones, M.C. Kernel density estimation for length biased data. Biometrika 1991, 78, 511–519. [Google Scholar] [CrossRef]
Jones, M.C. Simple boundary correction for kernel density estimation. Stat.Comput. 1993, 3, 135–146. [Google Scholar] [CrossRef]
Kakizawa, Y. Nonparametric density estimation for nonnegative data, using symmetrical-based inverse and reciprocal inverse Gaussian kernels through dual transformation. J.Statist.Plann.Inference 2018, 193, 117–135. [Google Scholar] [CrossRef]
Kakizawa, Y. A class of Birnbaum–Saunders type kernel density estimators for nonnegative data. Comput.Statist.Data.Anal. 2021, 161, 107249. [Google Scholar] [CrossRef]
Kakizawa, Y. Multivariate elliptical-based Birnbaum–Saunders kernel density estimation for nonnegative data. J.Multivariate.Anal. 2022, 187, 104834. [Google Scholar] [CrossRef]
Marron, J.S. and Ruppert, D. 1994. Transformations to reduce boundary bias in kernel density estimation. J.R.Stat.Soc. 1994, 56, 653–671. [Google Scholar]
Mnatsakanov, R.M.; Ruymgaart, F.H. Some properties of moment-empirical cdf’s with application to some inverse estimation problems. Math.Methods Statist. 2003, 12, 478–495. [Google Scholar]
Mnatsakanov, R.M.; Ruymgaart, F.H. On moment-density estimation in some biased models. In Optimality: The Second Erich L. Lehmann Symposium; Institute of Mathematical Statistics, 2006; pp. 322–333. [Google Scholar]
Parzen, E. On estimation of a probability density function and mode. On estimation of a probability density function and mode. Ann.Math.Statist. 33 1962, 1065–1076. [Google Scholar] [CrossRef]
Patil, G.P.; Rao, C.R. Weighted distributions and size-biased sampling with applications to wildlife populations and human families. Biometrics 1978, 34, 179–189. [Google Scholar] [CrossRef]
Richardson, G.D.; Kazempour, M.K.; Bhattacharyya, B.B. Length biased density estimation of fibres. J.Nonparametr.Stat. 1991, 1, 127–141. [Google Scholar] [CrossRef]
Rosenblatt, M. Remarks on some nonparametric estimates of density functions. Ann.Math.Statist. 1956, 27, 832–837. [Google Scholar] [CrossRef]
Scaillet, O. Density estimation using inverse and reciprocal inverse Gaussian kernels. J.Nonparametr.Stat. 2004, 16, 217–226. [Google Scholar] [CrossRef]
Silverman, B.W. Density Estimation for Statistics and Data Analysis; Chapman and Hal: London, UK, 1986. [Google Scholar]
Zhang, S. , Karunamuni, R.J., and Jones, M.C. 1999. An improved estimator of the density function at the boundary. C. 1999. An improved estimator of the density function at the boundary. J.Amer.Statist.Assoc. 94 1231–1241.

1	Clearly, the kernel is a rescaled version of the (standard) gamma density $s^{θ - 1} e^{- s} / Γ (θ)$ , with substitution of $x / β + 1$ for the shape parameter $θ$ . Here, the shape $θ$ is limited to be greater than or equal to 1, so as to ensure that the resulting kernel is bounded.
2	It is easy to see that $\begin{matrix} {k_{g}^{(q B S)} (s; β, x)}}^{2} \\ = & \frac{(C_{g}^{2} / C_{g^{2}})}{{β (x + c β)}}^{1 / 2}} f_{g^{2}}^{(q B S)} (s; \sqrt{1 / (x / β + c)}, β (x / β + c), θ \sqrt{1 / (x / β + c)}) A^{〈 q 〉} (\frac{s}{β (x / β + c)}), s, x \geq 0, \end{matrix}$ provided that $g^{2}$ is also a density generator.
3	It automatically means that $f_{LB}$ is a Lipschitz-continuous function (i.e., Hölder-continuous function, with exponent $η = 1$ ) on $[0, \infty)$ , with $f_{LB} (0) = 0$ , and that $f_{LB}^{'}$ and $f_{LB}^{''}$ are continuous functions on $[0, \infty)$ ; besides, under F(ii.1), $f (0) = 0$ and $f_{LB}^{'} (0) = 0$ . Note that $f_{LB}^{(j)} (t) = μ^{- 1} j f^{(j - 1)} (t) + t f^{(j)} (t)}$ , $j = 1, 2$ .
4	Note that $\int_{0}^{\infty} f_{- 1} (t) d t = (\int_{0}^{1} + \int_{1}^{\infty}) f_{- 1} (t) d t \leq \int_{0}^{1} f_{- 3 / 2} (t) d t + \int_{1}^{\infty} f (t) d t < \int_{0}^{\infty} f_{- 3 / 2} (t) d t + 1 .$ This, together with F(ii.1), implies that $\int_{0}^{\infty} f_{- 1}^{2} (t) d t$ exists.
5	The rule of thumb (ROT) procedure, with the gamma or LN reference, was also considered. But, these results were not reported here, since, not surprisingly, the ROT was non-robust for the misspecification, although the computational speed of the ROT was very high.

Table 1. Average ISEs

\times 1000

of the estimators using the LSCV selected smoothing parameter. The value in the parenthesis stands for the standard deviation.

Table 1. Average ISEs

\times 1000

of the estimators using the LSCV selected smoothing parameter. The value in the parenthesis stands for the standard deviation.

	PE $[p]$ -based BS KDE		PE $[p]$ -based log KDE		Gamma KDE
n	$p = 1$	$p = 3 / 2$	$p = 1$	$p = 3 / 2$
200	12.882	12.440	12.678	12.128	12.698
	(14.863)	(14.960)	(15.083)	(14.326)	(13.601)
300	9.767	9.699	9.809	9.833	9.939
	(11.067)	(11.587)	(11.414)	(12.160)	(10.339)
500	6.896	6.703	7.123	6.886	7.567
	(7.873)	(7.376)	(8.257)	(7.973)	(8.373)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

Asymmetric Kernel Density Estimation for Biased Data

Abstract

Keywords:

Subject:

1. Introduction

2. Preliminaries

2.1. LB Density

2.2. Boundary Bias Problem and Asymmetric Kernel Method

2.3. Two density estimators under LB sampling scheme

3. Assumptions

4. Main Results

4.1. Asymptotic Properties of ${\hat{f}}_{β, ϵ}$

4.2. Asymptotic Properties of ${\tilde{f}}_{β, ϵ}$

5. Simulation Studies

6. Discussion

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

Appendix A.1. Technical Conditions on k(·;β,x)

Appendix A.2. Auxiliary Lemmas

Appendix B

Appendix B.1. Some basic results for ζ ϵ

Appendix B.2. Some preliminary results for ζ β (x)

Appendix B.3. Some Preliminary Results for ζ β † (x)

Appendix B.4. Proofs of Main Results

References

MDPI Initiatives

Important Links

Subscribe

Asymmetric Kernel Density Estimation for Biased Data

Abstract

Keywords:

Subject:

1. Introduction

2. Preliminaries

2.1. LB Density

2.2. Boundary Bias Problem and Asymmetric Kernel Method

2.3. Two density estimators under LB sampling scheme

3. Assumptions

4. Main Results

4.1. Asymptotic Properties of f ^ β , ϵ

4.2. Asymptotic Properties of f ˜ β , ϵ

5. Simulation Studies

6. Discussion

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

Appendix A.1. Technical Conditions on k(·;β,x)

Appendix A.2. Auxiliary Lemmas

Appendix B

Appendix B.1. Some basic results for ζ ϵ

Appendix B.2. Some preliminary results for ζ β (x)

Appendix B.3. Some Preliminary Results for ζ β † (x)

Appendix B.4. Proofs of Main Results

References

MDPI Initiatives

Important Links

Subscribe

4.1. Asymptotic Properties of ${\hat{f}}_{β, ϵ}$

4.2. Asymptotic Properties of ${\tilde{f}}_{β, ϵ}$