Preprint
Article

This version is not peer-reviewed.

Functional Nonparametric Estimation of a Conditional Distribution Function under Spatial Weak Dependence

Submitted:

26 August 2025

Posted:

27 August 2025

You are already at the latest version

Abstract
This paper addresses the asymptotic analysis of a nonparametric kernel estimator for conditional distribution functions in the context of spatially weakly dependent functional variables, where the covariates lie in an infinite-dimensional space. We focus on establishing the almost complete convergence (a.~co) of the estimator and deriving its rate of convergence under a spatial sampling framework characterized by quasi-association. By integrating spatial structure into the theoretical development, we highlight the impact of weak spatial dependence on the performance of the estimator. Our results provide key insights into the applicability and robustness of kernel-based nonparametric inference for spatially structured functional data.
Keywords: 
;  ;  ;  

1. Introduction

In recent years, the field of statistical analysis has witnessed significant advancements, particularly in the context of dependent data. This evolution is evident in a variety of groundbreaking studies that have contributed to our understanding and methodologies [1]. Functional Data Analysis (FDA) is a statistical method that is used to analyze data that are considered functions, rather than just single values.
In traditional data analysis, observations are typically represented by single values such as numbers or categories. However, in many scenarios, data are collected over a continuum, such as time, space, or another ordered domain. FDA treats these data as continuous functions or curves rather than discrete points (for an example, see [2,3,4]).
Association refers to the relationship or connection between different elements within a dataset. In statistical analysis and data science, the association between variables and associated data is fundamental to understanding relationships and patterns within data sets. Associations help to explore how changes in one variable may impact another or how variables relate to each other in a dataset. By exploring associations, analysts and researchers can gain insight into how different variables interact and influence each other, leading to informed decision-making and a deeper understanding of the underlying phenomena represented by the data.
Newman’s early work in 1984 set the foundation by exploring the role of research within this domain. The concept of weak dependence, pivotal in this field, was further elaborated by Doukhan and Louhichi in 1999, providing a new perspective on dependence conditions and their applications to moment inequalities [5].
Studies addressing both positive and negative dependent random variables have been explored in various references [6,7,8]. In their work [9], the authors introduced the concept of quasi-association to analyze stochastic occurrences with real values, illustrating the notion of weak dependence. The idea, which is elaborated on by the work mentioned in [10], offers a unified framework for evaluating groups of randomly dependent variables that are positively or negatively dependent. This framework is applicable to the domain of real-valued random fields.
In the corpus of research that is currently available, the approach of nonparametric estimation of quasi-associated random variables has only attracted a small amount of attention. The research conducted by [11] focuses on a limit theorem that pertains to quasi-associated Hilbertian random variables. This theoretical framework was the topic of the examination. Furthermore, in circumstances where dependency is weak, the asymptotic outcomes of an M-estimator of regression are explored according to the findings of [12]. In the research work that is referred to as [13], the asymptotic normality of an estimator is investigated, with particular emphasis placed on the single-index structure of the conditional hazard function.
One of the areas that was considered significant in the study presented in [14,15] was the examination of nonparametric estimation for linked random variables. This was the topic of the work mentioned in [16]. The authors demonstrate the resilient uniform consistency properties of the partial derivatives of multivariate density functions under weak dependency using the approach provided in [17]. In addition to this, they determine the necessary rates of convergence in order to demonstrate the asymptotic normality of these estimators. Regarding the most recent research on quasi-associated data, the work has been carried out by [18,19,20].
In the context of a single-index regression model, the objective of this study is to evaluate the application of the kernel (k-nn) technique, especially in circumstances in which the explanatory variable is measured in functional space. The implementation of this method in circumstances that include association dependency is another aspect that is investigated in this work. The research given in [21] is an investigation of the use of k-nn in a regression model using a single index. This work follows a similar line of thought. Specifically, the study focuses on situations that include functional predictors and it reveals that almost ideal rates of convergence may be attained even in situations where there is an absence of strong reliance.
The purpose of the work mentioned in [22] is to identify the asymptotic characteristic of nonparametric estimation strategies for the regression function in the single functional index model (SFIM), with the assumption of quasi-association dependence.
In addition, the progress of this topic is characterized by the investigation of limit theorems for quasi-associated variables, as Douge’s paper from 2010 [11] demonstrates. Significant advances were made in 2016 with the work that Mechab and Laksaci did on nonparametric relative regression for associated random variables [23]. Daoudi et al. further advanced of research by establishing the asymptotic normality of the estimator of conditional hazard functions in the context of quasi-associated data [24].
In the realm of kernel density estimation on random fields, Lanh.Tat. Tran’s 1990 work was seminal [25], followed by significant contributions by Dabo-Niang and Yao in 2007 and Carbon, Francq and Tran in the same year [26,27]. J. Li and L.T. Tran in 2009 on nonparametric estimation of conditional expectation further underscores the evolving nature of statistical methods and their applications [28].
Moreover, the introduction of kernel nearest neighbors (k-nn) methodologies and the concept of a single functional index have further enriched the landscape of statistical analysis [29]. These approaches offer robust solutions for dealing with various types of data, including those that exhibit complex dependencies. The single functional index approach introduces an efficient way to model the influence of predictor variables on a response variable, simplifying complex multivariate relationships in a more manageable form [30]. This innovation has been instrumental in the advancement of the field, allowing for a more refined and insightful analysis of dependent data.
The integration of these advanced methodologies has opened new avenues in the realm of statistical analysis, particularly in the handling of complex datasets [29,30]. The k-nn approach adapts to the local structure of the data, providing a more accurate representation of the underlying relationships, which is crucial in various fields like finance, healthcare and environmental studies.
Spatial variables and spatial data play a critical role in understanding and analyzing geographic phenomena, patterns, and the development of predictive models and simulations to predict spatial patterns, trends, and future scenarios. Spatial data consists of information that is explicitly tied to geographic locations or coordinates on the Earth’s surface or in a specified area. Spatial data can be represented in various formats, including points, lines, polygons, grids, and rasters. It encompasses both the spatial coordinates and associated attribute data.
Using spatial data analysis techniques, researchers, policymakers, and practitioners can gain valuable insights into spatially distributed processes and make informed decisions for sustainable development, environmental conservation, and improved quality of life.
The contributions of Cressie, Banerjee, Carlin, Gelfand, and others have been pivotal in advancing the field of spatial statistics, with significant implications for environmental science, urban studies, and other disciplines [31,32].
In summary, the field of statistical analysis has undergone a remarkable transformation, driven by the development of new methodologies and the refinement of existing ones. The incorporation of kernel nearest neighbors, the single functional index model, and advancements in spatial data analysis represent significant leaps forward in our ability to analyze and interpret complex data. As we continue to explore and expand these methodologies, we pave the way for more sophisticated and insightful data analysis across various scientific disciplines.
In the following sections, we unveil our spatial model estimator in Section 2, propose hypotheses, and delve into the near-complete convergence of this estimator in Section 3. Through simulation studies, we scrutinize the asymptotic behavior using finite sample data in Section 4, culminating in conclusions outlined in Section 5. Appendix A contains detailed proofs to support our assertions.

2. Model Construction and Its Estimator

2.1. Model

To construct our model, we define:
Z i = ( A i , B i ) , i N N ,
where Z i takes values in F × , N N * , and ( F , d ) denotes a space of infinite-dimensional functions. Accordingly, we consider the sequence A i i N N as a functional random field.
In order to achieve the objective of our study, we introduce a kernel-type estimator F ^ a of the conditional distribution function F a . This estimation assumes that the functional random field is observed over the domain:
I n = i = ( i 1 , , i N ) N N : 1 i k n k , k = 1 , , N ,
with n = ( n 1 , , n N ) N N .
The kernel estimator of the conditional distribution function is given by:
F K ( a , b ) = i I n K h K 1 d ( a , A i ) H h H 1 ( b Y i ) i I n K h K 1 d ( a , A i ) , b R .
We can also express the numerator and denominator of the estimator separately as:
F K N ( a , b ) = i I n K i ( a ) H i ( b ) n ^ E K 1 ( a ) ,
F K D ( a ) = i I n K i ( a ) n ^ E K 1 ( a ) ,
where
K i ( a ) = K h K 1 d ( a , A i ) , H i ( b ) = H h H 1 ( b B i ) .
Here, K i ( a ) denotes a kernel function, while H i ( b ) is a smoothing (cumulative distribution-type) function. The parameters h K = h K , n and h H = h H , n are sequences of positive real numbers representing bandwidths. The symbol 1 denotes the unit multi-index in the spatial field.
The central aim of this study is to analyze the almost sure convergence of the estimator F K ( a , b ) toward F a with respect to the random field ( Z i ) i N N .

2.2. Spatial Dependence Measures

There exists a function φ ( t ) such that φ ( t ) 0 as t , and for all finite subsets E 0 , E ι N N , the following inequality holds:
α B ( E 0 ) , B ( E ι ) = sup B B ( E 0 ) , C B ( E ι ) P B C P ( B ) P ( C ) ψ Card ( E 0 ) , Card ( E ι ) φ dist ( E 0 , E ι ) .
Here, B ( E 0 ) (resp. B ( E ι ) ) denotes the Borel σ -algebra generated by Z i : i E 0 (resp. Z i : i E ι ). The notation Card ( E 0 ) (resp. Card ( E ι ) ) stands for the cardinality of E 0 (resp. E ι ), while dist ( E 0 , E ι ) represents the Euclidean distance between the two sets.
The function ψ : N 2 R + is symmetric and non-increasing in each argument. It satisfies one of the following regularity conditions:
ψ ( n , n 1 ) C min ( n , n 1 ) n , n 1 N ,
or
ψ ( n , n 1 ) C n + n 1 + 1 ζ , n , n 1 N ,
where ζ 1 and C > 0 is a constant.
Definition 1.
Let A 1 , , A n be real-valued random variables satisfying the following conditions:
  • E ( A j ) = 0 for all j = 1 , , n ,
  • P ( | A j | M ) = 1 for all j = 1 , , n , with M < .
Let σ n 2 = Var i = 1 n A i , and suppose there exist constants K > 0 (bounded) and β > 0 such that the following covariance inequality holds for all multi-indices:
( s 1 , , s u ) N u , ( t 1 , , t v ) N v , with 1 s 1 s u t 1 t v n :
Cov i = 1 u A s i , j = 1 v A t j K 2 M u + v 2 v exp β ( t 1 s u ) .
Then, for any t > 0 , the following exponential inequality holds:
P j = 1 n A j > t exp t 2 / 2 G n + J n 1 / 3 t 5 / 2 ,
where the constants G n and J n are given by:
G n σ n 2 , J n = 16 n K 2 9 G n · 1 e β 1 · 2 K v M 1 e β .

3. Asymptotic Properties

3.1. Background Information and Assumptions

Throughout this section, we denote by C and C strictly positive constants. We assume that for all y S :
F a ( b ) < 1 .
In the spatial context, the notation n means that:
min { n k } ,
and for all 1 j , k N , the ratio of components satisfies:
n j n k < C < .
We now introduce the following hypotheses:
(H1)
P ( A B ( a , r ) ) = ϕ a ( r ) > 0 .
Here, B ( a , r ) denotes the ball of radius r centered at a.
(H2)
The function φ satisfies the summability condition:
i = 1 i δ φ ( i ) < , with δ > 5 N .
(H3)
For all i j , the following holds:
0 < sup i j P ( A i , A j ) B ( a , h ) × B ( a , h ) C ϕ a ( h ) e + 1 e ,
where 1 < e < 1 N .
(H4)
There exists a neighborhood N a of a such that for all ( a 1 , a 2 ) N a 2 and ( b 1 , b 2 ) S 2 ,
| F a 1 ( b 1 ) F a 2 ( b 2 ) | C d m 1 ( a 1 , a 2 ) + | b 1 b 2 | m 2 ,
with m 1 > 0 , m 2 > 0 .
(H5)
The kernel K has compact support on [ 0 , 1 ] and satisfies:
C Be ( 0 , 1 ) ( · ) K ( · ) C Be ( 0 , 1 ) ( · ) ,
where Be ( 0 , 1 ) denotes the indicator function of the interval ( 0 , 1 ) .
(H6)
The function H belongs to the class C 2 , has compact support, and possesses bounded derivatives.
(H7)
There exist constants:
0 < α < δ 5 N 2 N , η 0 > 0 ,
such that:
lim n n ^ α = ,
and
C · n ^ ( 5 + 3 α ) N δ δ + η 0 ϕ a ( h ) ,
where:
n ^ = n 1 n N .

3.2. Main Result

Theorem 1.
Under the hypotheses (H1-H7), we have the following uniform convergence rate:
F K a ( b ) F a ( b ) = O a . co log ( n ^ ) n ^ ϕ a ( h K ) 1 / 2 + O a . co h K m 1 + h H m 2 ,
In the particular case where N = 1 , we recover the same convergence rate as in Ferraty et al. [33]. The proof of this result relies on the following decomposition:
F K ( a , b ) F a ( b ) = 1 F K D ( a ) F K N ( a , b ) E F K N ( a , b ) F a ( b ) E F K N ( a , b ) + F a ( b ) F K D ( a ) E F K D ( a ) F K D ( a ) .
The following results form the basis of the proof of Theorem 1:
Lemma 1.
Under hypotheses (H1-H5) and (H7), we have:
F K D ( a ) E F K D ( a ) = O a . co log ( n ^ ) n ^ ϕ a ( h K ) 1 / 2 .
Lemma 2.
Under the assumptions of Theorem 1, we have:
F K N ( a , b ) E F K N ( a , b ) = O a . co log ( n ^ ) n ^ ϕ a ( h K ) 1 / 2 .
Lemma 3.
Under hypotheses (H1-H5) and (H7), it holds that:
F a ( b ) E F K N ( a , b ) = O a . co h K m 1 + h H m 2 .
Corollary 1.
Under the assumptions of Lemma 1, we obtain:
n N N P F K D ( a ) < 1 2 < .

4. A Simulation Study

To assess the finite-sample performance of the proposed kernel estimator for the conditional distribution function F a ( b ) , we conduct a simulation study tailored to spatially weakly dependent functional data. The goal is to empirically validate the asymptotic properties established in our theoretical framework, particularly the almost complete convergence (a.co) and the rate of convergence under quasi-association.

4.1. Simulation Design

We consider a spatial domain of dimension N = 2 , defined over a regular grid of locations i I n = { 1 , , n 1 } × { 1 , , n 2 } , with a total sample size n ^ = n 1 × n 2 . At each spatial index, a functional covariate A i L 2 [ 0 , 1 ] is generated using smooth basis functions (e.g., B-splines), and perturbed with a spatially quasi-associated noise field to induce spatial weak dependence. This reflects a realistic spatial functional data scenario.
The scalar response variable B i is simulated via a known conditional model:
F a ( b ) = P ( B i b A i = a ) = Φ b m ( a ) σ ( a ) ,
where Φ is the standard normal cumulative distribution function, and m ( a ) , σ ( a ) are smooth functions of the norm a . This setup allows exact computation of the true conditional distribution for validation.

4.2. Visualization of Functional Data

To illustrate the nature of the spatially weakly dependent functional covariates A i , we plot a sample of these curves. Each function is observed on the interval [ 0 , 1 ] , and smoothness is ensured via B-spline basis functions.
As seen in Figure 1, we display a representative sample of 35 functional curves. These curves are smooth and exhibit gradual variability across space, consistent with the quasi-associated dependence structure. This visualization highlights the functional nature of the covariates and supports the assumptions used in our theoretical results.

4.3. Simulation Procedure

The simulation process proceeds as follows:
  • Parameter Initialization: Define n 1 = n 2 = 10 , bandwidths h K , h H , and basis size (e.g., 15 B-splines). Choose functions m ( a ) = a , ψ , and σ ( a ) = 0.3 + 0.1 a .
  • Data Generation: Each functional covariate is generated as:
    A i ( t ) = j = 1 J θ j ( i ) ϕ j ( t ) + δ i ( t ) ,
    where θ j ( i ) N ( 0 , 0 . 3 2 ) , and δ i is a spatially correlated perturbation modeled via exponential decay: Cov ( δ i , δ j ) exp ( i j ) .
  • Response Variable: For each i , compute:
    B i = m ( A i ) + η i , η i N ( 0 , σ ( a ) 2 ) .
  • Kernel Estimation: We employ the nonparametric estimator defined in Equation (1), where d ( · , · ) denotes the L 2 distance between functional covariates, H corresponds to the Gaussian cumulative distribution function, and K is the Epanechnikov kernel defined by:
    K ( u ) = 3 4 ( 1 u 2 ) 1 { | u | 1 } .
    K is the Epanechnikov kernel, and H is the Gaussian CDF.
  • Evaluation and Metrics: We compute pointwise metrics:
    Bias ( a , b ) = E [ F ^ K ( a , b ) ] F a ( b ) , MSE ( a , b ) = E [ ( F ^ K ( a , b ) F a ( b ) ) 2 ] ,
    estimated via Monte Carlo replications.
  • Visualization: Plot F ^ K ( a , b ) against F a ( b ) to visually assess the quality of the estimator across different ranges of b.

4.4. Results and Visual Analysis

To visually validate the performance of the kernel estimator, we plot the estimated conditional distribution function versus the true function for a fixed reference curve a. The figure below, generated from R, shows the estimated F ^ K ( a , b ) in red (dashed) and the true function F a ( b ) in blue (solid).
Figure 2. Comparison between the estimated and true conditional distribution functions F ^ K ( a , b ) and F a ( b ) for fixed functional input a.
Figure 2. Comparison between the estimated and true conditional distribution functions F ^ K ( a , b ) and F a ( b ) for fixed functional input a.
Preprints 174042 g002
We observe that the estimator aligns closely with the true distribution in the central region. Slight overestimation near the tails may arise from finite-sample effects or boundary bias. Overall, the estimator exhibits robust behavior under weak spatial dependence and functional variability.

4.5. Summary of Findings

The kernel estimator performs well in capturing the true conditional distribution under spatially quasi-associated functional covariates. Simulation results corroborate the theoretical properties, particularly the convergence rate stated in Theorem 1. Estimation accuracy improves with increased sample size and suitable bandwidth selection. Visualization confirms strong agreement between estimated and true CDFs, with mild deviation at boundaries. This simulation study confirms the consistency and practical feasibility of the proposed estimator. It supports our asymptotic results and highlights its flexibility in the spatial functional context.

5. Conclusions and Perspectives

This study provides a comprehensive investigation of a kernel-type estimator for conditional distribution functions in the context of spatially indexed functional data, particularly when covariates take values in infinite-dimensional spaces. The core objective was to examine the asymptotic properties of the estimator, focusing on its almost complete convergence (a. co) and the associated convergence rates.
We considered a spatial sampling framework with quasi-associated dependence structures, reflecting realistic conditions in many applied settings. Under a set of regularity conditions, we derived uniform convergence results and provided theoretical guarantees for the performance of the estimator. The theoretical developments were supported by a simulation study, which confirmed the estimator’s consistency and highlighted its sensitivity to spatial correlation and bandwidth selection.
From a broader perspective, our findings emphasize the critical role of incorporating spatial structure into statistical modeling. Ignoring such dependence can lead to inaccurate inference, especially in high-dimensional or functional data settings. The kernel-based approach demonstrated here offers a flexible and nonparametric alternative, well-suited for complex spatial data.

Future Research Directions

Building upon this foundation, several promising avenues for further research emerge:
The current framework can be generalized to irregular spatial lattices or continuous spatial domains using tools from spatial stochastic processes.
Investigating the behavior of the estimator under deviations from the quasi-association assumption, heavy-tailed distributions, or outliers would enhance its practical relevance.
Developing adaptive or cross-validated procedures for selecting kernel bandwidths h K and h H remains an open challenge, especially in functional settings.
Applying the methodology to spatial datasets in geostatistics, climate science, or public health would provide practical insights and further validate the approach.
Creating efficient algorithms and open-source software for kernel estimation in spatially dependent functional data would facilitate broader adoption in applied contexts.
Overall, this work contributes to the understanding of nonparametric inference for conditional distributions in spatial data and lays the groundwork for future theoretical and applied developments in this growing area of statistical research.

Author Contributions

Conceptualization, I.B and H.D.; A.B.; methodology, I.B and H.D.; software, I.B; validation, A.B., and H.D.; formal analysis, H.D. and I.B; investigation, I.B , A.B., and H.D.; resources, I.B and H.D.; data curation, A.R.; writing original draft preparation, H.D. and H.M.A.; writing review and editing, I.B, A.B, and H.D.; visualization, H.M.A.; supervision, A.B and H.M.A.; project administration,I.B; funding acquisition, H.M.A.

Funding

This research was funded by Taif University, Saudi Arabia, Project No. (TU-DSPP-2024-162).

Data Availability Statement

The data used to support the findings of this study are available on request from the corresponding author.

Acknowledgments

The authors extend their appreciation to Taif University, Saudi Arabia, for supporting this work through project number (TU-DSPP-2024-162).

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof 
(Proof of Lemma 1). The concepts used in the proof are comparable to those in [27]. We have:
F K D ( a ) E F K D ( a ) = 1 n ^ E K 1 ( a ) i I n N i ( a ) ,
where N i ( a ) : = K i ( a ) E [ K i ( a ) ] .
We take into account the spatial decomposition introduced by [25] for the variables N i ( a ) , defined with a fixed integer ρ n . For j N N , define:
Υ ( 1 , n , a , j ) = i k = 2 j k ρ n + 1 k = 1 , , N 2 j k ρ n + ρ n N i ( a ) .
In a similar vein, the other two terms are defined as:
Υ ( 2 N 1 , n , a , j ) = i k = 2 j k ρ n + ρ n + 1 k = 1 , , N 1 2 ( j k + 1 ) ρ n i N = 2 j N ρ n + 1 2 j N ρ n + ρ n N i ( a ) , Υ ( 2 N , n , a , j ) = i k = 2 j k ρ n + ρ n + 1 k = 1 , , N 2 ( j k + 1 ) ρ n N i ( a ) .
Now, set for i = 1 , , N :
r i = n i 2 ρ n , J = { 0 , , r 1 1 } × × { 0 , , r N 1 } .
We define:
( n , a , i ) = j J Υ ( i , n , a , j ) ,
so that we recover:
i I n N i ( a ) = i = 1 2 N ( n , a , i ) .
We can write, without loss of generality, the following identity:
F K D ( a ) E F K D ( a ) = 1 n ^ E K 1 ( a ) i = 1 2 N ( n , a , i ) .
Even in the case where n i 2 r i p n , we can group the remaining terms into an additional block denoted by ( n , a , 2 N + 1 ) , and this does not affect the validity of the proof (see [32] for details).
From Equation (A7), for any η > 0 , we have:
P F K D ( a ) E [ F K D ( a ) ] η 2 N max i = 1 , , 2 N P T ( n , a , i ) η n ^ E [ K 1 ( a ) ] ,
where it remains to bound, for each i = 1 , , 2 N :
P T ( n , a , i ) η n ^ E [ K 1 ( a ) ] .
We now focus on the specific case where i = 1 . To proceed, we perform a reordering of the variables, denoted as U ( 1 , n , a , j ) ; j J . Leveraging Lemma 4.5 from Carbon et al. [27], we apply it to this reordered sequence. The variables under the new enumeration are denoted by γ 1 , , γ M , where:
M = k = 1 N r k = 2 N n ^ p n N n ^ p n N .
For each γ j , there exists a corresponding index j j J such that:
γ j = i I ( 1 , n , a , j j ) N i ( a ) ,
where
I ( 1 , n , a , j j ) = i : 2 j k j p n + 1 i k 2 j k j p n + p n , k = 1 , , N .
Each such set contains p n N elements, and the minimal distance between any two such sets is at least p n N .
Now, consider independent random variables γ 1 * , , γ M * that are constructed to follow the same marginal distribution as the corresponding γ j , for j = 1 , , M . According to the result of Carbon et al. [27], we have:
j = 1 M E γ j γ j * 2 C M p n N ψ ( ( M 1 ) p n N , p n N ) φ ( p n ) .
Our next set of results is derived using **Bernstein’s** and **Markov’s** inequalities. We obtain:
P T ( n , a , i ) η n ^ E K 1 ( a ) RE 1 + RE 2 ,
where:
RE 1 = P j = 1 M γ j * η n ^ E K 1 ( a ) 2 2 exp η n ^ E K 1 ( a ) 2 M Var [ Z 1 * ] + C p n N η n ^ E K 1 ( a ) ,
and:
RE 2 = P j = 1 M γ j γ j * η n ^ E K 1 ( a ) 2 C η n ^ E K 1 ( a ) j = 1 M E γ j γ j * C M p n N η n ^ E K 1 ( a ) 1 ψ ( ( M 1 ) p n N , p n N ) φ ( p n ) .
Note that:
n ^ = 2 N M p n N ,
and
ψ ( ( M 1 ) p n N , p n N ) p n N .
Therefore, for:
η = η 0 log n ^ n ^ ϕ a ( h K ) ,
we obtain:
RE 2 n ^ p n N ( log n ^ ) 1 / 2 n ^ ϕ a ( h K ) 1 / 2 φ ( p n ) .
By choosing:
p n = C n ^ ϕ a ( h K ) log n ^ 1 / 2 N ,
we deduce:
RE 2 ( log n ^ ) 1 n ^ φ ( p n ) .
According to assumption (H7), we finally show that:
n I n n ^ φ ( p n ) < .
Concerning RE 1 , we now evaluate the variance term Var γ 1 * . In fact, we write:
Var γ 1 * = Var i I ( 1 , n , a , 1 ) N i ( a ) = i , j I ( 1 , n , a , 1 ) Cov ( N i ( a ) , N j ( a ) ) .
We denote:
RV n = i I ( 1 , n , a , 1 ) Var [ N i ( a ) ] and RC n = i j i , j I ( 1 , n , a , 1 ) Cov ( N i ( a ) , N j ( a ) ) .
Under assumptions (H1) and (H2), we have:
Var [ N i ( a ) ] C ϕ a ( h K ) + ϕ a ( h K ) 2 ,
which leads to the following bound:
RV n = O p n N ϕ a ( h K ) .
To control the covariance term RC n , we follow the decomposition technique of Masry [34]. We define two subsets of pairs ( i , j ) as follows:
S 1 = ( i , j ) I ( 1 , n , a , 1 ) 2 : 0 < i j c n , S 2 = ( i , j ) I ( 1 , n , a , 1 ) 2 : i j > c n .
When the actual sequence c n approaches infinity.
We split RC n into two components corresponding to the sets S 1 and S 2 :
RC n = ( i , j ) S 1 Cov N i ( a ) , N j ( a ) + ( i , j ) S 2 Cov N i ( a ) , N j ( a ) = RC n 1 + RC n 2 .
For the first term RC n 1 , we have:
RC n 1 = ( i , j ) S 1 E K i ( x ) K j ( x ) E K i ( x ) E K j ( x ) C ρ n N c n N ϕ x ( h K ) ϕ x ( h K ) 1 / a + ϕ x ( h K ) C ρ n N c n N ϕ x ( h K ) ( a + 1 ) / a .
For the second term RC n 2 , we use:
RC n 2 = ( i , j ) S 2 Cov N i ( x ) , N j ( x ) .
Since the kernel K is bounded, it holds that:
Cov N i ( x ) , N j ( x ) C φ i j .
For higher-order covariances, especially when t 1 = s u , we analyze:
Cov i = 1 u N s i , j = 1 v N t j C M E [ K 1 ] u + v E [ | γ 1 | u + v ] C K H M E [ K 1 ] u + v E [ | N 1 | ] ,
leading to:
Cov i = 1 u N s i , j = 1 v N t j ϕ x ( h K ) C ρ n N n ^ ϕ x ( h K ) u + v .
Now, by invoking quasi-association and assumption (H5), we derive the more precise inequality:
Cov i = 1 u N s i , j = 1 v N t j h K 1 Lip ( K ) + h H 1 Lip ( H ) 2 C M E [ K 1 ] u + v 2 i = 1 u j = 1 v λ s i , t j h K 1 Lip ( K ) + h H 1 Lip ( H ) 2 C ρ n N n ^ E [ K 1 ] u + v v λ t 1 s u h K 1 Lip ( K ) + h H 1 Lip ( H ) 2 C ρ n N n ^ ϕ x ( h K ) u + v v e α ( t 1 s u ) .
When t 1 < s u , by assumption (H6), we have:
Cov i = 1 u N s i , j = 1 v N t j C K H M E [ K 1 ] u + v 2 E | N s u N t 1 | + E | N s u | E | N t 1 | C M ϕ x ( h K ) u + v C ρ n N ϕ x ( h K ) 2 C ρ n N n ^ ϕ x ( h K ) u + v C ρ n N ϕ x ( h K ) 2 .
Moreover, by applying Hölder-type interpolation on (A34) and (A35) via powers γ and 1 γ , we obtain:
Cov i = 1 u N s i , j = 1 v N t j ϕ x ( h K ) C M ϕ x ( h K ) u + v ϕ x ( h K ) C ρ n N n ^ ϕ x ( h K ) u + v .
As a consequence, the variables Q i , i = 1 , , 2 N satisfy the assumptions of the lemma with:
K n = C 2 N ϕ x ( h K ) , M n = C 2 N ϕ x ( h K ) , A n = Var i = 1 2 N i .
Thus, applying Bernstein’s inequality:
P i = 1 2 N i > η log n ^ n ^ ϕ x ( h K ) exp η 2 · log n ^ n ^ ϕ x ( h K ) Var i = 1 2 N B i + log 5 n ^ ( n ^ ϕ x ( h K ) ) 7 6 exp η 2 · log n ^ C + log 5 n ^ n ^ ϕ x ( h K ) 6 C exp C η 2 log n ^ .
Regarding the second covariance term, we can deduce the following estimate:
RC n 2 C ( i , j ) S 2 φ i j C p n N i : i c n φ i C ρ n N c n N a i : i c n i N a φ i .
We take c n = ϕ x ( h K ) 1 / ( N a ) , then:
RC n 2 C ρ n N c n N a i c n i N a φ ( i ) C ρ n N ϕ x ( h K ) i c n i N a φ ( i ) .
From assumption (H2), we obtain:
RC n 2 C ρ n N ϕ x ( h K ) .
Similarly, with the same choice of c n , we also have:
RC n 1 C ρ n N ϕ x ( h K ) .
Hence, the total variance becomes:
Var [ γ 1 ] = O ρ n N ϕ x ( h K ) .
Using this result, we write:
2 N n ^ E [ K 1 ] 2 Var [ γ 1 ] = O 2 N ρ n N ϕ x ( h K ) ( n ^ ϕ x ( h K ) ) 2
= O 1 n ^ M ϕ x ( h K )
= O 1 n ^ ϕ x ( h K ) .
Moreover,
Var [ γ 1 * ] = O p n N ϕ x ( h K ) .
Using the expressions for p n , M, and η , together with the last bound, we conclude:
RE 1 exp C η 0 log n ^ .
In light of all the above, the result of the lemma follows by choosing a suitable constant η 0 .
Proof 
(Proof of Lemma 2). Let us define:
Δ i = K i ( x ) H i ( t j ) E K i ( x ) H i ( t j ) .
It is clear that:
Δ i C Δ i ( x ) ,
for some constant C > 0 .
We aim to show that:
P max z G n F N x ( z ) E [ F N x ( z ) ] > η 0 log n ^ n ^ ϕ x ( h K ) z n ( B 1 + B 2 ) ,
where G n is a finite grid of evaluation points, and B 1 , B 2 are error terms derived from concentration inequalities.
Using the definition of z n , the Equation (A21), and assumption (H7), we obtain:
n I n n ^ α + 1 2 RE 2 < .
For a suitable choice of η 0 , we also have:
n I n n ^ α + 1 2 RE 1 < .
Therefore, Lemma 2 follows as a consequence of these two results, combined with the bounds in Equations (23) and (24). □
Proof 
(Proof of Lemma 3). Using assumption H4 (15), and noting that the random variables are identically distributed, we obtain the following:
F x ( y ) E F K N ( a , b ) = E K 1 ( a ) B e B ( a , h K ) ( A ) E [ H 1 ( b ) A ] F a ( b ) , b S .
Next, we apply integration by parts:
E [ H 1 ( b ) A ] = R H h H 1 ( b z ) f A ( z ) d z = h H 1 R H h H 1 ( b z ) F A ( z ) d z .
Performing the change of variable t = b z h H , we obtain:
E [ H 1 ( b ) A ] = R H ( t ) F A ( b h H t ) d t .
Since H is a probability density function and applying assumption H4 (15), it follows that:
E [ H 1 ( b ) A ] F x ( b ) R H ( t ) F A ( b h H t ) F a ( b ) d t .
Finally, we obtain:
B e B ( a , h K ) ( A ) · E [ H 1 ( b ) A ] F a ( b ) R H ( t ) h K m 1 + | t | m 2 h H m 2 d t , b S .
This completes the proof of the lemma under assumption H6 (6). □
Proof 
(Proof of Corollary 1). Since we already have that E [ F K D ( a ) ] = 1 , we apply the basic inequality from probability theory:
P F K D ( a ) < 1 2 = P F K D ( a ) 1 2 P F K D ( a ) E [ F K D ( a ) ] > 1 2 ,
and by Lemma 1, the quantity F K D ( a ) E [ F K D ( a ) ] is controlled by a rate of almost complete convergence. Thus, we deduce that:
n N N P F K D ( a ) < 1 2 < ,
which implies the almost sure convergence of the denominator bounded away from zero, and completes the proof. □

References

  1. Newman, M. The Role of Research in This Association. Journal of School Health 1984, 54. [Google Scholar] [CrossRef]
  2. Aneiros, G.; Cao, R.; Fraiman, R.; Genest, C.; Vieu, P. Recent advances in functional data analysis and high-dimensional statistics. J. Multivariate Anal.170, 3–9. [CrossRef]
  3. Ramsay, J.; Silverman, B. W. Functional Data Analysis; Springer: New York, USA, 2005. [Google Scholar]
  4. Ramsay, J.; Hooker, G.; Graves, S. Functional Data Analysis with R and MATLAB; Springer: New York, USA, 2009. [Google Scholar]
  5. Doukhan, P.; Louhichi, S. A new weak dependence condition and applications to moment inequalities. Stochastic Process. Appl. 1999, 84, 313–342. [Google Scholar] [CrossRef]
  6. Matula, P. A note on the almost sure convergence of sums of negatively dependent random variables. Stat. Probab. Lett. 1992, 15, 209–213. [Google Scholar] [CrossRef]
  7. Newman, C.M. Asymptotic independence and limit theorems for positively and negatively dependent random variables. In Inequalities in Statistics and Probability; IMS Lecture Notes Monograph Series 1984, 5, 127–140. [Google Scholar]
  8. Roussas, G.G. Positive and negative dependence with some statistical applications. In Asymptotics, Nonparametrics and Time Series; Ghosh, S., Ed.; Marcel Dekker, Inc.: New York, USA, 1999; pp. 757–788. [Google Scholar]
  9. Doukhan, P.; Louhichi, S. A new weak dependence condition and applications to moment inequalities. Stochastic Process. Their Appl. 1999, 84, 313–342. [Google Scholar] [CrossRef]
  10. Bulinski, A.; Suquet, C. Normal approximation for quasi-associated random fields. Stat. Probab. Lett. 2001, 54, 215–226. [Google Scholar] [CrossRef]
  11. Douge, L. Théorèmes limites pour des variables quasi-associées hilbertiennes. Ann. Inst. Stat. Univ. Paris 2010, 54, 51–60. [Google Scholar]
  12. Attaoui, S.; Laksaci, A.; Ould-Said, E. Asymptotic Results for an M-estimator of the Regression Function for Quasi-Associated Processes. In Functional Statistics and Applications. Contributions to Statistics; Springer International Publishing: Cham, 2015; pp. 3–28. [Google Scholar] [CrossRef]
  13. Daoudi, H.; Mechab, B.; Chikr Elmezouar, Z. Asymptotic normality of a conditional hazard function estimate in the single index for quasi-associated data. Commun. Stat. Theory Methods 2020, 49, 513–530. [Google Scholar]
  14. Daoudi, H.; Mechab, B. Asymptotic Normality of the Kernel Estimate of Conditional Distribution Function for the quasi-associated data. Pak. J. Stat. Oper. Res. 2019, 15, 999–1015. [Google Scholar] [CrossRef]
  15. Daoudi, H.; Mechab, B.; Benaissa, S.; Rabhi, A. Asymptotic normality of the nonparametric conditional density function estimate with functional variables for the quasi-associated data. Int. J. Stat. Econ. 2019, 20, 94–106. [Google Scholar]
  16. Laksaci, A.; Mechab, W. Nonparametric relative regression for associated random variables. Metron 2016, 74, 75–97. [Google Scholar] [CrossRef]
  17. Allaoui, S.; Bouzebda, S.; Chesneau, C.; Liu, J. Uniform almost sure convergence and asymptotic distribution of the wavelet-based estimators of partial derivatives of multivariate density function under weak dependence. J. Nonparametr. Stat. 2021, 33, 170–196. [Google Scholar] [CrossRef]
  18. Bouaker, I.; Belguerna, A.; Daoudi, H. The consistency of the kernel estimation of the function conditional density for associated data in high-dimensional statistics. J. Sci. Arts 2022, 22, 247–256. [Google Scholar] [CrossRef]
  19. Daoudi, H.; Belguerna, A.; Elmezouar, Z.C.; Alshahrani, F. Conditional Density Kernel Estimation Under Random Censorship for Functional Weak Dependence Data. J. Math. 2025, Article ID 2159604. Available online: https://doi.org/10.1155/jom/2159604.
  20. Daoudi, H.; Elmezouar, Z.C.; Alshahrani, F. Asymptotic Results of Some Conditional Nonparametric Functional Parameters in High Dimensional Associated Data. Mathematics 2023, 11, 4290. [Google Scholar] [CrossRef]
  21. Bouzebda, S.; Laksaci, A.; Mohammedi, M. The k-nearest neighbors method in single index regression model for functional quasi-associated time series data. Rev. Mat. Complut. 2023, 36, 361–391. [Google Scholar] [CrossRef]
  22. Bouzebda, S.; Laksaci, A.; Mohammedi, M. Single Index Regression Model for Functional Quasi-Associated Time Series Data. REVSTAT-Statistical Journal 2023, 20, 605–631. [Google Scholar]
  23. Mechab, W.; Laksaci, A. Nonparametric relative regression for associated random variables. Metron 2016, 74, 75–97. [Google Scholar] [CrossRef]
  24. Daoudi, H.; Mechab, B.; Elmezouar, C.Z. Asymptotic Normality of a Conditional Hazard Function Estimate in the Single Index for Quasi-Associated Data. Commun. Stat. Theory Methods2018, Article ID 1549248.
  25. Tran, L.T. Kernel density estimation on random fields. J. Multivariate Anal. 1990, 34, 37–53. [Google Scholar] [CrossRef]
  26. Dabo-Niang, S.; Yao, A.F. Kernel regression estimation for continuous spatial processes. Math. Methods Statist. 2007, 16, 298–317. [Google Scholar] [CrossRef]
  27. Carbon, M.; Francq, C.; Tran, L.T. Kernel regression estimation for random fields. J. Statist. Plann. Inference 2007, 137, 778–798. [Google Scholar] [CrossRef]
  28. Li, J.; Tran, L.T. Nonparametric estimation of conditional expectation. J. Statist. Plann. Inference 2009, 139, 164–175. [Google Scholar] [CrossRef]
  29. Fix, E.; Hodges, J.L. Discriminatory Analysis, Nonparametric Discrimination: Consistency Properties. Technical Report 4, USAF School of Aviation Medicine, Randolph Field, 1951.
  30. Cover, T.M.; Hart, P.E. Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 1967, 13, 21–27. [Google Scholar] [CrossRef]
  31. Cressie, N.A. Statistics for Spatial Data; Wiley: New York, USA, 1991. [Google Scholar]
  32. Biau, G.; Cadre, B. Nonparametric spatial prediction. Stat. Inference Stoch. Process. 2004, 7, 327–349. [Google Scholar] [CrossRef]
  33. Ferraty, F.; Rabhi, A.; Vieu, P. Estimation non-paramétrique de la fonction de hasard avec variable explicative fonctionnelle. Rev. Roumaine Math. Pures Appl. 2008, 53, 1–18. [Google Scholar]
  34. Masry, E. Recursive probability density estimation for weakly dependent stationary processes. IEEE Trans. Inf. Theory 1986, 32, 254–267. [Google Scholar] [CrossRef]
  35. Gasser, T.; Hall, P.; Presnell, B. Nonparametric estimation of the mode of a distribution of random curves. J. R. Stat. Soc. Ser. B 1998, 60, 681–691. [Google Scholar] [CrossRef]
  36. Ferraty, F.; Vieu, P. Nonparametric Functional Data Analysis: Theory and Practice; Springer Series in Statistics: New York, USA, 2006. [Google Scholar]
  37. Ferraty, F.; Laksaci, A.; Tadj, A.; Vieu, P. Rate of uniform consistency for nonparametric estimates with functional variables. J. Stat. Plann. Inference 2010, 140, 335–352. [Google Scholar] [CrossRef]
  38. Kara-Zaitri, L.; Laksaci, A.; Rachdi, M.; Vieu, P. Uniform in bandwidth consistency for various kernel estimators involving functional data. J. Nonparametr. Stat. 2017, 29, 85–107. [Google Scholar] [CrossRef]
  39. Ferraty, F.; Peuch, A.; Vieu, P. Modèle à indice fonctionnel simple. Comptes Rendus Math. 2003, 336, 1025–1028. [Google Scholar] [CrossRef]
  40. Ezzahrioui, M.; Ould-Saïd, E. Asymptotic Results of a Nonparametric Conditional Quantile Estimator for Functional Time Series. Commun. Stat. Theory Methods 2008, 37, 2735–2759. [Google Scholar] [CrossRef]
  41. Ezzahrioui, M.; Ould-Saïd, E. On the asymptotic properties of a nonparametric estimator of the conditional mode for functional dependent data. J. Nonparametr. Stat. 2008, 20, 3–18. [Google Scholar] [CrossRef]
  42. Laksaci, A. Convergence en moyenne quadratique de l’estimateur à noyau de la densité conditionnelle avec variable explicative fonctionnelle. Ann. Inst. Stat. Univ. Paris 2007, 51, 69–80. [Google Scholar]
  43. Laksaci, A.; Maref, F. Estimation non paramétrique de quantiles conditionnels pour des variables fonctionnelles spatialement dépendantes. Comptes Rendus Math. 2009, 347, 1075–1080. [Google Scholar] [CrossRef]
  44. Doob, J.L. Stochastic Processes; John Wiley and Sons: New York, USA, 1953. [Google Scholar]
  45. Kallabis, R.S.; Neumann, M.H. An exponential inequality under weak dependence. Bernoulli 2006, 12, 333–350. [Google Scholar] [CrossRef]
  46. Bosq, D. Linear Processes in Function Spaces: Theory and Applications; Lecture Notes in Statistics 149; Springer: Berlin/Heidelberg, Germany, 2000. [Google Scholar]
  47. Ramsay, J.O.; Silverman, B.W. Applied Functional Data Analysis: Methods and Case Studies; Springer-Verlag: New York, USA, 2002. [Google Scholar]
  48. Ferraty, F.; Laksaci, A.; Vieu, P. Estimating some characteristics of the conditional distribution in nonparametric functional models. Stat. Inference Stoch. Process. 2006, 9, 47–76. [Google Scholar] [CrossRef]
  49. Ferraty, F.; Laksaci, A.; Vieu, P. Functional time series prediction via conditional mode estimation. C. R. Math. Acad. Sci. Paris 2005, 340, 389–392. [Google Scholar] [CrossRef]
  50. Cardot, H.; Crambes, C.; Sarda, P. Spline estimation of conditional quantiles for functional covariates. C. R. Math. Acad. Sci. Paris 2004, 339, 141–144. [Google Scholar] [CrossRef]
  51. De Gooijer, J.; Gannoun, A. Nonparametric conditional predictive regions for time series. Comput. Statist. Data Anal. 2000, 33, 259–257. [Google Scholar] [CrossRef]
  52. Araujo, A.; Gin, E. The Central Limit Theorem for Real and Banach Valued Random Variables; Wiley Series in Probability and Mathematical Statistics; John Wiley and Sons: New York, NY, USA; Chichester, UK; Brisbane, Australia, 1980; p. xiv+233.
  53. Mohammedi, M.; Bouzebda, S.; Laksaci, A.; Bouanani, O. Asymptotic normality of the k-NN single index regression estimator for functional weak dependence data. Commun. Stat. Theory Methods 2022, 33, 1–26. [Google Scholar] [CrossRef]
  54. Daoudi, H.; Mechab, B.; Belguerna, A. Asymptotic Results of a Conditional Risk Function Estimate for Associated Data Case in High-Dimensional Statistics. In Proceedings of the International Conference on Recent Advances in Mathematics and Informatics (ICRAMI), 2021; Conference Paper. Available online: https://ieeexplore.ieee.org/document/9585904.
Figure 1. Sample of 35 functional covariates A i ( t ) , spatially quasi-associated.
Figure 1. Sample of 35 functional covariates A i ( t ) , spatially quasi-associated.
Preprints 174042 g001
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated