Preprint
Article

This version is not peer-reviewed.

Realized Volatility Forecasting in the Spanish Electricity Market During the 2021–2025 Energy Crisis

A peer-reviewed version of this preprint was published in:
Mathematics 2026, 14(12), 2100. https://doi.org/10.3390/math14122100

Submitted:

07 May 2026

Posted:

08 May 2026

You are already at the latest version

Abstract
This paper analyzes volatility forecasting in the Spanish electricity spot market over the period 2021–2025, characterized by uncertainty, frequent price jumps, and the increasing occurrence of zero and negative prices. To accommodate these features, electricity prices are shifted to ensure welldefined logreturns, and predictable intraday and seasonal patterns are removed using the Ullrich demeaning procedure. Daily realized volatility measures are constructed from highfrequency data, including jumprobust and noiserobust estimators such as Median Realized Volatility and Realized Kernel. A broad set of volatility models, comprising GARCHtype specifications and multiple extensions of the Heterogeneous Autoregressive (HAR) framework, is evaluated using a coherent outofsample forecasting procedure. Model comparison is conducted through the Model Confidence Set methodology based on the QLIKE loss function, which identifies a Superior Set of Models with equal predictive ability. Conditional diagnostics, including OutofSample ROOS2measures and Mincer–Zarnowitz regressions, are subsequently used to characterize forecast accuracy, unbiasedness, and efficiency. The empirical results show that all GARCH models are systematically excluded from the superior set, while HARtype specifications based on realized volatility dominate. Within this set, a HAR model incorporating Median Realized Volatility, jump components, and dayoftheweek effects delivers the strongest economic performance, achieving an OutofSample ROOS 2close to 0.5 with unbiased forecasts. Overall, the findings highlight the importance of longmemory dynamics, discontinuous price movements, and residual weekly seasonality for volatility forecasting in modern electricity markets.
Keywords: 
;  ;  ;  ;  ;  

1. Introduction

The Iberian Electricity Market (MIBEL), established in 2007, has undergone significant structural transformation driven by market liberalization [1], rapid renewable penetration [2,3], and increased exposure to geopolitical and macroeconomic shocks [3,4]. The 2021–2025 period is particularly characterized by the European energy crisis, triggered by the Russia–Ukraine conflict, extreme fuel price fluctuations, and the accelerated integration of wind and solar generation in Spain [5]. These factors have substantially altered price formation in the Spanish electricity spot market, increasing volatility, amplifying extreme price movements, and intensifying the occurrence of zero and negative prices. A key manifestation of this transition is the “duck curve” phenomenon, where periods of high renewable output lead to sharp price collapses [6].
Volatility is a statistical measure of the variability of an asset’s returns over time. It is usually interpreted as an indicator of risk associated with price movements, and is a latent variable [7,8]. Therefore is a central object of analysis in electricity markets, as it directly affects risk management, hedging strategies [9], investment decisions, and regulatory oversight. Unlike traditional financial assets, electricity cannot be stored, and supply and demand must be balanced in real time [10,11]. Prices are driven by weather conditions, renewable intermittency, and regulatory interventions. This generates intraday and seasonal patterns, long-memory dependence, and abrupt jumps, which challenge standard econometric models.
The initial literature on volatility modeling is based on conditional heteroskedasticity models such as ARCH [12] and GARCH [13], along with extensions designed to capture asymmetries, structural breaks, and exogenous effects, including Markov-Switching GARCH [14], GARCH-X, and Taylor–Schwert-type specifications [15,16]. While used in energy markets, these models rely on low-frequency information and squared returns, limiting their ability to fully exploit high-frequency dynamics, especially under strong seasonality and jump behavior.
The generalization of intraday data facilitated the creation of realized volatility metrics, which offer non-parametric estimates of underlying volatility through high-frequency returns [17]. This procedure led to robust estimators designed to address jumps and microstructure noise, including Bipower Variation, Realized Kernel estimators [18,19] and Median Realized Volatility [20]. Within this literature, the Heterogeneous Autoregressive (HAR) model [21] offers a parsimonious representation of long-memory volatility dynamics through the aggregation of daily, weekly, and monthly components. In electricity markets, HAR-type models consistently show strong empirical performance relative to GARCH-based specifications, particularly when augmented with jump and calendar effects [22,23].
Despite these advances, several challenges remain. The presence of zero and negative prices prevents the direct application of logarithmic transformations, requiring appropriate preprocessing such as price shifting. Strong deterministic intraday and seasonal patterns also necessitate demeaning procedures, such as those proposed by [10]. In addition, realized volatility measures are subject to measurement error and jump contamination, which complicates estimation and forecast evaluation. As pointed out by [8], very short-term predictions using realized measures may be biased. Therefore, it is necessary to verify afterward if the models are biased.
Forecast comparison in this context is non-trivial due to model uncertainty, the risk of data-snooping and the complexity of analyzing a latent variable such as volatility. To address this, this study employs a loss based evaluation strategy using the QLIKE function [24], combined with the Model Confidence Set procedure [25], which identifies a set of statistically superior models with equal predictive ability. Additional diagnostic tools, including out-of-sample R2 and Mincer–Zarnowitz regressions, are used to assess forecast properties such as bias and efficiency without affecting model selection.
This paper contributes to the literature by providing a comprehensive out-of-sample comparison of volatility forecasting models for the Spanish electricity spot market during the 2021–2025. The analysis considers a wide range of specifications, including GARCH-type models and HAR-based extensions with jump components, functional transformations, and calendar effects, using high-frequency data and robust realized volatility estimators.
The remainder of the paper is organized as follows. Section 2 describes the data and the main characteristics of the Spanish electricity market. Section 3 presents the volatility estimators, model specifications, and the forecasting evaluation. Section 4 reports and discusses the empirical results of the study. Section 5 concludes this study by summarising the main empirical findings and highlighting their implications for volatility modelling in electricity markets.

2. Data and Spanish Market Characteristics

This work uses hourly price data from January of 2021 to December of 2026 of the Spanish day-ahead electricity market, which is operated by the Spanish Market Operator (OMIE) as part of the Iberian Electricity Market (MIBEL). This market was established through a 2004 agreement between Spain and Portugal and has been functional since 2007. The market was created due to European liberalization reforms introduced by Directive 1996/92/EC and reinforced by Directive 2003/54/EC. These directives required the separation of generation, distribution, and supply activities. In MIBEL, OMIE manages the spot market, the Portuguese Market Operator (OMIP) oversees derivatives trading, and Red Eléctrica de España takes care of system operations.
Despite attempts to integrate, Spain has limited connections with Europe [26]. It is often called an “energy island. which leads to specific market dynamics.
An important structural feature during this period is the rapid growth of renewable energy. According to Red Eléctrica de España [27], renewables made up about 56% of total production in 2025. Wind power remains the main technology, while solar photovoltaic energy has grown rapidly. The increase in renewable generation has changed the market dynamics [2].
Since 2023, the market has seen more zero and negative price hours—events rare earlier—mainly during times of high renewable output and low demand, especially in spring and early summer. This shift has increased intraday volatility, caused abrupt price jumps, and occasional price collapses, making standard volatility models less effective. There is a body of literature suggesting that a higher share of renewable energy leads to an increase in volatility [28,29,30].
Electricity prices also exhibit distinct patterns, including marked intraday, weekly, and seasonal cycles driven by demand and solar profiles; persistent and high volatility and long memory; and exogenous jumps related to weather, renewable intermittency, and policy actions and nonlinear dynamics in prices, limiting traditional modeling approaches [10,31].
To capture these features, intraday prices are aggregated to produce daily realized volatility, based on hourly returns. This approach keeps the information from high-frequency data while supporting daily risk forecasts. Realized volatility is especially suitable because squared daily returns miss latent volatility due to jumps and intraday effects. The high renewable share also increases jump activity, making jump-robust measures like Median Realized Volatility essential for separating persistent volatility from temporary shocks.
The analysis covers the period from 2021 to 2025. The Figure 1, Figure 2, Figure 3 and Figure 4 illustrate the corresponding price dynamics.
Figure 1 shows hourly prices from 2021 to 2025. Prices were stable early in 2021, between 40 and 100 €/MWh. After mid-2021, both levels and variability sharply increased, with spikes over 300–400 €/MWh and occasional drops near 0 €/MWh, indicating increased volatility. In Spain, although renewable penetration mitigated part of the upward pressure, electricity prices remained closely linked to gas-fired generation through the marginal pricing mechanism [32].
In 2022, Figure 2, prices spiked due to the outbreak of the Russia–Ukraine conflict, which triggered a severe supply shock in European gas markets. The sharp reduction in Russian gas flows led to unprecedented increases in gas prices, directly translating into extreme electricity price spikes across EU power markets [3,32]. In OMIE, this period is characterized by abrupt price surges and exceptionally high volatility, with peak prices reflecting both elevated gas marginal costs and heightened risk premia
Figure 3 depicts a partial normalisation in 2023, with prices trending downward. In March and April, prices frequently neared 0 €, driven by more renewables and the gas cap introduced by the Spanish government [33,34]. The moderation observed has also been documented in other European Markets during the same period [35].
This pattern became more pronounced in 2024 (Figure 4), with zero-price episodes increasing, especially in spring. Although prices recovered drops to zero remained common.
Figure 4 is the hourly prices of 2024, where the frequency of zero-price episodes rose significantly, particularly in spring. Despite some price recovery in the second half of the year, the market continued to exhibit recurrent drops to zero.
By 2025, Figure 5, the downtrend continued, with zero-price hours increasing and negative prices appearing more frequently. Prices stabilized around 100 €/MWh toward year’s end, but high variance and recurring zeros persisted. High volatility during the 2024–2025 could be attributed to geopolitical tensions possibly disruptions of the production and growing share of renewable energy production [9,35,36].
Overall, the Spanish electricity market from 2021 to 2025 presents a highly complex environment for volatility modeling due to extreme price swings, renewable growth, regulatory actions, and negative prices, offering a rigorous test for forecasting methods amid elevated uncertainty. To better understand the evolution of these market dynamics, Figure 6 illustrates two defining patters in price behavior.
Figure 6 shows average hourly spot electricity prices (€/MWh) by hour of the day for the period 2021–2025, highlighting a marked transformation in intraday price patterns in the Spanish electricity market. A clear “duck curve” shape emerges in the most recent years, particularly from 2024 onward.
During 2021, the intraday profile is relatively flat, with moderate night–day price differentials and limited midday depression, reflecting pre crisis conditions and lower renewable penetration. In 2022, prices increase sharply across all hours, and the duck curve structure is largely obscured by crisis driven price levels. Elevated gas prices during the peak of the European energy crisis dominate price formation, compressing intraday variability upward.
From 2023 onward, and especially in 2024–2025, the intraday pattern becomes increasingly pronounced. Prices fall sharply during midday hours and recover rapidly toward the evening peak. This corresponds to the canonical duck curve, where high solar generation depresses midday prices (the “belly” of the duck), followed by steep price ramps in the late afternoon and evening as solar output declines and demand rises. The depth of the midday trough and the steepness of the evening ramp intensify over time, indicating a growing influence of variable renewable generation on price formation.
Overall, Figure 6 illustrates a shift from crisis-dominated price levels in 2022 to a post-crisis regime characterised by structural intraday heterogeneity driven by renewable intermittency. While average price levels moderate after the energy crisis, the amplification of the duck curve pattern implies greater intraday volatility and sharper ramps, reinforcing the need for volatility models capable of capturing time of day effects, jumps, and persistent heteroskedasticity in modern electricity markets.

3. Volatility Modeling Methodology

This section describes the methodology used to model and forecast volatility in the Spanish electricity market. The proposed procedure is organized around four key components. First, classic volatility models are introduced as benchmark specifications. Second, realized volatility measures constructed from intraday data are defined to approximate latent volatility. Third, specific data pre-processing steps are applied to address features unique to electricity prices, such as zero or negative values and strong intraday seasonality. Finally, a rich class of HAR-type realized-volatility models is presented, incorporating heterogeneous persistence, jumps, robust estimators, functional transformations, and calendar effects.

3.1. GARCH Models

Autoregressive Conditional Heteroskedasticy (ARCH) [12] and Generalized Autoregressive Conditional Heteroskedasticty (GARCH) [13] volatility models constitute a common benchmark for modeling and forecasting conditional variance in financial and commodity markets. These models treat volatility as a latent process driven by past returns and innovations, without explicitly exploiting intraday price information. The seminal contribution of Engle [12] introduced the Autoregressive Conditional Heteroskedasticity (ARCH), later generalized by Bollerslev [13] through the GARCH model to allow for a more parsimonious representation of volatility persistence.
We consider the daily log-returns defined as r t = l o g ( P t P t 1 )   . We assume the mean follows an ARMA(0,0) process, it remains constant over time ( μ t ). This differs from an ARMA-GARCH model, where the conditional mean varies dynamically. In this paper, we focus solely on volatility dynamics, considering the mean as fixed intercept:
r t   =   μ t   +   e t
where μ is a constant and the innovation e t   follows
e t = h t   z t ,
with z t being an i.i.d. sequence with zero mean and unit variance, typically assumed to follow a Gaussian or Student- t distribution.

3.1.1. Classic GARCH Model

The conditional variance σ t 2 = V a r ( e t F t 1 ) , given the information set F t 1 , evolves according to:
σ t 2 = α 0 + i = 1 s α i e ( t i ) 2 + [ j = 1 ] r β j σ ( t j ) 2
α 0 > 0 : It constitutes the constant term, symbolizing the baseline or average level of long-term volatility.
α i 0 : They are the ARCH coefficients, which model the immediate effect of the residuals.
β j 0 : These are the GARCH coefficients, which incorporate the moving average component for the conditional variance.
( i = 1 s ( α i ) + j = 1 r ( B j ) ) < 1 This is the variance persistence. If it equals 1, it indicates infinite persistence in variance, suggesting an IGARCH [37] model should be used. However, if the values are close to 1, it could also imply a change in the mean value, requiring the use of different models [38].
Equation (2) presents the conditional variance. However, [7] explains, the classical GARCH model assumes symmetric effects of shocks, which is unrealistic because empirically most cases involve asymmetric effects.

3.1.2. Absolute Value GARCH (AVGARCH)

Absolute Value GARCH, also called Taylor-Schwert GARCH proposed by [15,16] is a variant of the GARCH model defined as:
σ t = α 0 + i = 1 p α i | ( e t - i ) | + j = 1 q β j σ t - j
This GARCH model characterizes the conditional standard deviation utilizing the absolute value of residuals. The principal objective of this model is to reduce the influence of extreme or atypical observations [39].
Using the same restrictions as the classical GARCH model.
α 0 > 0 : It constitutes the constant term, symbolizing the baseline or average level of long-term volatility.
α i 0 : They are the ARCH coefficients, which model the immediate effect of the residuals.
β j 0 : These are the GARCH coefficients, which incorporate the moving average component for the conditional variance.

3.1.3. GARCH-X

The GARCH-X model is an extension of the classical GARCH that incorporates exogenous variables (Xt) into the variance equations as follows. While standard GARCH relies solely on internal history, GARCH-X allows external factors to influence volatility predictions. So the GARCH-X (s,r,k) is defined as:
σ t 2 = α 0 + i = 1 s α i e ( t i ) 2 + [ j = 1 ] r β j σ ( t j ) 2 + p = 1 k γ X t p
α 0 > 0 : It constitutes the constant term, symbolizing the baseline or average level of long-term volatility.
α i 0 : They are the ARCH coefficients, which model the immediate effect of the residuals.
β j 0 : These are the GARCH coefficients, which incorporate the moving average component for the conditional variance.
γ 0 it captures the impact of exogenous variables.
While the GARCH-X can accommodate a vector of k covariates, this study focuses on a single exogenous variable: Realized Volatility (RV). This choice is driven by the ‘Information Hypothesis,’ as RV leverages high-frequency intra-day data to offer a more accurate and observable measure of price variation, unlike squared daily returns, which are noisier proxies for volatility.
GARCH models have been extensively applied in energy and electricity markets to capture time-varying volatility and volatility clustering (see, among others [30,37]). However, empirical evidence suggests that their forecasting performance is often constrained by the exclusive reliance on daily returns, which discard a substantial amount of information contained in intraday price movements. This limitation is particularly relevant in electricity markets, where prices are characterized by strong intraday seasonality, frequent jumps, and extreme observations.
These shortcomings are especially pronounced in the Spanish electricity market during the 2021–2025 period, marked by high renewable penetration, increased jump activity, and the emergence of zero and negative prices. As a result, daily return-based volatility estimates may provide a noisy and incomplete representation of the underlying volatility process. For this reason, GARCH-type models are employed in this study primarily as benchmark specifications, against which the performance of realized-volatility-based and HAR-type models is evaluated within a unified out-of-sample comparison procedure.
While return-based GARCH models rely exclusively on daily price changes and treat volatility as a latent process, realized-volatility-based approaches explicitly exploit intraday information, allowing volatility to be measured rather than inferred. This informational advantage motivates the use of HAR-type models built on realized volatility measures, which are introduced in the next section.

3.2. Realized Volatility Measures

Intraday data availability across financial markets has grown, exposing volatility patterns that classical models like GARCH or stochastic volatility often miss, especially when trying to capture the persistent and long-memory traits seen in realized volatility [17]. To overcome this, the Heterogeneous Autoregressive model of Realized Volatility (HAR-RV) was introduced by [21].
d p t = u t d t + σ t d W t + k t d q t
Let the logarithmic price of an asset follow a jump-diffusion process over time, with the dynamics driven by both a continuous diffusion component and a discrete jump component. In this model, the drift term is represented by ut. σ t is a stochastic volatility process independent of the W t standard Brownian motion, k t and denotes the jump size in log prices.
Realized Volatility exhibits some unique traits, including notable autocorrelations at long lags, indicating long-range dependence. It also shows a leverage effect, where volatility bursts are more likely following negative returns and jumps [3].
The volatility of the price process at day t is measured by quadratic variation ( i n t e g r a t e d   v a r i a n c e   a s s o c i a t e d ,   I V t ) :
I V t = t t + 1 σ s 2 d s + t < s < t + 1 k s 2
Being σ s 2 the contribution of the continuous sample path. And t < s < t + 1 k s 2 is the contributions of the jumps.
However, it is not directly observable, so an estimator is needed [41]. If there were no noise due to microstructure, the Realized Variance would be the best estimator. In the presence of noise, several robust estimators are available. Therefore, in this work, we have estimated the classic Realized Variance and also used robust estimators, as described below.

3.2.1. Realized Volatility

Realized volatility is then as follows:
R V t = j = 1 M r t , j 2
As the sampling frequency increases, R V t converges in probability to the integrated variance of the continuous-time price process [17]. Empirically, RV has been shown to substantially outperform squared daily returns as a proxy for latent volatility. However, RV is sensitive to jumps and microstructure noise [18,20], which may dominate volatility dynamics in electricity markets characterized by abrupt price movements and extreme observations.

3.2.2. Noise-Robust Realized Volatility Measures

The Kernel Realized Variance, proposed in [19], is defined as:
R K t = γ 0 , t + h = 1 H K ( h 1 H ) ( γ h , t + γ h , t )
where γ h , t denotes intraday return autocovariances of order h on day t:
γ h , t =   j = h + 1 M r t , j r t , j h
K ( ) is a kernel weighting function, and H is a bandwidth parameter controlling the number of autocovariances included in the estimation. By aggregating both positive and negative return autocovariances through a spectral weighting scheme, the Kernel Realized Variance estimator corrects for the impact of microstructure noise contaminating raw high-frequency returns.
The Kernel Realized Variance estimator provides a robust benchmark. Its inclusion allows us to assess whether volatility dynamics and forecasting performance are sensitive to alternative noise-robust volatility proxies. In this work, we only utilize the Bartlett Kernel, defined by K(x) = 1 − x for 0 x 1 .

3.2.3. Jump-Robust Realized Volatility Measures

To address the sensitivity of standard Realized Variance to discontinuities in the price process, the literature has developed alternative estimators that are robust to jumps. Among these, Median Realized Volatility (MedRV) is used due to its robustness and ease of implementation [20].
Median Realized Volatility is a jump-robust estimator defined as
M e d R V t = π 6 4 3 + π M M 2 i = 2 M 1 [ m e d ( r t , i 1 , r t , i , r t , i + 1 ) ] 2
where m e d ( ) denotes the median operator. [20] show that MedRV is consistent for the integrated variance under a wide class of jump-diffusion processes and effectively filters extreme intraday returns associated with discontinuous price changes.
By construction, MedRV isolates the continuous component of quadratic variation, thereby providing a more reliable measure of persistent volatility dynamics in the presence of jumps. This property is particularly important in electricity markets, where price formation is frequently affected by abrupt supply-demand imbalances, renewable intermittency, and system constraints.
In the Spanish electricity market over the 2021–2025 period, high renewable penetration has been associated with an increased incidence of price spikes and outliers. In such an environment, jump-robust estimators such as MedRV offer a more accurate representation of the underlying continuous volatility component that drives persistence and predictability.

3.3. Data Pre-Processing: Price Shift and Deaming Returns

Electricity markets occasionally exhibit zero or negative prices, which makes impossible todo the logarithm. We apply a price displacement (shift) to ensure that all values are positive
P t , j s h i f t = P t , j + λ
where λ = 1 m i n ( P ) ensures that the minimum price is equal to 1. This linear transformation allows applying the log-return calculations, While the price displacement ensures that P t , j s h i f t > 0 , allowing for the application of logarithmic return calculations. It is important to acknowledge that this linear transformation could introduce a specific degree of measurement bias. By shifting the price level, the returns are altered, particularly during periods of low absolute prices where λ the constant represents a larger proportion of the total value. However, in the context of the Spanish Electricity Market, this bias is considered a necessary trade-off to address the numerical instability caused by zero or negative prices, which would otherwise lead to undefined log-returns and extreme values. To get the daily prices, we calculated the average for each day.
In addition, predictable intraday and seasonal patterns driven by demand cycles and renewable production may distort volatility estimates. To isolate stochastic price movements, intraday returns are demeaned following [10].
In the work of [10] it is used to demean the month of the year, day of the week, and half-hour of the day. In the MIBEL market, spot prices are recorded hourly until October 2025. Therefore, instead of using half-hour periods, the data is adjusted using hourly values [42].
μ t , j ^ = m e d ( r m m , d y , h r )
We calculate the median return for the day t in month mm, on day of the week dy and hour j.
Then the demeaning returns are:
r t , j * = r t , j μ t , j ^
All realized volatility measures are computed using the adjusted returns r t , j * , ensuring that volatility is constructed on a stable and economically meaningful return series and reflects genuine uncertainty rather than deterministic price patterns. These preprocessed volatility measures are used as inputs to HAR-type models capturing heterogeneous volatility persistence across multiple time horizons.

3.4. HAR Volatility Models

To model the realized volatility, we employ Heterogenous Autoregressive (HAR) specifications, which characterize volatility persistence through linear combinations of realized volatility components aggregated over multiple time horizons. Introduced by Corsi [21], the Heterogeneous Autoregressive (HAR) model provides a parsimonious linear approximation to long-memory behavior by combining daily, weekly, and monthly realized volatility measures, without imposing explicit fractional integration.
Let R V t denote a realized volatility measure for day t , constructed from preprocessed intraday returns as described in Section 3.2. The baseline HAR-RV model serves as the reference specification and captures heterogeneous volatility persistence across short-, medium-, and long-run horizons.
Building on the general jump-diffusion representation of the price process, the HAR model can be extended to explicitly account for discontinuous price movements, giving rise to HAR-J models that separate persistent volatility dynamics from transitory jump-induced variation. In addition, since realized volatility measures are subject to time-varying measurement error, the HAR specification is further augmented using realized quarticity, leading to HAR-Q models. These extensions are jointly combined in the HAR-QJ specification, which simultaneously captures heterogeneous persistence, jump dynamics, and measurement uncertainty.
Finally, electricity markets exhibit pronounced calendar effects, particularly day-of-the-week patterns in volatility arising from systematic demand cycles, generation constraints, and institutional market arrangements. To account for residual seasonal heteroskedasticity, HAR-type models are further augmented with weekday dummy variables, allowing calendar effects to be modeled jointly with volatility persistence, jump behavior, and time-varying measurement precision within a unified and internally consistent framework.

3.4.1. HAR-RV Model

The baseline HAR-RV model is specified as
R V t = β 0 + β 1 R V t 1 d + β 2 R V t 1 w + β 3 R V t 1 m + ε t
where R V t d = R V t 1 denotes daily realized volatility. The weekly and monthly components are defined as
    R V t w = 1 7 i = 0 6 R V t i
    R V t m = 1 30 i = 0 29 R V t i
Corresponding to 7-day and 30-day moving averages of realized volatility.
The coefficients β 1 , β 2 , and β 3 measure volatility persistence across short-, medium-, and long-run horizons, respectively. By construction, the HAR model provides a parsimonious linear approximation to long-memory behavior by combining realized volatility components observed at multiple aggregation scales, without imposing explicit fractional integration. This representation is well suited to environments characterized by persistent volatility dynamics and recurring regime changes.

3.4.2. HAR-Q Model

Let I V t denote the latent daily volatility given by quadratic variation (Equation (7)). The realized variance R V t , constructed from intraday returns, provides a consistent but noisy estimator of I V t , which can be expressed as:
    R V t = I V t + η t
where η t denotes the measurement error. In finite samples, this error arises from discretization, heavy-tailed intraday returns, and extreme price movements, and its variance is time-varying, being larger during periods of intense intraday activity.
The conditional variability of the measurement error depends on the fourth moment of intraday returns. This motivates the use of Realized Quarticity (RQ) as a proxy for time-varying measurement uncertainty, defined as:
    R Q t =   M 3 i = 1 M   r t , i 4
where r t , i denotes intraday returns and M is the number of intraday observations.
The HAR-Q model [43] incorporates realized quarticity to adjust the predictive contribution of realized volatility components according to their measurement precision. It is specified as:
R V t =   β 0 + ( β 1 + β 1 Q R Q t 1 ( d ) 1 2 ) R V t 1 d +   ( β 2 + β 2 Q R Q t 1 ( w ) 1 2 ) R V t w + ( β 3 + β 3 Q R Q t 1 ( m ) 1 2 ) R V t 1 m +   ε t
where R Q t 1 ( d ) , R Q t 1 ( w ) , and R Q t 1 ( m ) denote daily, weekly, and monthly aggregated quarticity measures. High quarticity values correspond to low-precision volatility estimates and therefore lead to an endogenous down-weighting of realized volatility components, improving estimation efficiency and forecasting accuracy [41,42].

3.4.3. HAR-J Model

Electricity prices are characterized by frequent discontinuous movements, renewable intermittency, weather shocks, and regulatory or market interventions [2,28,36]. From a continuous-time perspective, these features imply that the price process cannot be adequately described by a pure diffusion and must allow for jumps. Consequently, the total daily quadratic variation of prices contains both a continuous component and a jump component:
    R V t = C t + J t
Continuous component ( C t ) : Let R V t denote the realized variance obtained from intraday returns. In the presence of jumps, R V t captures both continuous and discontinuous variation. To isolate the continuous component of quadratic variation, we use the Bipower Variation (BPV) estimator proposed by [18], defined as:
    B P V t =   π 2 ( M M i ) i = 1 M i   | r t , i | | r t , i + 1 |
where r t , i are intraday returns, M is the number of intraday observations, and i is a lag parameter.
Under standard regularity conditions, B P V t is a consistent estimator of the integrated variance associated with the continuous diffusion component, even in the presence of jumps. Intuitively, the product of adjacent absolute returns eliminates the dominant impact of isolated large jumps, which tend to inflate squared returns disproportionately
Jump identification: Although both R V t and B P V t   are observable, a statistical procedure is required to determine whether their difference reflects jump activity or sampling variability. This is achieved through the Z t proposed by [46]:
    Z t =   ( R V t B P V t ) / R V t ( ( π 2 ) 2 + π 5 ) 1 M m a x ( 1 , T P Q t B P V t 2 )
Being TPQ the tripower quarticity for day t:
    T P Q t =   1 4 [ Γ ( 7 / 6 ) Γ ( 1 / 2 ) ] 3 M 2 M 2 i j = 1 M 2 i | r t , j | 4 / 3 | r t , j + i | 4 / 3 | r t , j + 2 i | 4 / 3
Which provides a robust estimator of the integrated fourth moment of intraday returns. Tripower Quarticity accounts for time-varying volatility and heavy-tailed return distributions and is used to consistently estimate the asymptotic variance of the bipower variation. Here, Γ ( ) denotes the Gamma function.
Under the null hypothesis of no jumps and as M , the standardized test statistic satisfies [44]
  Z t d N ( 0,1 ) ,
Significant jumps are identified when the Z t statistic exceeds a standard normal critical value Φ α [44]. In this study, following the recommendations of [10]. In our empirical pipeline, we set alpha = 0.01, representing a 99% confidence level, with a critical value of Φ 0.01 = 2.326. Consequently, the daily jump component is defined as:
    J t =   I ( Z t > Φ α ) ( R V t B P V t )
where I ( ) denotes the indicator function. This construction ensures that only statistically significant deviations between realized variance and bipower variation are attributed to jumps, while small discrepancies are treated as estimation noise.
Following [10] we increase the lag (i = 3) for the Bipower variation and Tripower quarticity to improve the performance of the test. In our case we use the Jumps for 1 day.
The HAR-J model is defined as:
R V t =   β 0 +   β 1 R V t 1 d +   β 2 R V t 1 w +   β 3 R V t 1 m + β 4 J t 1 +   ε t

3.4.4. HAR-QJ Model

The HAR-QJ model combines the two previous extensions by jointly accounting for measurement error and jump dynamics. This specification is particularly relevant in electricity markets, where realized volatility is affected simultaneously by intraday noise and frequent discontinuities.
The HAR-QJ model jointly accounts for time-varying measurement error and jump dynamics
R V t = β 0 + ( β 1 + β 1 Q R Q t 1 ( d ) ) R V t 1 d + ( β 2 + β 2 Q R Q t 1 ( w ) ) R V t 1 w + ( β 3 + β 3 Q R Q t 1 ( m ) ) R V t 1 m + β 4 J t 1 + ε t .
By construction, HAR-QJ allows volatility persistence to be estimated conditionally on both time-varying measurement precision and jump activity, yielding a more robust representation of realized volatility dynamics.

3.4.5. HAR-C and HAR-CJ Models

Following [11], we decompose the realized variance into its continuous and jump components to assess their individual predictive powers. We define the jump-adjusted continuous component Ct as RVt when no significant jump is detected, and as BPV otherwise. This leads to the HAR-C (Continuous) and HAR-CJ (Continuous-Jump) models:
C t = { R V t , if   no   jump   is   detected , B P V t , otherwise .
This leads to HAR-C model,
R V t = β 0 + β 1 C t 1 d + β 2 C t 1 w + β 3 C t 1 m + ε t
And the HAR-CJ model,
R V t = β 0 + β 1 C t 1 d + β 2 C t 1 w + β 3 C t 1 m + β 4 J t 1 + ε t

3.4.6. HAR Median and HAR Kernel Model and Transformations

To assess the role of robustness in volatility forecasting, the HAR model is further extended by replacing standard realized variance with jump-robust and noise-robust volatility estimators.
E s t t { MedRV t ,   RK t }
where MedRV denotes Median Realized Volatility and RK, Kernel Realized Volatility.
HAR-M (Median) and HAR-K (Kernel):
R V t =   β 0 +   β 1 E s t t 1 d +   β 2 E s t t 1 w +   β 3 E s t t 1 m +   ε t
where E s t t d , E s t t w , and E s t t m denote daily, weekly, and monthly aggregated components of the alternative realized volatility estimator
HAR-MJ and HAR-KJ:
R V t =   β 0 +   β 1 E s t t 1 d +   β 2 E s t t 1 w +   β 3 E s t t 1 m +   β 4 J t 1 + ε t
where J t denotes the jump component identified using bipower variation and jump tests.
Given the heavy-tailed nature of electricity price volatility and the frequent occurrence of extreme observations, all HAR-type models are estimated under alternative functional transformations of realized volatility. Specifically, model estimation is performed using R V t , l o g ( R V t ) , and R V t , corresponding to level, logarithmic, and square-root transformations, respectively. These transformations are employed to stabilize variance, reduce the influence of outliers, and enhance numerical stability and out-of-sample forecasting performance.

3.4.7. HAR Models with Day-of-the-Week Effects

Electricity markets are heavily influenced by the day of the week, as both consumption and prices exhibit a pronounced weekly seasonal pattern. Following the approach in [22], we incorporate day-of-the-week dummy variables to examine their specific impact on realized variance. This is a crucial step because, while the demeaning of returns (Equations (13) and (14)) accounts for deterministic patterns in the mean price level, it does not address the seasonal heteroskedasticity inherent in the variance process.
R V t = H A R m o d e l + i = 2 7 φ i D i , t + ε t
where HAR model denotes any of the HAR specifications introduced in Section 3.4.1, Section 3.4.2, Section 3.4.3, Section 3.4.4, Section 3.4.5 and Section 3.4.6.
Let D i , t denote indicator variables defined as:
D i , t = { 1 , if   day   t   corresponds   to   weekday   i , 0 , otherwise ,           ;     i = 2 , , 7 ,
Monday serves as the reference category. Since the electricity market operates continuously (24/7), we include six dummy variables to represent the remaining days of the week. This seasonal adjustment is implemented across all HAR specifications and functional transformations previously described.
The coefficients ϕ measure systematic deviations in realized volatility on weekday I relative to Monday, conditional on the information contained in the HAR dynamics. A statistically significant ϕ indicates the presence of weekly seasonal heteroskedasticity.
Importantly, the inclusion of weekday dummies does not alter the dynamic structure of the HAR model but augments it with deterministic regressors that capture residual calendar regularities in the variance process. This allows volatility persistence, jump dynamics, measurement error, and calendar effects to be modeled jointly within a unified linear framework.

3.5. Out of Sample Evaluation and Statistical Model Selection

The models are estimated by using an expanding window approach to evaluate their predictive accuracy. The initial estimation subsample (In-Sample) spans from 2021-01-02 to 2024-12-31, while the entire year of 2025 is reserved for out-of-sample (OOS) evaluation. One day-ahead forecasts are generated iteratively, with the estimation window expanding after each prediction to incorporate the most recent observation. Model parameters are re-estimated at each step, ensuring that forecasts are based on all available information up to the prediction date.
Figure 7 provides an overview of the out-of-sample evaluation and statistical model selection procedure. Out-of-sample volatility forecasts are evaluated and compared using a multi-stage procedure designed to ensure statistical coherence and robustness. Model selection is based exclusively on a loss-based criterion within the Model Confidence Set (MCS). Specifically, forecasts are evaluated using the QLIKE loss function, and the MCS procedure yields a Superior Set of Models (SSM) satisfying the Equal Predictive Ability hypothesis [25]. Conditional on SSM membership, additional diagnostics—namely the out-of-sample R2 and the Mincer–Zarnowitz regression—are employed to assess forecast accuracy and bias. These diagnostics are purely descriptive and do not influence the loss-based model selection.

3.5.1. Loss-Based Model Comparison: MCS with QLIKE

Model Confidence Set (MCS)
The Model Confidence Set proposed by [25] provides a statistically approach for jointly comparing multiple forecasting models and identifying a subset of specifications that are not statistically dominated under a common loss function. Let { L i , t } t = 1 T denote the sequence of out-of-sample losses generated by model i M 0 , where M 0 is the initial model set. For each pair of models ( i | j ) , define the loss differential as:
d i j , t = L i , t L j , t
with expected value E [ d i j , t ] . The null hypothesis of Equal Predictive Ability (EPA) for the model set M M 0 is given by
H 0 , M :   E d i j , t = 0 , i , j M .
Rejection of the null hypothesis implies that at least one model in the set exhibits inferior predictive performance. The MCS procedure sequentially eliminates the model associated with the largest standardized average loss until the EPA hypothesis can no longer be rejected.
T MAX = m a x i M ( d ˉ i V a r ^ ( d ˉ i ) ) ,
where
d ˉ i = 1 M 1 j M j i d ˉ i j
Denotes the average differential loss of model i relative to all remaining competitors, and V a r ^ ( d ˉ i ) is a consistent estimator of its variance. Statistical inference is conducted using a moving block bootstrap to account for serial dependence in the loss differentials. The output of the procedure is the Superior Set of Models (SSM), defined as a subset of models for which the EPA hypothesis cannot be rejected at the chosen significance level. In this application, the Model Confidence Set enables a statistically coherent comparison of volatility forecasting models estimated under different information sets, functional transformations, and volatility proxies, while avoiding ad-hoc pairwise testing and data-snooping biases.
QLIKE Loss Function
The error function we use is QLIKE, an asymmetrical metric that penalizes underprediction more than overprediction [24,45]. We use realized volatility as a proxy because it is less noisy than squared returns, although QLIKE itself is a robust error measure that handles noisy proxies well [24].
Q L I K E = ln ( h t ) + R V t h t
The choice of QLIKE is motivated by properties in volatility forecasting. First, QLIKE is robust to measurement error in volatility proxies and remains consistent for ranking competing forecasts even when the realized measure is noisy or affected by jumps and microstructure effects [24].
Second, QLIKE is intrinsically asymmetric, imposing a larger penalty on volatility underprediction than on over-prediction. This feature has a clear economic interpretation in electricity markets, where underestimating volatility entails asymmetric costs related to risk management, such as insufficient collateral requirements, mispricing of derivatives, and exposure to extreme price movements. Given the empirical characteristics of electricity prices, frequent jumps, heavy tails, and pronounced volatility clustering, this asymmetry renders QLIKE particularly appropriate.
Importantly, the use of QLIKE ensures coherence between the forecast evaluation metric and the MCS testing, as the loss differentials entering the EPA hypothesis are computed using a strictly consistent volatility scoring rule. Consequently, model selection decisions within the MCS are driven by robust loss differences.

3.5.2. Forecast Evaluation Within the Superior Set of Models

The MCS procedure yields a Superior Set of Models (SSM) containing volatility forecasting specifications that are statistically indistinguishable in terms of expected QLIKE loss. Inclusion in the SSM reflects loss-based equivalence rather than absolute forecast dominance. Consequently, additional performance measures are employed conditionally on SSM membership, with the objective of characterizing forecast accuracy and statistical optimality rather than overriding the loss-based selection mechanism.
R2 Out-of-Sample
The MCS process yields a set of models that are predictively the best, although statistically they are indistinguishable. Hence, we use the Out-of-sample R2 ( R O O S 2 ) and the Mincer-Zarnowitz regression to identify the best, unbiased model from this set.
R2 Out-Of-Sample proposed by [48] is:
R O O S 2 = 1 t = 1 T ( R V t h t ) 2 t = 1 T ( R V t R V ¯ m e a n , t ) 2
where h t is the prediction of the model for the day t and R V ¯ m e a n , t is the mean value at the day t. Therefore, with this R2, we assess if the prediction outperforms the average value. This measure assesses economic forecast gains relative to a naïve benchmark and is used exclusively to compare models within the SSM, not to determine statistical dominance.
Mincer-Zarnowitz Regression
We use the proposed Mincer-Zarnowitz regression [49]. Using this particular regression we can check if the predictions are unbiased and efficient or not [50]. The Mincer-Zarnowitz regression is defined as:
R V t =   α + β 1 h t + ε t
where h t is the prediction of the model for the day t.
We do a Joint Hypothesis Test H0: α = 0 ,   β 1 = 1 : This test determines if the forecast is perfectly unbiased. α = 0 ensures no systematic constant bias (intercept at the origin   β 1 = 1 . Ensures the forecast reacts correctly to the scale of volatility. As with the R OOS 2 , the Mincer–Zarnowitz regression is applied conditionally on SSM membership, ensuring that forecast interpretation remains consistent with the loss-based model selection strategy.

4. Results and Discussion

This section reports the results of the out-of-sample volatility forecasting exercise based on the hierarchical evaluation procedure described in Section 3.5. Model selection is first carried out using the Model Confidence Set (MCS) methodology under the QLIKE loss function, which identifies the Superior Set of Models (SSM) satisfying Equal Predictive Ability. Conditional on SSM membership, forecast performance is further characterized using the out-of-sample coefficient of determination R OOS   2 and Mincer–Zarnowitz regressions, which assess forecast accuracy, unbiasedness, and scale efficiency without affecting the loss-based selection.
Figure 8 illustrates the impact of the price displacement (shift) transformation. As observed, the transformation strictly performs a linear shift of the price levels, while preserving the series’ temporal structure and original dynamics. The daily returns series illustrates a progressive increase in market volatility over time. A surge is observed starting in 2023, with fluctuations becoming particularly pronounced throughout 2024 and 2025. This trend can be attributed to the increased penetration of renewable energy in the total power mix, as well as heightened geopolitical and political uncertainty.
This volatility could be closely linked to the European Union’s (EU) recent efforts to accelerate its decarbonization agenda. By implementing stricter emissions regulations and shifting toward decentralized energy sources, the EU has triggered a structural transformation of the energy market. While these policies aim for long-term sustainability, the transition phase has introduced significant short-term instability and price sensitivity across the sector.
And the realized volatility tells a story of two distinct regimes. Prior to 2022, the series had only occasional spikes. The energy shock of 2022 marks a clear structural break causing more spikes and a higher mean volatility level.
Table 1 shows that although the minimum value changes from −15 to 1, the standard deviation stays the same as the original series. This is important because variance, which is key to our modeling, does not change with linear shifts. Therefore, this adjustment does not create substantial bias in the volatility analysis.
Additionally, alternative approaches like excluding negative observations—which make up 1.84% of our sample—were not used because removing data points could break the time-series continuity and may distort the long-memory features of volatility. Although more complex non-linear transformations are available (such as those suggested by [51]), the selected adjustment preserves interpretability and numerical stability, ensuring that the estimated realized variance accurately captures the market’s original fluctuations without losing information. Applying the price displacement (shift) removes negative and near-zero values, allowing for stable calculation of logarithmic returns.
Table 2 shows descriptive statistics for two series: Raw, which includes intraday log-returns with shifted prices, and Ullrich, which uses the demeaning method outlined in Equation (11).
Consistent with existing research on high-frequency energy data, neither series follows a normal distribution, with high kurtosis and notable skewness. When comparing them, the Ullrich adjustment slightly raises the mean, while the median remains exactly zero. This is an important result, as it shows that the demeaning process effectively centered the returns, isolating idiosyncratic price changes from hourly and seasonal patterns.
Additionally, the Ullrich-adjusted shows a marked decrease in standard deviation, as evidenced by the narrower gap between its minimum and maximum values compared to the Raw series. This lowered dispersion indicates that much of the ‘raw’ volatility was due to predictable seasonal patterns. By eliminating these deterministic factors, the resulting realized volatility better reflects the actual stochastic uncertainty in the MIBEL market.
Table 3 presents the descriptive statistics for the daily realized variance estimators Realized Variance (RV), Median Realized Variance (MedRV), and Kernel Realized Variance, using Bartlett estimator (KRV).
The mean and median values differ considerably across the three measures. The MedRV exhibits the lowest central tendency, which can be attributed to its robustness against outliers and jumps. While the standard RV is inflated by price spikes (jumps), the Median transformation “smooths” the series, providing a more stable estimate of the continuous volatility component.
In contrast, the KRV using the Bartlett kernel shows the highest mean and median values. This indicates that the kernel does not reduce central tendency; instead, it captures a higher level of total variance by accounting for the microstructure noise and autocovariances inherent in hourly price data.
Regarding dispersion, the MedRV successfully reduces the standard deviation, whereas the KRV shows an increase, indicating higher variability in its daily estimates. However, a significant improvement is observed in the “shape” of the distributions:
In terms of skewness, the MedRV shows only a slight improvement over the raw series. However, the KRV significantly reduces skewness, approaching a more symmetric distribution.
The Kernel Realized Volatility kurtosis achieves the lowest excess kurtosis among all estimators, moving closer to a more mesokurtic distribution.
In summary, we observe a technical trade-off. The Kernel estimator shows a “worsening” in terms of central tendency and dispersion (higher and more volatile values), but a clear “improvement” in its distributional shape, producing a series that behaves more like a Normal distribution. On the other hand, the Median Realized Volatility acts as a global smoother, reducing noise and the impact of jumps across all statistical fields.
Figure 9 shows the three estimators over the full sample period. The top panel displays relatively stable and low values during 2021, with isolated spikes appearing in 2022. From 2023 onwards, both the frequency and magnitude of these spikes increase, reaching the highest values in the sample around early 2025.
The middle panel follows the same general pattern but with noticeably smoother fluctuations. The extreme values visible in the RV series are attenuated here, which reflects the median estimator’s ability to reduce the influence of outlying observations. This makes it easier to identify the underlying trend in volatility over time.
The bottom panel tends to record higher values than both RV and MedRV throughout the sample, and its spikes are more pronounced. This is consistent with the descriptive statistics reported earlier, where KRV showed the highest mean and maximum values across the sample. The difference in magnitude relative to the other two estimators becomes more evident from 2023 onward, when market conditions appear to have become more volatile.
The divergence across estimators is worth noting. While all three capture the same broad volatility dynamics, their sensitivity to extreme observations differs considerably. Realized volatility and Kernel are more reactive to large individual returns, whereas Median provides a more stable signal. In practice, this suggests that the choice of estimator may have non-trivial implications for downstream applications such as volatility forecasting or risk measurement, particularly during turbulent market periods.
Table 4 reports the Superior Set of Models (SSM) identified by the Model Confidence Set procedure based on the QLIKE loss function. All specifications retained in the SSM are statistically indistinguishable in terms of expected loss and therefore satisfy the Equal Predictive Ability criterion. Consequently, differences in subsequent performance measures should be interpreted conditionally within this statistically equivalent set, rather than as evidence of loss-based dominance.
Within the SSM, Out-of-Sample R OOS 2 values are used to quantify the magnitude of forecast gains relative to the historical mean benchmark, while Mincer–Zarnowitz regressions assess forecast unbiasedness and scale efficiency. These diagnostics serve to characterize the economic relevance and statistical optimality of the competing forecasts without affecting the loss-based selection. Among the models belonging to the SSM, the HAR-MJ-D specification combines the highest R OOS   2 with statistically unbiased and efficient forecasts and is therefore retained as the preferred model for subsequent analysis.
Several empirical regularities emerge from the loss-based selection results reported in Table 4. First, all GARCH-type specifications are excluded from the Superior Set of Models, indicating systematically inferior predictive performance relative to HAR-type formulations when evaluated under the QLIKE loss. This result is consistent with the inability of classical conditional variance models to fully exploit the information contained in high-frequency price data, particularly in electricity markets characterized by pronounced long-memory dynamics, jump behavior, and strong calendar effects.
Figure 10 provides a comparative analysis between two GARCH specifications, the optimal HAR model (HAR-M-J with day-of-the-week), and the empirical realized volatility. The results indicate that the GARCH models consistently underestimate volatility throughout most of the sample period. While the AVGARCH model shows a slight improvement over the sGARCH in terms of bias, both models remain significantly decoupled from actual volatility levels. Consequently, they are heavily penalized for this systematic underestimation and are subsequently excluded from the Superior Set of Models (SSM).
Second, the Superior Set is dominated by HAR-type specifications estimated on realized volatility measures, with no logarithmic transformations surviving the MCS procedure. This finding suggests that, once deterministic intraday and seasonal patterns are removed via the Ullrich demeaning adjustment, additional variance-stabilizing transformations provide no further gains in out-of-sample performance. In contrast, both the inclusion of jump components and day-of-the-week dummy variables remains highly relevant, underscoring the importance of capturing discontinuous price movements and residual weekly seasonality in volatility forecasting for the Spanish electricity market.
Within the SSM, the HAR-MJ-D specification—based on Median Realized Volatility, an explicit jump component, and weekday indicators—exhibits the highest Out-of-Sample R OOS 2 while simultaneously satisfying the Mincer–Zarnowitz unbiasedness and efficiency conditions. This combination indicates that, among statistically equivalent competitors, the HAR-MJ-D model delivers the most economically meaningful forecast gains relative to a historical mean benchmark without exhibiting systematic bias or scale distortions. These results are consistent with [40], confirming that Median Realized volatility yields the highest forecasting accuracy among the estimators considered.
In electricity markets, Out-of-Sample R OOS 2 values are typically moderate in absolute magnitude. This reflects the intrinsic characteristics of power prices, which differ substantially from standard financial assets. Electricity prices exhibit frequent and sudden jumps, strong non-storability constraints, pronounced weather-driven shocks, regulatory interventions, and structural changes associated with renewable integration. These features generate a large unpredictable component in realized volatility that cannot be fully anticipated using past information alone.
To further examine the sources of this superior performance and assess the economic relevance of the model components, the following Table 5 reports the in-sample parameter estimates of the preferred HAR-MJ-D specification. These estimates provide insight into volatility persistence across heterogeneous horizons, the role of jump dynamics, and the impact of residual weekly seasonality. The results are summarized in Table 5.
Restricting attention to statistically significant regressors, the fitted HAR-MJ-D model can be written as
R V t = 0.0298 + 0.3284   MedRV t d + 0.2601   MedRV t w + 0.4564   MedRV t m   + 0.2324   J t + 0.2133   D 6 , t + 0.4370   D 7 , t + ε t ,
where J t   denotes the jump component and D 6 , t   and D 7 , t correspond to Saturday and Sunday, respectively, with Monday as the reference category.
The in-sample estimation of the HAR-M-J, in Table 5, (Heterogeneous Autoregressive model with MedRV and Jumps), augmented with weekday dummies, provides significant insights into the volatility structure of the series. With an Adjusted R2 of 0.3101, the model captures approximately 31% of the total variation in future realized volatility, indicating a robust fit for high-frequency financial data.
The model results strongly support the Heterogeneous Market Hypothesis, where agents with different time horizons contribute to the volatility process.
Daily Component: The daily median realized volatility is highly significant, confirming a strong short-term momentum effect.
Weekly Component: This remains significant at the 1% level, reflecting the influence of medium-term market participants.
Monthly Component: The monthly component exhibits the highest coefficient. This suggests a “long-memory” property in the series, where long-term structural trends—could be linked to the geopolitical and renewable factors— influences on the current volatility regime.
The Jump component is statistically significant at the 1% level. This confirms that the continuous part of volatility is insufficient to describe the market’s behavior. Despite the Ullrich demeaning and using a robust estimator. Discontinuous price shocks (jumps) possess their own predictive power and contribute significantly to the escalation of future volatility. This indicates that sudden “bursts” of information or unexpected events are quickly priced into the volatility of subsequent periods.
To account for potential day-of-the-week effects, the model includes dummy variables, using Monday as the baseline reference.
Working days: The coefficients for Tuesday through Friday are not statistically significant. This suggests that volatility during the standard workweek does not deviate meaningfully from the levels observed on Monday.
The Weekend Effect (Sat–Sun): In contrast, the coefficients for Saturday and Sunday are both positive and highly significant. The magnitude of the Sunday coefficient is particularly striking, being the largest dummy in the model. This reveals a substantial volatility during the weekend.
Overall, the results reported in this section provide consistent evidence that volatility dynamics in the Spanish electricity market are best captured by models that explicitly account for long-memory behavior, discontinuous price movements, and residual calendar effects. The out-of-sample evaluation based on the QLIKE loss function and the Model Confidence Set highlights the superiority of HAR-type specifications over GARCH models, while conditional diagnostics further indicate that incorporating jump components and day-of-the-week effects improves forecast accuracy without introducing bias or inefficiency. Taking together, these findings suggest that reliable volatility forecasting in electricity markets requires models capable of exploiting high-frequency information and accommodating structural features unique to power prices. The implications of these results for risk management, market monitoring.
Importantly, volatility forecasting in electricity markets is fundamentally constrained by the dominance of jump-driven and exogenous shocks, implying that very high R OOS 2 values would be unexpected and potentially indicative of overfitting. Consequently, moderate out-of-sample explanatory power is fully consistent with the realized volatility forecasting.

5. Conclusions

This paper analyzes volatility forecasting in the Spanish electricity spot market over the period 2021–2025, characterized by uncertainty, frequent price jumps, and the presence of zero and negative prices. To address these features, we combine a price-shift transformation and the Ullrich demeaning procedure [10] with realized volatility measures constructed from high-frequency data, and evaluate a wide range of competing volatility models within a coherent out-of-sample forecasting procedure.
Model comparison is conducted using a loss-based approach using QLIKE loss function and the Model Confidence Set methodology. This procedure provides statistically robust inference on predictive performance while avoiding multiple-testing and data-snooping biases. The empirical results show that all return-based GARCH specifications are excluded from the Superior Set of Models, whereas HAR-type models based on realized volatility measures systematically dominate in terms of expected loss.
Conditional diagnostics within the Superior Set further reveal that models incorporating jump components and day-of-the-week effects achieve superior economic performance. In particular, the HAR-MJ-D specification based on Median Realized Volatility combines the highest out-of-sample R OOS 2 with unbiased and efficient forecasts according to Mincer–Zarnowitz tests. These results highlight the importance of modeling long-memory volatility components, discontinuous price movements, and residual weekly seasonality in modern electricity markets.
Although Out-of-Sample R OOS 2 values are moderate in absolute terms, this magnitude is expected in electricity markets, where volatility dynamics are dominated by jumps, weather shocks, and structural changes. Given the strength of the historical mean benchmark and the high intrinsic unpredictability of power prices, the reported R OOS 2 values represent substantial and economically meaningful forecast gains.
Overall, the findings underscore the relevance of coherent loss-based evaluation procedures and high-frequency information for volatility forecasting. And the importance of demeaning, using robust estimators and modeling the day-effects. Future research could extend this approach to multiple time steps, using dummy variables to account for structural changes. The effect of climatic variables—such as dummy variables for periods of drought or wind speed, among others—could be examined.

Author Contributions

Conceptualization, D.V-C. and J.C.G-D.; methodology, D.V-C. and J.C.G-D.; software, D.V-C.; validation, D.V.-C and J.C.G-D.; formal analysis, D.V-C. and J.C.G-D.; investigation, D.V-C.; writing—original draft preparation, D.V-C.; writing—review and editing, D.V-C. and J.C.G-D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

During the preparation of this manuscript, the authors used artificial intelligence tools for language editing and stylistic improvements. The authors reviewed and edited the content and take full responsibility for the scientific accuracy and integrity of the work.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ARCH Autoregressive Conditional Heteroskedasticity
GARCH Generalized Autoregressive Conditional Heteroskedasticity
HAR Heterogenous Autoregressive
MIBEL Iberian Electricity Market
MCS Model Confidence Set
SSM Superior Set of Models
MedRV Median Realized Volatility
RV Realized Volatility
KRV Kernel Realized Volatility

References

  1. Pepermans, G. European energy market liberalization: Experiences and challenges. Int. J. Econ. Policy Stud. 2019, 13, 3–26. [Google Scholar] [CrossRef]
  2. Ciarreta, A.; Pizarro-Irizar, C.; Zarraga, A. Renewable energy regulation and structural breaks: An empirical analysis of Spanish electricity price volatility. Energy Econ. 2020, 88, 104749. [Google Scholar] [CrossRef]
  3. Cevik, S.; Zhao, Y. Shocked: Electricity price volatility spillovers in Europe. Int. Econ. Econ. Policy 2026, 23, 1–21. [Google Scholar] [CrossRef]
  4. Qin, Y.; Hong, K.; Chen, J.; Zhang, Z. Asymmetric effects of geopolitical risks on energy returns and volatility under different market conditions. Energy Econ. 2020, 90, 104851. [Google Scholar] [CrossRef]
  5. Escribano, G.; Gouveia, A.F.; Fachada, J.; Arbeloa, I.U. After the Energy Crisis: Policy Responses in the Iberian Peninsula. Brookings Institution Articles. Available online: https://www.brookings.edu/articles/after-the-energy-crisis-policy-responses-in-the-iberian-peninsula/ (accessed on 17 April 2026).
  6. Durán-Castillo, G.; Weis, T.; Leach, A.; Fleck, B.A. Toward Sustainable Electricity Markets: Merit-Order Dynamics on Photovoltaic Energy Price Duck Curve and Emissions Displacement. Sustainability 2025, 17, 4618. [Google Scholar] [CrossRef]
  7. Cipra, T. Time Series in Economics and Finance. Time Ser. Econ. Financ. 2020, 1–410. [Google Scholar] [CrossRef]
  8. Tsay, R. Analysis of Financial Time Series; Wiley: Hoboken, NJ, USA, 2010. [Google Scholar]
  9. Billimoria, F.; Mays, J.; Poudineh, R. Hedging and tail risk in electricity markets. Energy Econ. 2025, 141, 108132. [Google Scholar] [CrossRef]
  10. Ullrich, C.J. Realized volatility and price spikes in electricity markets: The importance of observation frequency. Energy Econ. 2012, 34, 1809–1818. [Google Scholar] [CrossRef]
  11. Qu, H.; Duan, Q.; Niu, M. Modeling the volatility of realized volatility to improve volatility forecasts in electricity markets. Energy Econ. 2018, 74, 767–776. [Google Scholar] [CrossRef]
  12. Engle, R.F. Autoregressive Conditional Heteroscedasticity with Estimates of the Variance of United Kingdom Inflation. Econometrica 1982, 50, 987–1007. [Google Scholar] [CrossRef]
  13. Bollerslev, T. Generalized autoregressive conditional heteroskedasticity. J. Econom. 1986, 31, 307–327. [Google Scholar] [CrossRef]
  14. Haas, M.; Mittnik, S.; Paolella, M.S. A New Approach to Markov-Switching GARCH Models. J. Financ. Econom. 2004, 2, 493–530. [Google Scholar] [CrossRef]
  15. Taylor, S.J. Modelling financial time series. In Modelling Financial Time Series; World Scientific Connect: Singapore, 1986; pp. 1–270. [Google Scholar] [CrossRef]
  16. Schwert; William, G. Stock Volatility and the Crash of ’87. Rev. Financ. Stud. 1990, 3, 77–102. [Google Scholar] [CrossRef]
  17. Andersen, T.G.; Bollerslev, T. Answering the Skeptics: Yes, Standard Volatility Models do Provide Accurate Forecasts. Int. Econ. Rev. 1998, 39, 885–905. [Google Scholar] [CrossRef]
  18. Barndorff-Nielsen, O.E.; Shephard, N. Power and Bipower Variation with Stochastic Volatility and Jumps. J. Financ. Econom. 2004, 2, 1–37. [Google Scholar] [CrossRef]
  19. Barndorff-Nielsen, O.E.; Hansen, P.R.; Lunde, A.; Shephard, N. Designing Realized Kernels to Measure the ex post Variation of Equity Prices in the Presence of Noise. Econometrica 2008, 76, 1481–1536. [Google Scholar] [CrossRef]
  20. Andersen, T.G.; Dobrev, D.; Schaumburg, E. Jump-robust volatility estimation using nearest neighbor truncation. J. Econom. 2012, 169, 75–93. [Google Scholar] [CrossRef]
  21. Corsi, F. A Simple Approximate Long-Memory Model of Realized Volatility; Social Science Research Network: Rochester, NY, USA, 2009; p. 1365738. Available online: https://papers.ssrn.com/abstract=1365738 (accessed on 14 November 2025).
  22. Gong, X.; Lin, B. Adding dummy variables: A simple approach for improved volatility forecasting in electricity market. J. Manag. Sci. Eng. 2023, 8, 191–213. [Google Scholar] [CrossRef]
  23. Li, K.; Cursio, J.D.; Jiang, M.; Liang, X. The significance of calendar effects in the electricity market. Appl. Energy 2019, 235, 487–494. [Google Scholar] [CrossRef]
  24. Patton, A.J. Volatility forecast comparison using imperfect volatility proxies. J. Econom. 2011, 160, 246–256. [Google Scholar] [CrossRef]
  25. Hansen, P.R.; Lunde, A.; Nason, J.M. The Model Confidence Set. Econometrica 2011, 79, 453–497. [Google Scholar] [CrossRef]
  26. Chuliá, H.; Furió, D.; Uribe, J.M. Volatility spillovers in energy markets. Energy J. 2019, 40, 173–197. [Google Scholar] [CrossRef]
  27. Red Eléctrica. Renewable Energies Generated 56% of Spain’s Electricity Mix in 2024. Red Eléctrica. Available online: https://www.ree.es/en/press-office/press-release/news/press-release/2025/01/renewable-energies-generated-56-per-cent-spains-electricity-mix-2024 (accessed on 19 April 2026).
  28. Boubaker, H.; Bannour, N. Coupling the Empirical Wavelet and the Neural Network Methods in Order to Forecast Electricity Price. J. Risk Financ. Manag. 2023, 16, 246. [Google Scholar] [CrossRef]
  29. Geis, J.; Neumann, F.; Lindner, M.; Härtel, P.; Brown, T. Price formation in a highly-renewable, sector-coupled energy system. Energy Econ. 2026, 157, 109213. [Google Scholar] [CrossRef]
  30. Naeem, M.; Jassim, H.S.; Saleem, K.; Fatima, M. Forecasting Volatility of the Nordic Electricity Market an Application of the MSGARCH. Risks 2025, 13, 58. [Google Scholar] [CrossRef]
  31. Inglada-Pérez, L.; Gil, S.G.Y. A Study on the Nature of Complexity in the Spanish Electricity Market Using a Comprehensive Methodological Framework. Math 2024, 12, 893. [Google Scholar] [CrossRef]
  32. Zakeri, B.; et al. The role of natural gas in setting electricity prices in Europe. Energy Rep. 2023, 10, 2778–2792. [Google Scholar] [CrossRef]
  33. Robinson, D.; Arcos-Vargas, A.; Tennican, M.; Núñez, F. The Iberian Exception: An overview of its effects over its first 100 days. 2023.
  34. Hidalgo-Pérez, M.; Collado, N.; Galindo, J.; Mateo, R. The Iberian exception: Estimating the impact of a cap on gas prices for electricity generation on consumer prices and market dynamics. Energy Policy 2024, 188, 114092. [Google Scholar] [CrossRef]
  35. Pavlík, M.; Kurimský, F.; Ševc, K. Renewable Energy and Price Stability: An Analysis of Volatility and Market Shifts in the European Electricity Sector (2015–2025). Appl. Sci. 2025, 15, 6397. [Google Scholar] [CrossRef]
  36. Bento, P.; Mariano, S.; Carvalho, P.; Calado, M.D.R.; Pombo, J. Soaring electricity prices in the day-ahead Iberian market: Policy insights, regulatory challenges and lack of system flexibility. Int. J. Energy Sect. Manag. 2023, 18, 312–333. [Google Scholar] [CrossRef]
  37. Engle, R.F.; Bollerslev, T. Modelling the persistence of conditional variances. Econom. Rev. 1986, 5, 1–50. [Google Scholar] [CrossRef]
  38. Lamoureux, C.G.; Lastrapes, W.D. Persistence in variance, structural change, and the GARCH model. J. Bus. Econ. Stat. 1990, 8, 225–234. [Google Scholar] [CrossRef]
  39. Bollerslev, T. Glossary to ARCH (GARCH). Creat. Res. Pap. 2008, 297.
  40. Naimoli, A.; Storti, G. Forecasting Volatility and Tail Risk in Electricity Markets. J. Risk Financ. Manag. 2021, 14, 294. [Google Scholar] [CrossRef]
  41. Corsi, F.; Audrino, F.; Renò, R. HAR Modeling for Realized Volatility Forecasting. In Handbook of Volatility Models and Their Applications; Wiley: Hoboken, NJ, USA, 2012; pp. 363–382. [Google Scholar] [CrossRef]
  42. Ciarreta, A.; Zarraga, A. Modeling realized volatility on the Spanish intra-day electricity market. Energy Econ. 2016, 58, 152–163. [Google Scholar] [CrossRef]
  43. Bollerslev, T.; Patton, A.J.; Quaedvlieg, R. Exploiting the errors: A simple approach for improved volatility forecasting. J. Econom. 2016, 192, 1–18. [Google Scholar] [CrossRef]
  44. Andersen, T.G.; Bollerslev, T.; Diebold, F.X. Roughing It Up: Including Jump Components in the Measurement, Modeling, and Forecasting of Return Volatility. Rev. Econ. Stat. 2007, 89, 701–720. [Google Scholar] [CrossRef]
  45. Clements, A.; Preve, D.P.A. A Practical Guide to harnessing the HAR volatility model. J. Bank. Financ. 2021, 133, 106285. [Google Scholar] [CrossRef]
  46. Huang, X.; Tauchen, G. The Relative Contribution of Jumps to Total Price Variance. J. Financ. Econom. 2005, 3, 456–499. [Google Scholar] [CrossRef]
  47. Urniezius, R.; et al. Enhancing Prediction by Incorporating Entropy Loss in Volatility Forecasting. Entropy 2025, 27, 806. [Google Scholar] [CrossRef]
  48. Campbell, J.Y.; Lo, A.W.; MacKinlay, A.C. The Econometrics of Financial Markets; Princeton University Press: Princeton, NJ, USA, 1997. [Google Scholar] [CrossRef]
  49. Mincer, J.; Zarnowitz, V. The Evaluation of Economic Forecasts. In Economic Forecasts and Expectations: Analysis of Forecasting Behavior and Performance; NBER Chapters; National Bureau of Economic Research, Inc.: Cambridge, MA, USA, 1969; pp. 3–46. Available online: https://EconPapers.repec.org/RePEc:nbr:nberch:1214 (accessed on 12 April 2026).
  50. Franses, P.H. Testing bias in professional forecasts. J. Forecast. 2021, 40, 1086–1094. [Google Scholar] [CrossRef]
  51. Uniejewski, B.; Weron, R.; Ziel, F. Variance Stabilizing Transformations for Electricity Spot Price Forecasting. IEEE Trans. Power Syst. 2018, 33, 2219–2229. [Google Scholar] [CrossRef]
Figure 1. Time series of hourly spot electricity prices (€/MWh) in Spain (OMIE), illustrating increasing price levels and volatility throughout 2021.
Figure 1. Time series of hourly spot electricity prices (€/MWh) in Spain (OMIE), illustrating increasing price levels and volatility throughout 2021.
Preprints 212441 g001
Figure 2. Time series of hourly spot electricity prices (€/MWh) in Spain (OMIE), characterized by extreme price spikes and elevated volatility during 2022.
Figure 2. Time series of hourly spot electricity prices (€/MWh) in Spain (OMIE), characterized by extreme price spikes and elevated volatility during 2022.
Preprints 212441 g002
Figure 3. Time series of hourly spot electricity prices (€/MWh) in Spain (OMIE), showing a partial normalization in price levels alongside persistent intraday volatility during 2023.
Figure 3. Time series of hourly spot electricity prices (€/MWh) in Spain (OMIE), showing a partial normalization in price levels alongside persistent intraday volatility during 2023.
Preprints 212441 g003
Figure 4. Time series of hourly spot electricity prices (€/MWh) in Spain (OMIE), reflecting lower average price levels alongside pronounced intraday variability during 2024.
Figure 4. Time series of hourly spot electricity prices (€/MWh) in Spain (OMIE), reflecting lower average price levels alongside pronounced intraday variability during 2024.
Preprints 212441 g004
Figure 5. Time series of hourly spot electricity prices (€/MWh) in Spain (OMIE), showing moderate price levels together with sustained intraday volatility during 2025.
Figure 5. Time series of hourly spot electricity prices (€/MWh) in Spain (OMIE), showing moderate price levels together with sustained intraday volatility during 2025.
Preprints 212441 g005
Figure 6. Average hourly spot electricity prices in Spain (OMIE), 2021–2025, highlighting the emergence of a duck curve pattern. Midday price depressions and steep evening ramps become increasingly pronounced in the post crisis period, reflecting the growing impact of solar generation and renewable intermittency on intraday price formation.
Figure 6. Average hourly spot electricity prices in Spain (OMIE), 2021–2025, highlighting the emergence of a duck curve pattern. Midday price depressions and steep evening ramps become increasingly pronounced in the post crisis period, reflecting the growing impact of solar generation and renewable intermittency on intraday price formation.
Preprints 212441 g006
Figure 7. Overview of the Out-Of-Sample evaluation and model selection strategy.
Figure 7. Overview of the Out-Of-Sample evaluation and model selection strategy.
Preprints 212441 g007
Figure 8. Price, returns, and realized volatility dynamics. The top panel shows the spot electricity price series (shifted), the middle panel displays daily returns, and the bottom panel reports realized volatility computed from intraday data.
Figure 8. Price, returns, and realized volatility dynamics. The top panel shows the spot electricity price series (shifted), the middle panel displays daily returns, and the bottom panel reports realized volatility computed from intraday data.
Preprints 212441 g008
Figure 9. Realized volatility, median realized volatility, and kernel realized volatility dynamics. The top panel shows the realized volatility, the middle panel displays daily the median realized volatility, and the bottom panel reports kernel realized volatility.
Figure 9. Realized volatility, median realized volatility, and kernel realized volatility dynamics. The top panel shows the realized volatility, the middle panel displays daily the median realized volatility, and the bottom panel reports kernel realized volatility.
Preprints 212441 g009
Figure 10. Out-of-Sample comparison between GARCH models, realized and the best HAR model.
Figure 10. Out-of-Sample comparison between GARCH models, realized and the best HAR model.
Preprints 212441 g010
Table 1. Summary statistics of hourly electricity prices.
Table 1. Summary statistics of hourly electricity prices.
Statistic Prices
Original Shifted 1
Observations 43,752 43,752
Mean 98.9770 114.9770
Median 92.9450 108.9450
Std. deviation 69.2104 69.2104
Minimum −15.0000 1.0000
Maximum 700.0000 716.0000
Skewness 1.0355 1.0355
Excess kurtosis 2.6243 2.6243
Negative values (%) 1.84 0.00
1 Shift offset = 1 − min(P) = applied uniformly so that min(P*) = 1, enabling well-defined log-returns.
Table 2. Summary statistics of intraday log-returns.
Table 2. Summary statistics of intraday log-returns.
Statistic Returns
Shifted 1 Ullrich-adj
Observations 43,751 43,751
Mean 0.0000 −0.0029
Median −0.0034 0.0000
Std. deviation 0.1835 0.15004
Minimum −2.1150 −2.1125
Maximum 2.1401 2.1005
Skewness 0.6317 0.3437
Excess kurtosis 14.7102 20.8944
Jarque-Bera (p-value) 0.0000 0.0000
1 JB test H0: normality. Excess kurtosis relative to 3. Demeaning subtracts the median for each month × weekday × hour combination.
Table 3. Summary statistics of realized variance estimators.
Table 3. Summary statistics of realized variance estimators.
Estimator 1 Central tendency Dispersion Shape
Mean Median Std. dev. Min Max P25 P75 Skewness Ex. kurtosis
RV 0.5407 0.1324 0.8333 0.0087 8.1929 0.0596 0.6938 2.7490 11.3210
MedRV 0.4041 0.1131 0.6199 0.0072 5.0362 0.0511 0.4975 2.7320 9.6380
KRV (Bartlett) 0.8623 0.1523 1.4149 0.0000 8.4651 0.0665 0.9666 2.1310 4.1430
1 MedRV and KRV computed with the highfrequency R package. RV computed manually. Excess kurtosis relative to 3.
Table 4. Model Confidence Set results and out-of-sample forecast evaluation (MCS, R OOS 2 and Mincer–Zarnowitz).
Table 4. Model Confidence Set results and out-of-sample forecast evaluation (MCS, R OOS 2 and Mincer–Zarnowitz).
Model Specification MCS 3 Predictive Accuracy Mincer-Zarnowitz Regression 5
Model Transform. 1 Week 2 p-Value R OOS 2  4 α ^ β ^ p-Value H0 Bias
HAR-M-J No Yes 1.0000 0.4798 0.02467 0.9915 0.9313 Unbiased
HAR-J No Yes 1.0000 0.4727 0.10791 0.9739 0.2093 Unbiased
HAR No Yes 1.0000 0.4725 0.10192 0.9798 0.2213 Unbiased
HAR-Q-J No Yes 0.9984 0.4710 0.11325 0.9699 0.1963 Unbiased
HAR-Q No Yes 0.9936 0.4708 0.10785 0.9758 0.2035 Unbiased
HAR-J No No 0.9743 0.4570 0.12082 0.9635 0.1997 Unbiased
HAR No No 0.9599 0.4562 0.11598 0.9682 0.2102 Unbiased
HAR-M No Yes 0.9529 0.4770 0.01248 0.9914 0.9926 Unbiased
HAR-CJ No Yes 0.8494 0.4602 0.10050 1.0298 0.0437 Biased
HAR-RV-J No Yes 0.8494 0.4602 0.10050 1.0298 0.0437 Biased
HAR-Q-J SQRT Yes 0.8268 0.4298 0.16327 1.1547 0.0000 Biased
HAR-Q SQRT Yes 0.8261 0.4296 0.15898 1.1590 0.0000 Biased
HAR-M-J No No 0.8085 0.4632 0.04187 0.9778 0.8913 Unbiased
HAR-K-J No Yes 0.7579 0.4456 0.17302 0.8396 0.1115 Unbiased
HAR-Q-J No No 0.7528 0.4545 0.12802 0.9586 0.1798 Unbiased
HAR-K-J No No 0.6789 0.4333 0.18375 0.8309 0.1217 Unbiased
HAR-K No Yes 0.6517 0.4431 0.17043 0.8360 0.1166 Unbiased
HAR-Q No No 0.6447 0.4538 0.12354 0.9636 0.1843 Unbiased
HAR-K-J SQRT Yes 0.5853 0.4302 0.29411 0.8896 0.0003 Biased
HAR-K No No 0.5785 0.4311 0.18124 0.8274 0.1215 Unbiased
HAR-K SQRT Yes 0.5407 0.4282 0.29724 0.8761 0.0007 Biased
HAR-MJ SQRT Yes 0.5375 0.4610 0.17497 1.0232 0.0001 Biased
HAR-QJ SQRT No 0.3677 0.4064 0.15132 1.1881 0.0000 Biased
HAR-KJ SQRT No 0.3624 0.4166 0.28276 0.9110 0.0008 Biased
HAR-MJ SQRT No 0.3540 0.4404 0.18174 1.0286 0.0002 Biased
HAR-Q SQRT No 0.3540 0.4057 0.14794 1.1913 0.0000 Biased
HAR-M SQRT Yes 0.3217 0.4632 0.16057 1.0255 0.0009 Biased
HAR SQRT Yes 0.3155 0.4380 0.21597 1.0724 0.0000 Biased
HAR-J SQRT Yes 0.3134 0.4377 0.21952 1.0688 0.0000 Biased
HAR-K SQRT No 0.2963 0.4149 0.28765 0.8953 0.0015 Biased
HAR-M SQRT No 0.2475 0.4414 0.17767 1.0187 0.0009 Biased
HAR-CJ No No 0.2189 0.4443 0.10619 1.0263 0.0413 Biased
HAR-RV-J No No 0.2189 0.4443 0.10619 1.0263 0.0413 Biased
HAR-M No No 0.2009 0.4607 0.03153 0.9760 0.9619 Unbiased
HAR-RV-J SQRT Yes 0.1025 0.4181 0.23426 1.1062 0.0000 Biased
HAR-C No Yes 0.0593 0.3935 0.22647 0.9287 0.0263 Biased
1 No: R V t   a n d SQRT: R V t . 2 Week: models with day-of-the-week effects. 3 MCS p-value (R-statistic). A model survives the MCS if p ≥ α = 20% (Hansen et al., 2011). 4  R OOS 2   benchmarked against the in-sample mean RV. 5 H0 (MZ): α = 0 and β = 1 jointly. p ≥ 0.05 → unbiased forecast. Highlighted row: best unbiased model by R OOS 2 . .
Table 5. In-sample estimation of the HAR-MJ-D model.
Table 5. In-sample estimation of the HAR-MJ-D model.
Variable 1 Estimate Std. Error t-Stat p-Value Sig. 2
Realized variance components (MedRV)
Intercept −0.029813 0.049066 −0.607616 0.5435
MedRV t d (daily) 0.328403 0.056789 5.782828 0.0000 ***
MedRV t w (weekly) 0.260141 0.100465 2.589383 0.0097 ***
MedRV t m (monthly) 0.456403 0.114239 3.995174 0.0001 ***
Jump component (daily) 0.232385 0.068986 3.368574 0.0008 ***
Weekday dummies  D i t
D 2 , t (Tuesday) 0.041505 0.058675 0.707364 0.4794
D 3 , t (Wednesday) 0.100200 0.062523 1.602600 0.1092
D 4 , t   (Thursday) 0.032433 0.058584 0.553620 0.5799
D 5 , t (Friday) 0.026091 0.057671 0.452406 0.6510
D 6 , t (Saturday) 0.213303 0.060400 3.531498 0.0004 ***
D 7 , t (Sunday) 0.436962 0.085466 5.112722 0.0000 ***
1 MedRV (weekly) = 7-day rolling mean of MedRV. MedRV (monthly) = 30-day rolling mean. Jump component = max(RV − MedRV, 0) on Huang-Tauchen jump days. 2 Significance: *** p < 0.01 · ** p < 0.05 · * p < 0.10. Goodness of fit: N = 1,792 · R2 = 0.3139 · Adj. R2 = 0.3101.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Accessibility

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated