Forecasting Wholesale Electricity Prices With Artificial Intelligence Models : The Italian Case

Electricity price forecasting has become a crucial element for both private and public decision-making. This importance has been growing since the wave of deregulation and liberalization of energy sector worldwide late 1990s. Given these facts, this paper tries to come up with a precise and flexible forecasting model for the wholesale electricity price for the Italian power market on an hourly basis. We utilize artificial intelligence models such as neural networks and bagged regression trees that are rarely used to forecast electricity prices. After model calibration, our final model is bagged regression trees with exogenous variables. The selected model outperformed neural network and bagged regression with single price used in this paper, it also outperformed other statistical and non-statistical models used in other studies. We also confirm some theoretical specifications of the model. As a policy implication, this model might be used by energy traders, transmission system operators and energy regulators for an enhanced decisionmaking process. JEL Codes: C14, C45, C53, Q47.


Introduction
The course of deregulation and the emergence of competitive electricity markets, witnessed in the last decade of the 20 th century, have been restructuring the power sectors which were traditionally controlled by governments as monopolists.In countries where power sectors were liberalized, market rules substituted central government rules for trading various electricity products.Furthermore, there is still a debate among energy researchers and practitioners whether to consider electricity as a good or a service.Nevertheless, we can strictly argue that electricity, as a product, is distinguished from other products in the sense that it is not physically storable, and thus the real-time balancing between generation and consumption is needed to keep the system in equilibrium.This leads us to the various factors driving electricity demand.Weather conditions such as, temperature and wind, and the daily activities during the business day and weekend day determine the shape of the daily demand which is further categorized as base demand, peak and off-peak.As a consequence, these special characteristics of electricity demand lead to some phenomena, or broadly speaking anomalies, in the behavior of electricity price not witnessed in other markets, perhaps the temporal seasonality and the sudden short-lasting spikes are the major observed behaviors of electricity price.In this manner, these facts have been encouraging energy researcher to intensify their work for developing accurate price forecasting models, (Weron 2014;Kaminski, 2013;Shahidehpour, Yamin, & Li, 2002).Furthermore, electricity price forecasting has become a crucial element for business decision-making as well as for public policy-making since energy security is a vital issue in all societies, (Weron 2014;Bunn, 2004;Eydeland & Wolyniec, 2003).Additionally, experience shows that energy business is one of the most sensitive sectors in the economy since cost pass-through to final customers has many social and political implications as shown by California crisis of 2000-2001, (Joskow, 2001).Electric companies are, in this regard, very vulnerable to bankruptcies as trading any extra/shortage of energy as needed might cause them huge economic losses.In electricity markets, hedging against volumes and price movements was mainly motivated by the extreme electricity price variations, especially during peak hours.Nevertheless, hedging positions in electricity markets can't be achieved without the temporal and on-going price forecast for different time frequencies ranging from half an hour to several months or years ahead.This forecast serves as the core interest of market participants on both demand and supply sides, as well as for investment managers in power sectors to continuously modify their actual production/consumption or for a temporal adjustment of financial portfolios.In the same vein, and based on the aforementioned attributes, the motives for this research lie under three main pillars as follows: 1-The emergence of a new set of forecasting models called artificial intelligence.
2-These models are not widely tested on the Italian market, thus verifying their accuracy is needed.
3-The important changes in the wholesale electricity markets in Europe in general and Italy with its surrounding countries, such as regulatory changes, market coupling, the growing impact of renewables and the distributed generation.
Our study contributes to the existing literature by investigating and verifying on of the rapidly growing group of models "artificial intelligence", in particular, bootstrap aggregation of regression trees which is not widely explored in energy studies even though it shows a relatively better forecasting power.Furthermore, few forecasting papers are conducted on the Italian energy market and this model is the first time to be tested on such a market.Finally, we provide detailed breakdowns of forecast errors by hour, hour type (peak and off-peak), day and by month which are rarely provided in other studies.
Before we proceed, we brief the remaining parts of this paper, part two is the review of the related literature, part three concerns the model and the bagging technique, part four is dedicated for data, model calibration and model specifications, part five for the forecasting results, and finally part six concerns conclusions and policy implications.

Literature Review
The inspiration of the liberalization of power sectors worldwide came as a consequence of liberalization in other sectors, during which privatization showed growing efficiencies and economic values of nations' wealth.However, as indicated earlier, electricity is not like any other commodity, it is immediately produced as a response for demand, making real-time balancing and price forecasts important elements in electricity markets.In this context, wholesale energy markets were emerged as a consequence of the deregulation and the liberalization of power sectors.Wholesale exchanges can be broadly categorized as power pool in which the participation of market agents is compulsory as in Nordic EU countries, and power exchange with a voluntary involvement as in continental EU countries, and the latter is a more common type of power markets.Trading in power exchanges is mostly concentrated on day-ahead spot products.Spot power markets work on a full efficiency to meet the full-time functioning, it constitutes hourly auctions for 24 hours a day, seven day a week and 365 days a year.Moreover, the dynamics and the special characteristics of this type of markets makes price forecasting, from few hours ahead to several days, of high importance for market participants, (Girish et al., 2013;Girish & Vijayalakshmi, 2013).
Regarding the outcome of the power market, a market clearing price is finalized as a result of the intersection between the market aggregate demand and supply curves corresponding to each trading hour and each trading zone, and since the supply curve seems to be flat for base demand stake, the electricity price, therefore, is less sensitive to changes in demand in that stake of the curve, (Girish & Vijayalakshmi, 2013).Furthermore, the shape of the supply curve is simply the ranking of all power plants based on their marginal cost and their flexibility to be run as needed.As demand for electricity starts to increase, more expensive power plants have to be switched on to meet the extra demand driving the supply curve to be steeper, Normally, in this case, generation plants such as gas and oil fired plants with more expensive variable cost, are called to generate the extra electricity requested driving the marginal cost of production to be higher.Therefore, a small change in electricity demand in this area of the curve leads to a significant change in electricity price since the price is set as the marginal cost of the plant that met the required demand.Additionally, these plants have to charge higher prices to compensate themselves for the expensive fuel used and for the few hours of work.This situation occurs normally during the peak hours, and when demand starts to revert back to its base shape, the expensive generation plants are turned off.Furthermore, other factors might have significant effects on electricity price such as sudden outages, maintenance, network congestion and the exercise of market power by power market participants, ( In the same vein, (Weron, 2014) classifies forecasting models into five main groups, agents-based, fundamental, reduced-form, statistical, and artificial intelligence models.In general These are the main forecasting groups, however, they can also be classified according to different criteria such as dynamic/static, linear/nonlinear, parametric/non-parametric and deterministic/stochastic.Additionally, they may also be classified as optimization models, equilibrium models, agent-based models, and artificial intelligence models or game theory models, simulation models and the timeseries models as in (Aggarwal et al., 2009).Table (1) summarizes the EPF models as in (Weron, 2014), however, we add the last type in computational intelligence models (bootstrap aggregation of regression trees).

Table 1: Electricity price forecasting models
Source: Self reproduction (Weron, 2014) The groups of models are summarized as follows as in (Weron, 2014): • Multi-agent models are based on simulating the behaviors of heterogeneous agents in the power market, and then construct a pricing process by matching the demand and supply in the market.• Fundamental methods try to look for important fundamental variables that might explain electricity price dynamics.) propose an agent-based computational model.In particular, they investigate the relationship between the strategic behavior of the thermal power plants and the level of the price at a zonal and national levels.Their model replicates exactly the market clearing process in the day-ahead market and the Italian high-voltage transmission network with its zonal subdivision.Moreover, (Beccali et al., 2007) utilize an artificial intelligence model, specifically, Neural Networks to forecast the shortterm household electricity consumption related to a suburban area in Sicily.

Multi
Various methods and models have been applied to electricity price forecasting (EPF), with varying degrees of success depending on the country, economic cycle, time evolution, and other micro/macrostructure variations.In this study, we focus on the last type of computational intelligence models which is called bootstrap aggregation or bagged regression trees.Our study contributes to the existing literature by investigating and verifying on of the rapidly growing set of models "artificial intelligence models", specifically, bootstrap aggregation of regression trees which is not widely explored in energy aspects though it shows a relatively better forecasting power.Furthermore, few forecasting papers are conducted on the Italian energy market and this model is the first time to be tested on such a market.Finally, we provide detailed breakdowns of forecast errors by hour, hour type, day, and by month that are rarely provided in other studies.
Our objective is trying to come up with a precise and a flexible forecasting model for the hourly wholesale electricity price in Italy which might be utilized by the regulatory authorities and utility providers for a better decision-making process.

The Italian Context
Since the end of 1962, the whole electricity market was under the monopoly of a single vertically integrated and state-owned company Enel, this includes, generation, transmission, and distribution, with the exception of few cities where the local municipalities serve as the electricity undertaker.Moreover, The liberalization process, started early 1990s by Bersani decree, it brought about the replacement of old administrative procedures with market mechanisms more suited to the new framework for power trade, and consequently, the Italian Power Exchange was created, (AEEG, 2008).The Italian Power Exchange (IPEX) is the main instrument through which the execution of the requirements provided by law no.79/1999 for the implementation of the free market.It is an organized system that is able to promote both competition relating to the production and sale of electricity, and the protection of end customers through a maximum transparency ensured through the coordination of the unified electricity market entrusted to the Electricity Market Operator (GME) The Services Market (MSD) is the instrument through which Terna S.p.a (the transmission system operator) procures the resources necessary for the management and the control of the system (congestion resolution intra-zonal, creation of energy reserve, real-time balancing).In the MSD, Terna acts as a central counterparty and offers accepted are remunerated at the price presented (pay-asbid), (GME, 2015; Geanfreda, 2012).Recently, European energy markets and the Italian market in particular, are passing through a period of transformation (transitional regulatory period) which will reshape the structure of the energy markets through the integrity, transparency and security of energy sector in Europe.Perhaps the most noticeable structural change is the market coupling among different EU neighboring nations.Market coupling was derived from the European policy in establishing a European internal electricity market for the purpose of achieving a better capacity allocation, more efficient congestion management, security of supply, transparency, liquidity and finally the convergence of electricity prices in the coupled zones.Italy and Slovenia -among others-entered in a coupling agreement in January 2011 and translated into an actual and official market coupling in February 2015, since then the two markets operate in an integrated way and utilize a common matching algorithm for quantity allocation and price determination.
The other noticeable structural change in the European energy sector is the financialization of energy sector activities under the new European financial reforms under which a considerable part of energy derivative contracts will be regulated under EU financial regulations.
is a company wholly owned by the Ministry of Economy and Finance and officially (Gestore Mercati Energetici) GME On the wholesale side, bilateral contracts and power exchange capture the vast majority of wholesale transactions.Whereas on the retail side, customers are classified as free market customers or eligible customers and the protected segment (standard offer market 2 ) that will be phased out by 2018.

The Model
In the model description we start from the broad group of intelligence models, then to regression trees and finally to bootstrap aggregation (bagging) of regression tress.
According to (Weron, 2014;Aggarwal et al., 2009), in general, there is no common consensus of what Artificial Intelligence (AI) means, and it may not mean the same thing for all people.We can broadly say that the failure of other forecasting models, statistical in particular, to adapt for complex systems was the main drivers for the emergence of AI models.AI models have the added value over other models to adapt to complex dynamic settings and to handle non-linear relations by creating approaches using learning elements, fuzzy logic, and evolution.And with no doubt this added value makes them excellent for short-term price modeling and forecasting as indicated by different authors.
Nonetheless artificial neural networks (ANN) have received a great attention in empirical studies for load and price forecasting, other dynamic techniques are also used in hybrid structures such as fuzzy logic, genetic algorithms, evolutionary programming, swarm intelligence and bagging.
As in (Breiman et al., 1984), regression trees (RTs) or decision trees are designed to predict the value of a numeric variable.RTs use learning elements for building prediction models from a sample data.The data space is recursively split and a simple fitting model is created for each partition, then the final model is obtained by aggregating all partitions together.Regression trees are represented in hierarchical structures (as shown in figure 2), which contains nodes that represent the tests on the input elements, branches that correspond to the results of the test, and the prediction for the value of the target attribute is stored in a leaf.Regression tree leaves contain constant values as predictions for the target variable.
In the same regard, Bootstrap Aggregation, or simply Bagging, was introduced by Breiman as a technique for improving the prediction of regression trees.Since then it has been growing responding to the increased interest in bootstrapping and other processes in statistical modeling.Bagging works on creating multiple similar datasets, re-running the tree analysis, and then collecting the aggregate of the results and re-calculating the tree and associated statistics based on this aggregate, (Breiman, 1998;1996).Notwithstanding Bagging is mainly applied to regression tree as an accuracy enhancement technique of predictions, it can also be applied to other methods such as neural ergy supply with reference to a standard price set by Standard offer market is the option that guarantees consumers the en networks.Furthermore, it is worth noting that Bagging is mainly very useful for unstable regression being bagged, means that, when small changes in the learning sample can simply result in significant variations in the predictions obtained using a specified method, bagging can result in an appreciable reduction in average prediction error (Sutton, 2005).

Tree Bagging Algorithm
The training algorithm applies the general technique of bootstrap aggregating to tree learners to improve the predictive performance of tree models.Bagging works as follows, (ETH Zurich, 2008): 1-Create a bootstrap sample Then compute the bootstrapped estimator * (•).
Bagging algorithm can also be considered an approximation Which can be improved by increasing B. the original point is that we should use * [ * (•)] as a new estimator.
A trivial identity hints at some properties of bagging: write (the theoretical version of bagging with B = ∞) Instead of subtracting the bootstrap bias estimate, we are adding it!What we can hope for is a variance reduction at the price of a higher bias.This turns out to be true if (•) is a tree-based estimator.
It can be shown that for tree-based estimators (•), for very many .Thus, bagging is a variance reduction technique.The reason for this is that a bagged tree turns out to be a product of probit functions Φ( − •) instead of indicator functions [• ] .This causes a variance reduction at the price of some bias.For example, If ~ (0,1) and = 0, the latter quantity equals ¼.On the other hand, Which reduced the variance by the factor 3!.We should use large trees for bagging, because the variance reduction due to bagging asks for a large tree to balance the bias-variance trade-off.
The key point in bagging to reduce the variance without affecting the bias is that the estimates of a single tree are very sensitive to noise, whereas the average aggregated value of many trees is not a noise-sensitive given that the trees are not correlated.Bootstrap sampling is a way of de-correlating the trees by showing them different training sets as a non-deterministic approach.Furthermore, cross-validation might be used to determine the optimal number of trees B even it is a free parameter.Normally, a number of trees ranging from few hundreds to thousands can be used depending on the size of the training set.B can also be determined by observing the out-of-bag error.The training and test error tend to level off after some number of trees have been fit (Hastie et al., 2008;Breiman, 2001).• Leaf is the lowermost node on the tree also called a terminal node.
• Node is a variable that significantly contributes to model explanatory power.
• Recursive Partitioning is the method by which RT operates -subsequently dividing the data into smaller portions to isolate most important variables.• Root is the top most node on the tree -the most important explanatory variable.
• Splitting (Purity) Criterion: In RT some criteria need to be used to determine the "best" primary way to split the data.While there are different criteria available, the Gini 3 impurity criterion is most widely used.In RT, this process of splitting makes the tree more "pure."Figure (3) demonstrates the accepted range of leaves in the RT.As shown in figure (3), the best leaf size is between 20 and 50 observations per leaf.

Data and Model Calibration
We gather hourly data for the wholesale energy market for load, electricity price and natural gas price.Load and gas price are traditionally used as strong determinants of electricity price.Our data sources are the Italian authority of energy and the GME (Gestore dei Mercati Energetici (3 Where in equations ( 9) and ( 10) is the actual price for a certain hour, and is the corresponding forecasted price for the same hour.In this regard, the calibration of the forecasting models is conducted on three phases as follows: Phase one: we select different artificial intelligence models as shown in table (3): networks model did not perform well in our case with respect to bagging technique.Equations ( 12) and ( 13) show the input variables for our models.Equation ( 12) is applied to the first model of bagged regression and to the neural network, in both models we utilize the same set of exogenous variables (multivariate equation).
Meanwhile, the second model of bagged regression without exogenous variables utilizes only the historical series of electricity price with dummy variables, and equation ( 13) represents the model.The frequency of previous prices and the dummies are used to capture the seasonality effect on prices.
Input data for the multivariate equation: Phase two: the second phase of the model calibration process is to select the first bagged regression trees with exogenous variables model and simulate it for a different number of leaves of the tree.It is worth mentioning that the number of leaves has an influence on the forecasting accuracy, and as mentioned earlier that the optimal number of leafs range between 20 to 50.Table (4) show the results of the model simulation:

Table 4: Bagged regression tree with diff. leaves
Table (4) clearly shows that forecasting errors change by changing the number of tree leaves.A tree with 25 leafs produces the least forecasting errors with respect to other trees.Phase three: in this phase we select the bagged regression tree with 25 leaves and then we simulate for different forecasting terms (horizon of the forecast).We use week ahead forecast, month ahead forecast, quarter ahead forecast, and semi-annum ahead forecast, and in all cases the frequency of the forecast is the hourly PUN.Table (5) summarizes the results of the horizon forecast simulation.

Table 5: Bagged regression with diff term forecast
Table (5) shows that as we move to longer horizons, forecasting performance improves, and the 6month ahead forecast produces better results in terms of forecasting errors compared to other term forecasts.

Descriptive Statistics
Table (6) demonstrates the basic descriptive statistics for the main variables used in this paper.We can clearly notice the declining trend of load and gas price which leads to a decline in the PUN price, this might be due to different facts such as the economics crises dated back to 2011, less severe winters in the last couple of years, or increased regulatory, consumption and management efficiencies of energy, the growth of renewables, and finally the sharp decline in oil prices internationally.Additionally, the standard deviation is decreasing as well supporting the premise of increasing efficiencies in regulation, consumption, and management.

Figure 4: PUN trend
As shown in figure (4), the average PUN exhibits a negative trend in the last 3 years.The average of the first half of 2015 is €50.1 and based on our forecast, the average of second half PUN is €51.6 producing an annual average of €50.85 being 2.4% less than 2014 PUN.

Model Results
The output of our model is the hourly PUN forecast for the other 194 days of the years (second half of 2015) starting from June 2015, classified as daily, weekly, and monthly, for each 24 hours a day.
The following graph summarizes the price profile of the model forecast.First of all, the price profiles resemble peak and off-peak price movement.Moreover, the average price forecast is higher in the second half of 2015, the price ranges between €21.4 and €75.7.Additionally, peak hours experience higher standard deviation as well as higher prices compared to other types of hours as we will see in next part of the analysis.The penetration of renewables can partially explain the price variance and min-max spread given that if the demand is met by renewables during off-peak hours, this drags the price down and stretching the difference between peak-off peak prices.

Forecasting accuracy
The forecasting accuracy is measured by MAE, MAPE, and MAPE at peak. Figure (8) and the upper part of figure (7) together show how the forecasting error is normally distributed around zero, and the mean value of the error is very close to zero, which makes the model appropriate for forecasting purposes.Additionally, the lower parts of figure (7) demonstrate the distributions of both MAE and MAPE on the tails of error distribution.We can, moreover, notice that MAE is less than 2 and MAPE is less than 5 which are clarified numerically in table (3).In this part, we show how the forecasting error changes with respect to the frequency of the forecast.Figure (9) shows the breakdown of the forecasting error for each hour of the day.We can clearly notice that peak hours (from 9:00 to 19:00) have a higher forecasting error.This result confirms the official reports published by the Italian Power Exchange and the Italian Energy Regulator, however, it was difficult to compare our result with other studies since our paper is -to our knowledge-the only to provide this type of detailed analysis.As we mentioned earlier, growth in renewables might partly explain forecasting errors at peaks hours, we say partly given the fact that the renewables share in the Italian generation mix is not that high compared to Germany or to the cheap nuclear generation in France.

Figure 9: Forecast error by hour
In table (7) we classify the forecasting error by each type of hour (peak, base, and off-peak) for each month of the forecast.Additionally, we graph the MAE per each type of hour per month in figure (10).The graph demonstrates the results provided in table (7) but in a more observable manner, we can see that hour 1 type which represent the peak hours experience higher error forecast with respect to other hour types especially in the month of December in which we will see later that December has the highest forecast error.Regarding the breakdown of forecast error for days of the week; figure (11) shows that Saturdays and Mondays have relatively higher forecast error.It is worth mentioning here that Saturday hours are considered on-peak or normal hours in terms of electricity demand but not the forecasting error, and higher error does not necessarily mean higher electricity demand.

Figure 11: Forecast error by day
Our results here are very much comparable to (Weron, 2006, table 1) in which he shows forecast error by days of the week for nine different time series models, we both show the same daily risk profile, but ours slightly outperformed.Notwithstanding (Adel Aal, 2012, table 3) demonstrates the same daily risk profile, his results outperformed both of us.In the same manner, the last breakdown (figure 12) shows the forecast error by month from June to December, and as indicated previously, the month of December exhibits the highest forecast error especially for the peak hours.Once again, the breakdown by month corresponds to official reports published but is not comparable to other research since it is an additional contribution to our paper.The classification of the forecasting error by hour, day, and month has an important implication on the physical as well as the commercial relationships in the power market, it helps in setting the power reserves for various demand profiles, and bidding strategies.Before concluding the analysis, it is worth comparing our results to other related papers.As indicated in (Weron, 2006), the use of errors varies a lot since there is no clear standard for how errors are defined in the literature, and what makes it even more complex is that the same error term is used differently.Accordingly, the comparability among different papers does not give a clear picture even though the same dataset is used.For example, some papers define the "Mean Weekly Error" as the weekly MAPE (literally as the average of the seven daily "average prediction errors", i.e. daily MAPE values) while others normalize it for the hours of the week.Additional complexity for the comparison is that our study forecasts hourly price while others forecast daily or zonal price.However, we try to do our best to find common forecast measures to construct a comparison table.

Conclusions & Implications
In the present paper, we forecast the Italian hourly electricity price "known as PUN" using a particular forecasting model.Artificial intelligence models are rarely applied on the Italian context.We focus on two main intelligence models, the neural networks, and the bagged regression trees.Furthermore, we calibrate the models using historical data of 2012, 2013, 2014, and the first half of 2015 while our objective is the hourly price forecast for the second half of 2015.During the calibration process we exclude neural network for their underperformance compared to bagged regression trees, then we exclude the bagged regression trees with single price for producing higher forecasting errors.
By the end of model calibration, we end up using regression trees with exogenous variables".This model is one of the least used forecasting models and additionally has never been applied to forecast the Italian wholesale electricity price.All models' calibration and forecasting are conducted using MATLAB 2015a.Our final model outperformed neural network model tested in this study and it also shows superiority to other models used in other studies in Italy and in other markets, and above all, it is a flexible model that can be well-customized to fit particular needs (White box modeling).We confirm the theoretical model specification in which the optimal leaf size lies between 20 and 50 (in our case is 25).Moreover, we also that lager sample size with longer forecasting horizons may reduce the variance without affecting the bias.
Various policy implications might be drawn from this study.First: exogenous variables such as gas price and load still play a positive role in the accuracy of the models.Second: the average PUN forecast for the second half of 2015 can serve public policy makers such as the energy regulators for price reference for the standard offer market, electricity market operators for their short-term operating needs and hedging purposes, as well as for investment companies for their energy portfolio balancing.Third: for regulatory authorities and generators, higher reserves have to be dedicated for hours, days, and months with higher forecasting errors.And lastly: the model can be adopted by the analysis departments of the interested parties and can be customized, calibrated, and simulated according to their particular needs.Finally, further research might be needed to validate the accuracy of the model for different time periods or extended sample series; it can also be verified by applying the model to different energy markets.In our study, we assume that load demand accounts for various exogenous variables, but the research might also be extended to include more of these variables such as weather forecast measures, the impact of renewable on the electricity network, production technologies, the new trend of distributed generation of energy, and market coupling.

2
the regulatory authority and is revised each semester, this market will be phased out by 2018 under the Italian energy regulations.Preprints (www.preprints.org)| NOT PEER-REVIEWED | Posted: 2 July 2016 doi:10.20944/preprints201607.0001.v1

Figure 5 :
Figure 5: Forecasting resultsFigure(5) demonstrates the basic statistics for each of 24 hours averages for the 194 days of forecast.First of all, the price profiles resemble peak and off-peak price movement.Moreover, the average price forecast is higher in the second half of 2015, the price ranges between €21.4 and €75.7.Additionally, peak hours experience higher standard deviation as well as higher prices compared to other types of hours as we will see in next part of the analysis.The penetration of renewables can partially explain the price variance and min-max spread given that if the demand is met by renewables during off-peak hours, this drags the price down and stretching the difference between peak-off peak prices.

Figure 6 :Figure ( 6 )
Figure 6: Price forecast by hour Figure (6) clarifies the series of all predicted 4,656 hours starting from the first hour of 21/06/2015 to the last hour of 31/12/2015 which corresponds to the hour 4,105 to 8,760 respectively.

Figure 8
Figure 8: Error statistics 4.4 Breakdown of the forecasting error

Figure 10 :
Figure 10: Forecast error by hour type and month

Figure 12 :
Figure 12: Forecast error by month

Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 2 July 2016 doi:10.20944/preprints201607.0001.v1 •
Reduced-form models look at the statistical features of temporal electricity prices for the purpose of derivatives valuation and risk management.•Statistical approaches utilize various econometric models ranging from time series with single price to various types of regression models with exogenous variables.• Artificial intelligence techniques combine elements of learning, evolution and fuzziness to create approaches that are capable of adapting to complex dynamic systems.
the goodness of the forecasting model.The most common measures are MAE (Mean Absolute Error), MAPE (Mean Absolute Percentage Error), MASE (Mean Absolute Scaled Error), and RMSE (Root Mean Square Error).With respect to the Italian market, very few studies use different models for electricity price at the wholesale market.(Gianfreda & Grossi, 2013) use a combined statistical model of ARFIRMA-GARCH with exogenous variables to forecast zonal electricity prices and then the PUN simulation, they measure predictive goodness by RMSE, MAPE, and Theil's U-index.(Guerci, Rastegar, & Cincotti, 2010 1.The sale of electricity is carried out daily by resorting to bargaining on an hourly basis and for each Geographic Area (North, Central North, Central, South, Sicily, and Sardinia) to the Single National Price (PUN).The Italian Power Exchange started in Italy as of 1 April 2004 constitutes, therefore, the place where the offers of producers meet consumers' demand and electricity wholesalers.The exchange commenced with a traded volume of 232 TWh in 2004 reaching 282 TWh in 2014 with a peak volume of 337 TWh achieved in 2008.Regarding the structure of the market, IPEX consists of three different platforms, the Spot Electricity Market (MPE), is a platform for physical delivery; the Forward Electricity Market (MTE), launched on November 2008; and the platform for physical delivery of financial derivatives (CDE) negotiated on the financial exchange (IDEX).As regarding the market mechanisms, the MPE is divided into Day-Ahead Market (MGP), Intra-Day Market (MI), Services Market (MSD).The Day-Ahead Market (MGP) is the venue for the majority of transactions for the sale of electricity and in which power is exchanged for the next day.It opens at 8:00 for nine days before the delivery day until 9:00 of the day before the day of delivery when the MGP closes.Here operators shall submit their offers in which they define the amount and the maximum and minimum price at which they are willing to buy and sell electricity.The Intra-Day Market (MI) allows operators to modify the schedules defined in the Day-Ahead Market through further purchase or sale.MI takes place in four sessions (MI1, MI2, MI3 and MI4) with different opening and closing times.

Table 2 : Model samples
(2)oreover, the data is gathered on hourly basis for 2012, 2013, 2014, and the first half of 2015.Each year consists 8,760 hours (except 2012 was a leap year which consisted one additional day making 8,784 hours) resulting in 30,408 observations for the given period.In this vein, and for the purpose of performing the model, we divide the data into data for training set which consists 26,304 observations from 2012 to 2014, and a testing data set which consists 4,104 observation representing the first half of 2015 until 20/06/2015.Training data is used to train the model by applying the appropriate algorithm of the model then fitting current observations to future ones.Then when the system sets the model, we test the model to forecast the first half of 2015 (actual) then compare the forecast with actual to calculate the forecast error.In this regard, we utilize various artificial intelligence models to fit the same data to be tested on the first half of 2015, then we select the model that minimizes errors of the forecast to generate future predictions of the second half of 2015 starting from 21/06/2015 to 21/12/2015 consisting 4,656 hours.All model calibration and analysis were performed using MATLAB 15.a.Table(2)demonstrates the division of model samples.

Table 3 : Trained models
The performance shows that the first model "bagged regression trees" with exogenous variables produces the least error forecast as shown in bold.Additionally, it is worth to notice that neural Preprints (www.

Table 6 :
Descriptive statisticsPUN and load data are directly obtained from power market platforms while gas price is referred to PSV Italian gas hub price (Punto di Scambio Virtuale).

Table 8 : Comparison of forecast performanceTable ( 8
Weron 2006ates the results of various forecasting measures used in different papers.We can clearly notice that our model "bagged regression tree with exogenous variables" outperformed other models in terms of MAE and MAPE, we additionally provide the peak MAPE which measures the forecast error at spikes.Furthermore, if we can consider WMAE as a variant MAPE (as inWeron 2006), our model is superior to the models of Weron & Misiorek for both California and Nord pool.