The framework analyzes the behavior of urban greenery typologies during heatwaves by considering the raster data indices NDVI, NDMI and LST determined by multispectral satellite images. The next paragraph describes in detail the framework; paragraph 2.2 discusses the case study given by the city of Naples (Italy) and describes the data used to test our method.
2.1. The Proposed Framework
The framework is schematized in
Figure 1.
After extracting the raster data and giving the spatial distribution of the three indices during a heatwave, for each index, a raster classification process is performed. This process is implemented using the Natural Breaks thematic classification algorithm which, in partitioning the real interval corresponding to the index definition domain, determines the lower and upper bounds of the subset corresponding to each class (the breaks), so that the average standard deviation of the value of the pixel index assigned to each class is as small as possible.
We adopt the Elbow method to set the number of classes. The Elbow method is a heuristic approach applied in cluster analysis to set the optimal number of clusters, given by the number of clusters for which the curve of the trend of the quadratic sum of the distances of the data points from the centers of the clusters they belong to has an elbow, that is, for higher values of the number of clusters, the curve flattens.
Compared to other validity indices used for determining the optimal number of clusters, the Elbow method has the advantage of being fast; however, its main flaw is the ambiguity in determining the precise location of the curve elbow [
20,
21]. To overcome this limit, we propose to set the elbow point equal to the value k of the number of classes for which the reduction in variance compared to that obtained with k -1 classes is less than 25%.
Formally, let K be the number of classes in which is partitioned the domain of the index and let
be the variance of the index for the i
th class, where i = 1,2,…,K. The average variance obtained considering K classes is given by:
The reduction in variance with respect to the use of K-1 classes is given by:
The first value of the number of classes K for which is less than 0.25 is selected as the Elbow point.
In
Figure 2 shows an example of application of this approach for determining the optimal number of classes. The selected value for K is K = 7, where a value of
less than 0.25 is achieved.
The result of the raster classification process is a raster dataset in which each pixel is assigned the identifier of the thematic class to which it belongs.
A label is assigned to each thematic class following the order of the class and assigned to the central classes the term Medium, if K is odd or the terms Medium low and Medium high if K is even. For example, if K = 5, the third class, which constitutes the central class, is assigned the label Medium, the second and fourth classes, respectively, the labels Medium low and Medium high. Assuming K = 6, the third and fourth classes are assigned the labels Medium low and Medium high, respectively, the labels Low and High are assigned to the second and fifth classes, and the first and last class the labels Very low and Very high.
In the next step, a classification of the urban green areas based on each index is performed, where the urban green areas in the area of study are given by polygons and each polygon is assigned the type of greenery. To carry out this classification, each thematic class into which the index domain has been partitioned is assigned a Gaussian fuzzy set whose parameters are given by the average value and the standard deviation of the pixels belonging to the thematic class.
Formally, let μ
k and σ
k be, respectively, the mean and the standard deviation of the values of the pixels belonging to the k
th thematic class, where k = 1,…,K. To this thematic class is assigned a Gaussian fuzzy cluster having membership function:
And having as label the label of the thematic class.
For example, let consider the following table, in which are shown the average and the standard deviation of the NDMI index for all the six thematic classes in which is partitioned the domain (
Table 1).
Assigning a fuzzy set to each thematic class, are obtained the following Gaussian fuzzy sets (
Figure 4)
Using a zonal statistics process, to each polygon representing an urban green area is attributed a value of the index given by the average of the values of the pixels covering this polygon; then to this polygon is assigned the label of the Gaussian fuzzy set to whom it belongs having the highest membership degree.
For example, if the average of the NDMI values of the pixels covering this polygon is 0.11, it is assigned the label Medium high, as it belongs to the Gaussian fuzzy set termed Medium high with the highest membership degree (0.92). This membership degree is interpreted as the uncertainty of this assignment.
The outcomes of this step are three thematic maps in which the green areas are represented, respectively, by NDVI, NDMI and LST class. Each urban green area is assigned the labels of the NDVI, NDMI and LST classes, the uncertainties of the assignment to each of the three classes, as well as the type of greenery.
In the last step, an analysis of the behavior of the different types of urban greenery during the heatwave is conducted. For each type of urban greenery and for each of the three indices, the frequencies of green areas falling into each of the thematic classes into which the index has been divided are determined. By comparing the frequency distributions of the three indices it is possible to evaluate the behavior of the type of urban greenery during heatwaves. This analysis will allow us to detect the presence of types of urban greenery in which significant frequencies of urban areas belonging to specific NDMI, NDVI and LST classes are recorded and to verify whether for these types of urban greenery, these classes relating to the three indices are correlated.
Furthermore, for each type of greenery and each class a reliability is determined consisting of the average of the membership degrees of the green areas to the corresponding Gaussian fuzzy set.
To clarify this process, the frequencies of vineyards belonging to six NDMI classes and the average of membership degrees are shown as an example in
Table 2.
The horizontal histogram in
Figure 4 shows the distribution of frequencies obtained. The histogram highlights that approximately 70% of vineyards are classified with NDMI High or Medium high; furthermore, over 76% of the vineyards are classified with at least Medium high NDMI and a reliability of at least 67%.
The mean membership degrees can be interpreted as the reliability of the assignment to a class of that frequency of urban greenery. In the example in
Table 2 it is higher than 0.5 and can be considered acceptable, as the uncertainty in assigning vineyards to an NDMI class is negligible.