2.2. Collection of Data
This study has developed multiple thematic layers using various primary data such as soil, rainfall, and secondary spatial data, like satellite images, and elevation. It also employed socio-economic data gathered from the community and flood events inventory through the field survey.
Table 1 lists all of the data used, along with the sources of each data point. A “multi-source data collection” technique was used to guarantee a thorough mapping and impact evaluation of flood susceptibility.
A comprehensive floods susceptibility mapping and impact assessment was conducted in this study using a multi-source data collection approach, integrating both primary and secondary data. Various thematic layers were created to analyze flood-prone areas, as mentioned in
Table 1. This study has chosen nine factors influencing flood based on the literature and their significance too flood susceptibility. These factors include slope, precipitation, Topographic Wetness Index (TWI), elevation, distance to rivers, distance to roads, drainage density, Normalized Difference Vegetation Index (NDVI), and land use/land cover (LULC). Furthermore, these factors were divided into five groups using a scale of 1 to 5, where 5 denotes extremely high flood risk and 1 denotes very low flood risk[
30]. The local community’s knowledge on floods assessed through the survey was also employed in classifying flood susceptibility factors.
Figure 2 illustrates the methodology used in generating flood susceptibility map of the catchment under this study.
Natural break (jenk) is an algorithm used to reassign values to raster or vector data based on specific criteria, helping to categorize data into meaningful or homogeneous classes [
31]. It is one of the most widely used and precise algorithms for geographical environmental unit division. Since field verification cannot define the range of each class, we classified each geographical data layer into five categories using Jenks' natural break algorithm(Chen et al., 2013; Gui et al., 2022) (
Table 2). A Topographic Wetness Index (TWI) measures how much water is likely to accumulate in a given area based on its topography. As the saturation level increases, groundwater levels rise, leading to a higher risk of flooding. Consequently, TWI raises the likelihood of flooding in the area. In this study, the TWI is derived from the Digital Elevation Model(DEM) using the Equation (1) and the spatial analysis tools in ArcGIS [
34]. It is frequently used to measure the influence of topography on hydrological processes and terrain-driven soil moisture change. This index depends on both the upper stream's area per unit width orthogonal to the follow direction and the slope.
Where: β represents a local slope, and α represents a particular catchment area. The local slope in radians is expressed by tanβ, whereas the local upslope area draining through a specific location is represented by α per unit contour length.
Precipitation is among of the primary causes of river flooding. Precipitation falling in the area determines the runoff there could be, and the area becomes more vulnerable to flooding as the runoff increases [
35]. Rainfall data was also obtained from the meteorological stations nearby Sebeya catchment. Using the Kriging method, the mean annual rainfall of these stations was calculated using yearly rainfall data from 1991 to 2021. The data was then clipped inside the study area's boundaries.
Land use/land cover (LULC) is one of the key determinants of flood-prone areas. Urban areas with impermeable surfaces, such roads and buildings, are particularly vulnerable to flooding, whereas places covered with vegetation have a higher rate of infiltration and are hence less vulnerable [
36,
37]. This study used supervised classification on sentinel-2 imagery to categorize the land into five classes: bare land, agriculture, vegetation, water bodies, and settlement.
The Normalized Difference Vegetation Index (NDVI) as in equation (2), measures the difference between red light, which vegetation absorbs, and near-infrared light, which plant strongly reflects, to quantify the vegetation characteristics in a given area [
38]. The range between -1 to +1, with a number close to +1 denotes vegetation that serves as flood protection [
39].
NIR expresses Near Infra-Red values, whereas R expresses visible (red) values. B4 and B5 express Band 4 and Band5 respectively. Those Bands are mostly applied for NDVI calculation.
Distance to the river is another factor that contributes to flooding because the river and its surrounding lands are the primary flood channel, making them extremely vulnerable [
35].The river network used in this investigation was derived from DEM data. The Euclidean function in the ArcGIS platform was used to determine the distance to each river.
The distance to a road influences the likelihood of flooding because the impervious surface grows close to the road, increasing the risk of flooding[
40]. The Overpass-turbo was also used to retrieve the road network from Open-Street Map. The ArcGIS's Euclidean function was used to calculate the distance from each road.
The ratio of the basin's area to the entire drainage channel is known as the drainage density. A high drainage density increases the amount of water that accumulates in a given area, making it more vulnerable to flooding[
41]. A drainage network was established in the study area by using the DEM and the Raster calculator tool in ArcGIS to create flow accumulation. The drainage density was then determined using the Line Density tool on this drain network.
Precipitation, drainage density, and distance to the river have been given the highest weight in
Table 2 since they are directly related to flooding in this area. Historical floods demonstrate that the short-rainy season's rainfall causes the river's maximum discharges and the locations close to the river and with the highest drainage density are the most vulnerable to flooding.
Elevation, slope, and TWI were given a medium weight because they are important characteristics for floods that have less impact. They play a medium influence in floods and are mostly found in lowland parts with less slope and wetlands. Similar to this, LULC, NDVI, and road distance are given less weight because other criteria have overshadowed their significance.
Figure 3.
Flood susceptibility factors: a)Elevation; b) Slope; c) TWI; d)LULC ; (e NDVI; f) Precipitation; g) Drainage Density; h) Distance to River; and i) Distance to Road.
Figure 3.
Flood susceptibility factors: a)Elevation; b) Slope; c) TWI; d)LULC ; (e NDVI; f) Precipitation; g) Drainage Density; h) Distance to River; and i) Distance to Road.
2.3. Analytical Hierarchy Process (AHP)
AHP is a multi-criteria decision-making approach that was created by Saaty in 1990 [
19], with the goal of streamlining and enhancing the decision-making process. This approach allows planners and users to quantitatively determine a scale of preference derived from a collection of options[
39]. The AHP applies the pairwise comparison approach to determine each criterion's weight or priority vector[
25]. The use of a pairwise comparison matrix (PCM)enables evaluating the relative weights of several criteria according to the expert's assessment[
21]. The flood conditioning factor is prepared as a pairwise comparison matrix with n × n dimensions. Each of these separate flooding factors is given a value on a scale from 1 to 9, where a lower number of 1 indicates that both variables are equally important and a higher number of 9 indicate that the row factor in PCM is more essential than the column factor according to the Saaty scale[
19]in
Table 3.
This methodology created a pairwise comparison matrix of selected flood conditioning factors of dimension 9 × 9 based on a variety of literature reviews. Each row is compared with each column element to determine the relative relevance for producing the rating score, and the diagonal elements are always equal to 1 in the pairwise comparison matrix displayed in
Table 4[
42]. The normalized pairwise matrix and final weights, as indicated in
Table 5, were calculated using the importance of each factor to the flood, the data from prior studies, and the expert's judgment of those who have worked in flood in the past [
43].
When calculating the value for a pairwise comparison matrix, there may be numerous discrepancies; therefore, the Consistency Ratio (CR) must be calculated as a consistency check [
44]. The consistency ratio, which is the ratio of a matrix of the same size's Consistency Index (CI) to Random Inconsistency Index (RI), must always be less than 0.1 in order to be considered acceptable for weighting [
39]. The CR is calculated according to the Equation (3).
The equation (4) was used to calculate Consistency Index (CI).
Where n expresses the number of factors, and λ expresses average value of the consistency vector.
While calculating the CI, we determine consistency vector (CV) by multiplying the original pairwise matrix (A) by the weight vector (w), and then divide each element of the consistency vector by the corresponding element in the weight vector (w) (
Table 6). The λ_max (Lambda Max), is calculated by considering average of these values.
The Lambda is calculated by taking the sum of the ratios mentioned in the
Table 7 divided by the number of factors employed in constructing the AHP model.
Lambda= 86.234/9= 9.595092
According to the formula (4), the Consistency Index (CI) measures the deviation from consistency, where n is the number of criteria (n=9).
CI = (9.595092- 9) / (9 - 1) = 0.074386
The Random Index (RI) is an empirically computed baseline value of inconsistency for completely random pairwise-comparison matrices of a given size (
Table 9). It’s used to normalize the consistency of a real decision maker’s matrix to verify whether the judgments made are acceptably consistent or inconsistent as random guessing. It is a constant that depends on the random sampled pairwise matrix (n).
For our example, n=3, and therefore RI = 1.45.
Finally, the CR is calculated by comparing CI to the RI. Referring to the formula (3),CR = 0.074386/ 1.45 = 0.0513. Since 0.0513≤ 0.10, the consistency of our pairwise comparison matrix is acceptable. We can confidently use the derived weights (0.142, 0.137, 0.112, 0.152, 0.071, 0.067, 0.155, 0.059, and 0.106) for the rest of our AHP analysis.
In this study, nine factors were processed in ArcGIS software to discover flood risk susceptible zones.
Table 2 states that each of the factor maps has been categorized into five different classes and transformed to a raster format of size. Using the weighted overlay technique, the sum of these outcomes was multiplied by the factor weight of each reclassified map layer. The overall map of flood susceptibility in the study area was produced by equation (5).
Where FS expresses flood susceptibility,
wi as a factor weight, and
xi represent class of flood susceptibility for each factor
i.