Preprint
Article

Espresso : a Global Deep Learning Model to Estimate Precipitations from Satellite Observations

A peer-reviewed article of this preprint also exists.

This version is not peer-reviewed

Submitted:

26 July 2023

Posted:

27 July 2023

Abstract
Estimating precipitation is of critical importance to climate systems and decision-making processes. This paper presents Espresso, a deep learning model designed for estimating precipitation from satellite observations on a global scale. Conventional methods, like ground-based radars, are limited in terms of spatial coverage. Satellite observations, on the other hand, allow global coverage. Combined with deep learning methods, these observations offer the opportunity to address the challenge of estimating precipitation on a global scale. This research paper presents the development of a deep learning model using geostationary satellite data as input and generating instantaneous rainfall rates, calibrated using data from the Global Precipitation Measurement Core Observatory (GPMCO). The performance impact of various input data configurations on Espresso was investigated. These configurations include a sequence of four images from geostationary satellites and the optimal selection of channels. Additional descriptive features were explored to enhance the model’s robustness for global applications. When evaluated against the GPMCO test set, Espresso demonstrated highly accurate precipitation estimation, especially within equatorial regions. A comparison against six other operational products using multiple metrics indicated its competitive performance. The model’s superior storm localization and intensity estimation were further confirmed through visual comparisons in case studies. Espresso has been incorporated as an operational product at Météo-France, delivering high-quality, real-time global precipitation estimates every 30 minutes.
Keywords: 
Subject: Environmental and Earth Sciences  -   Remote Sensing

1. Introduction

Precipitation is a crucial element in Earth’s climate system. Its significant influence spans across various areas of human life, such as agriculture and water resource management. Notably, its variability and extremes can pose considerable risks and dangers to population and property, underscoring the importance of disaster mitigation strategies. Accurate and prompt precipitation estimates are vital for effective decision-making within these domains. Traditional precipitation estimation methods primarily depend on ground-based observations from rain gauges and weather radar systems. However, the spatial distribution of rain gauges is too sparse to accurately capture the variability of precipitation. While radars can provide real-time precipitation measurements with high spatiotemporal resolution, installation and maintenance costs limit their widespread use. As of 2023, one-third of Earth’s countries, including highly populated regions like Africa, lack coverage from precipitation radars [1].
On the other hand, meteorological satellites allow for global observations and satellite remote sensing has emerged as a powerful tool for precipitation estimation. This method harnesses the unique capabilities of Earth observation satellites, offering comprehensive spatial coverage and high-frequency measurements. Algorithms for satellite-based precipitation estimation typically merge infrared (IR) measurements from geosynchronous-Earth-orbiting (GEO) satellites with passive microwave (PMW) data from low-Earth-orbiting satellites (LEO). GEO satellites provide nearly global monitoring with a timestep ranging from 5 to 15 minutes at high spatial resolution. However, visible and IR channels only provide information about the cloud top, rendering the precipitation estimation indirect. Meanwhile, PMW measurements, which are directly sensitive to hydrometeors, are only available twice a day for a given area and satellite [2] and have relatively poor spatial resolution.
Active microwave observations from radars onboard satellites, such as the Tropical Rainfall Measuring Mission [3] (TRMM, 1998-2014) and the ongoing Global Precipitation Measurement Core Observatory mission (GPMCO, 2014-present) [4], are also available. However, these satellites cover a very limited area. For instance, the swath of the GPMCO only spans 245 km.
Over the past two decades, numerous satellite-based precipitation estimation products have been developed by combining various types of data. These products are widely used for monitoring natural disasters, initializing numerical weather forecasting models, and evaluating precipitation forecasts. For instance, some products exclusively utilize IR data as input. The Global Hydro Estimator (GHE; [5]) employs a fixed relationship between IR data and rainfall rates, calibrated initially with radar data. The PERSIANN Dynamic Infrared–Rain Rate Model (PDIR; [6]) calibrates IR data with PMW datasets and climatology data via several machine learning algorithms. On the other hand, Quantitative Precipitation Estimation (QPE, [7]) and P-IN-SEVIRI [8] use IR data calibrated in real-time with the most recent PMW data through the SCaMPR [9] and Rapid Update [10] algorithms, respectively.
Certain products also incorporate several data types. The Global Satellite Mapping of Precipitation (GSMaP; [11]) combines microwave data with IR data and rain gauges through a Kalman filter. The Integrated Multi-satellitE Retrievals for GPM (IMERG; [12]) aims to intercalibrate and merge all PMW precipitation estimates with IR estimates, precipitation gauge analyses, and potentially other precipitation estimators. All the mentioned products offer rainfall estimation within an hour, except for IMERG, whose "Early Run" is available four hours post the start of data acquisition. Table 1 summarizes the characteristics of these products, which will be used for comparison.
Recent advancements in computing technology and increased availability of large-scale satellite datasets have unveiled new opportunities for enhancing precipitation estimation accuracy with deep learning methods. Deep learning, a branch of machine learning, has earned significant interest across various scientific disciplines due to its ability to autonomously learn intricate patterns and relationships from extensive datasets. Applying deep learning techniques, particularly deep convolutional neural networks (CNNs), to satellite data has demonstrated promising results in several domains, including image classification [13], object detection [14], and image segmentation [15]. Importantly, CNNs are capable of efficiently extracting complex spatial features from images without the need for meticulous feature engineering.
Within the context of precipitation estimation, deep learning algorithms hold the potential to leverage the abundant information in satellite imagery for accurate and real-time precipitation estimation, a potential that has already been successfully demonstrated in several studies. Many researchers have used IR channels as their sole input and ground radars in the United States of America (USA) as their training target to train various networks such as Stacked Denoising Autoencoders [16], Convolutional Neural Networks (CNN) [17,18], U-Net, and Conditional Generative Adversarial Networks (cGANs) [19]. Some researchers, also working with USA ground radar data, have attempted to combine IR data with PMW data using Multi-Layer Perceptron (MLP) [20] or Generative Adversarial Networks (GAN) [21]. The authors of [22] pretrained their network on USA ground radar data and fine-tuned the network on sparse Chinese radar data, achieving superior performance compared to direct training on Chinese data. More recently, [23] is the sole study using Level 3 IMERG GPM data instead of ground radar data, with authors focusing on the southeast coast of China and employing five IR channels as input to train an Attention U-Net.
Deep learning has not yet been globally applied to satellite precipitation estimation to investigate if deep neural networks can generalize to various parts of the globe and diverse climates. In this study, the objective is to build upon previous methodologies by training a state-of-the-art DeepLabV3+ architecture on a global precipitation dataset. This dataset is a combination of geostationary satellite data and additional descriptive features as input, with precipitation measurements from the GPMCO satellite serving as the target. Various configurations of input data, loss functions, and hyperparameters will be examined to identify the highest-performing model. The model’s performance will be evaluated globally against GPMCO data. Additionally, we will compare our model with six operational satellite-based precipitation products using ground-based radar measurements from the French radar network as the reference. This evaluation will be conducted across four distinct regions spread across the globe.
This paper is structured as follows: Section 2.1 introduces the datasets and study regions; Section 2.2 describes the neural network and the experimental protocol; Section 3 evaluates the quality of the model with various input data (Section 3.1), against a test set of GPMCO data (Section 3.2), in comparison with other operational products (Section 3.3), and finally through case studies (Section 3.4); Section 4 and Section 5 present the conclusions and outline future prospects.

2. Materials and Methods

2.1. Data

2.1.1. Target data

Precise, global precipitation data are a prerequisite for developing a global precipitation estimation product. To circumvent issues associated with intercalibration between different ground-based radar networks and to ensure coverage of diverse climates and regions, we opt to use the GPM DPR and GMI Combined Precipitation L2B product from the GPM mission [24]. This product provides highly accurate, high-resolution surface rainfall estimates by merging radar and PMW data from the GMI and DPR instruments onboard GPMCO.
GPMCO orbits the Earth approximately every 1 hour and 30 minutes, leading to 16 swaths daily covering latitudes between 65°N and 65°S. Each GPMCO swath from 2018 to 2022 is projected onto a 4000x8000 grid using the equirectangular projection (0.045° resolution, i.e. 180°/4000 = 360°/8000) and subsequently divided into contiguous 512x512 pixel tiles covering the respective swath. This process produces over 320,000 precipitation tiles that contain instantaneous precipitation data in mm/h. Figure 1a) presents an example of such a tile.
Tiles to be included in our dataset are carefully selected. Rainfall events, especially heavy rain, are uncommon. Heavy rainfall is particularly rare in regions closer to the poles. Convective rainfall, which usually produces the heaviest precipitation, is more common near the equator. For instance, tiles within latitudes [-70°;-50°] and [50°;70°] account for 42% of all tiles, but they represent only 15% of the tiles with rainfall exceeding 20 mm/h. To ensure the neural network effectively generalizes across the globe, it’s essential that heavy rain is uniformly represented in the training dataset across all latitudes.
Tiles were grouped into latitude bands of 20° and yearly groups, and were sorted based on their maximum precipitation value. For regions closer to the poles, the top 600 tiles with the highest recorded rainfall were selected. For regions closer to the equator, we randomly selected 600 tiles each year from each latitude band with a maximum rainfall exceeding 75 mm/h. This process resulted in a dataset comprising 21,000 tiles. Figure A1 in Appendix A illustrates the distribution of the maximum precipitation of the selected 21,000 GPMCO data tiles for the seven latitude bands. It can be noted that, despite this selection process, heavy rainfalls are less represented in latitudes closer to the poles. For example, for the band [-70°,-50°], data with maximum precipitations under 50 mm/h had to be included to adequately populate the dataset.
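The selection procedure described above can be sketched as follows. This is a minimal illustration, not the authors' code: the tile records (dictionaries with `lat`, `year`, and `max_precip` keys), the function names, and the choice of which bands count as "closer to the poles" (`polar_bands`) are all assumptions made for the example. Only the 20° band width, the 600 tiles per band and year, and the 75 mm/h threshold come from the text.

```python
import random
from collections import defaultdict

BAND_WIDTH_DEG = 20       # latitude band width (from the text)
TILES_PER_GROUP = 600     # tiles kept per band and per year (from the text)
HEAVY_RAIN_MM_H = 75.0    # threshold for equatorial bands (from the text)


def latitude_band(lat_deg):
    """Return the lower edge of the 20-degree band containing lat_deg.

    Bands are assumed to start at -70 degrees: [-70, -50), [-50, -30), ...
    """
    return -70 + BAND_WIDTH_DEG * int((lat_deg + 70) // BAND_WIDTH_DEG)


def select_tiles(tiles, polar_bands=(-70, 50), k=TILES_PER_GROUP, seed=0):
    """Select up to k tiles for every (latitude band, year) group.

    polar_bands lists the lower edges of the bands treated as "closer to
    the poles"; this choice is an assumption of the sketch.
    """
    rng = random.Random(seed)
    groups = defaultdict(list)
    for tile in tiles:
        groups[(latitude_band(tile["lat"]), tile["year"])].append(tile)

    selected = []
    for (band, _year), group in sorted(groups.items()):
        if band in polar_bands:
            # Polar bands: keep the k tiles with the highest maximum rainfall.
            ranked = sorted(group, key=lambda t: t["max_precip"], reverse=True)
            selected.extend(ranked[:k])
        else:
            # Other bands: random draw among tiles exceeding the threshold.
            heavy = [t for t in group if t["max_precip"] > HEAVY_RAIN_MM_H]
            selected.extend(rng.sample(heavy, min(k, len(heavy))))
    return selected
```

With seven bands, five years, and 600 tiles per group, a full selection yields 7 x 5 x 600 = 21,000 tiles, which matches the dataset size reported above.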

2.1.2. Input data

To develop a global, real-time product, input data is based on geostationary (GEO) data. During the experimental phase, data from five geostationary satellites was selected. The details are provided in Table 2.
This study is limited to four channels available on all five geostationary satellites. These channels are centered around the 10.8 µm and 12.0 µm wavelengths for cloud and surface temperature, the 6.2 µm wavelength for high-layer water vapor, and the 0.6 µm visible wavelength. These channels will be referred to as IR108, IR120, WV062, and VIS006, respectively.
For each precipitation tile, four GEO channels are associated, using the most recent image available prior to the passage of GPMCO. In addition, we include three previous GEO images of each channel, allowing the neural network to extract information from this series of images over time. GEO data is projected onto the same 0.045° grid as the GPMCO precipitation tile and corrected for parallax. Visible images are also corrected for solar angle. These images are then normalized between 0 and 1, using the maximum and minimum values derived from the entire training dataset. Figure 1 provides an example of a precipitation tile and its associated most recent GEO channels.
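The normalization step can be illustrated with a short sketch. The function names are hypothetical, and clipping values that fall outside the training range (which can happen on validation or test data) is an assumption of this example rather than something stated in the text.

```python
def fit_min_max(training_values):
    """Return the (minimum, maximum) of one channel over the training set."""
    values = list(training_values)
    return min(values), max(values)


def normalize(value, vmin, vmax):
    """Scale a pixel value to [0, 1] using training-set extremes.

    Out-of-range values are clipped (an assumption of this sketch).
    """
    scaled = (value - vmin) / (vmax - vmin)
    return min(1.0, max(0.0, scaled))


# Example with made-up brightness temperatures in kelvin.
vmin, vmax = fit_min_max([180.0, 300.0, 240.0])
print(normalize(240.0, vmin, vmax))  # 0.5
```

Computing the extremes once on the training set, rather than per image, keeps the scaling identical across samples and across the training, validation, and test sets.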
In addition, the following descriptive features are included: the normalized latitude, longitude, sun elevation, and altitude of each pixel within the tile. Finally, seasonal information in the form of two masks representing the sine and cosine of the day in the yearly cycle is included.
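One common way to build the two seasonal masks is to map the day of the year onto a circle, so that 31 December and 1 January end up close to each other. The sketch below assumes this encoding and assumes that each mask is constant over the tile; the function names and the 365.25-day period are illustrative choices, not details taken from the text.

```python
import math
from datetime import date


def seasonal_encoding(day, days_in_year=365.25):
    """Return (sin, cos) of the position of `day` in the yearly cycle."""
    day_of_year = day.timetuple().tm_yday
    angle = 2.0 * math.pi * day_of_year / days_in_year
    return math.sin(angle), math.cos(angle)


def constant_mask(value, height=512, width=512):
    """Build a tile-sized mask filled with a single value."""
    return [[value] * width for _ in range(height)]


sin_day, cos_day = seasonal_encoding(date(2022, 7, 1))
sin_mask = constant_mask(sin_day)
cos_mask = constant_mask(cos_day)
```

Using both the sine and the cosine removes the ambiguity of a single periodic value: each day of the year maps to a unique point on the unit circle.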

2.1.3. Dataset split

After merging the precipitation tiles with the GEO data and additional descriptive features, the dataset is split into three parts. The training set includes samples from 2018 to 2020 and the odd-numbered months of 2021. The validation set comprises the even-numbered months of 2021; it is used to monitor the convergence of the training algorithms and to evaluate the performance of intermediate experiments, such as the selection of hyperparameters and input data. Data from 2022 is reserved for the test set, which remains untouched throughout the intermediate experiments. This reserved test set is only used at the final stage to evaluate the accuracy of the selected best neural network and compute scores against the GPMCO data.
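The split rule can be written down as a small function. The function name and the string labels are hypothetical; the year and month boundaries are those given in the paragraph above.

```python
def assign_split(year, month):
    """Map a sample's acquisition date to 'train', 'validation', or 'test'."""
    if 2018 <= year <= 2020:
        return "train"
    if year == 2021:
        # Odd-numbered months go to training, even-numbered to validation.
        return "train" if month % 2 == 1 else "validation"
    if year == 2022:
        return "test"
    raise ValueError(f"year {year} is outside the 2018-2022 dataset")


print(assign_split(2021, 7))  # train
print(assign_split(2021, 8))  # validation
```

Alternating months within 2021 keeps the validation set seasonally balanced, while holding out the whole of 2022 prevents any temporal overlap between training and test data.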

2.1.4. Evaluation data

In a final experiment, the precipitation estimation method is assessed by comparing it with six operational products (listed in Table 1). To ensure consistency in time frequencies between the different products, the data is aggregated over a 1-hour period. Therefore, the GPMCO instantaneous rainfall test data cannot be used as a reference. Instead, data from the French radar network is selected. This data covers four distinct regions, each with different climatic characteristics and under the responsibility of Météo-France: mainland France, the French West Indies (Martinique and Guadeloupe islands), the Réunion island, and New Caledonia. These areas are referred to as FR, AG, RE, and NC, respectively. Figure 2 shows the radar coverage of these regions. While FR experiences a temperate climate, the other three regions are located in tropical areas and undergo seasonal tropical cyclones in the Atlantic Ocean (AG), the Indian Ocean (RE), and the South Pacific Ocean (NC). By including these four regions, our aim is to cover a diversity of climates that allows us to assess the generalization ability of our neural network and compare its performance with that of the other products.
The evaluation set was assembled using data from 2022, as this year’s data was not used to train the neural network. To focus on situations with heavy rainfall, we selected the 1,000 situations during the year with the highest number of pixels indicating heavy rainfall for each radar domain. Since there are many days without rain, these 1,000 samples are distributed across 110 to 170 days, depending on the radar domain. For each precipitation estimation product and the four radar areas, the data was projected and resized onto a 0.045° grid to construct the 1,000 samples of each region. Data were provided by [25] for IMERG, [26] for GSMAP, [27] for QPE, [28] for GHE, [29] for PDIR-NOW, and [30] for P-IN-SEVIRI.

2.2. Methodology

2.2.1. Model

The deep neural network used in our study is DeepLabv3+ [31]. We selected DeepLabv3+ over the commonly used U-Net [23,32] due to its Spatial Pyramid Pooling module and encoder-decoder structure. These architectural features allow DeepLabv3+ to encode multi-scale information, extract denser feature maps, and generate sharper object boundaries. Despite having a larger receptive field than U-Net, DeepLabv3+ has fewer weight parameters and is less prone to overfitting. We employed the implementation of DeepLabv3+ from Keras Code Examples [33] and modified the final activation layer to a ReLU function to transform it into a regressor. Moreover, the dilation rates inside the Spatial Pyramid Pooling module were adjusted to reduce the receptive field of the network, which was originally larger than the size of our images. Precisely, we changed the four dilation rates in the module from [1,6,12,18] to [1,2,4,6].
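The effect of the dilation change can be checked with a short calculation. A convolution with kernel size k and dilation rate d spans k + (k - 1)(d - 1) positions of its input. The sketch below assumes 3x3 kernels in every branch of the pooling module; some implementations use a 1x1 convolution for the rate-1 branch, in which case the first value would be 1 instead of 3.

```python
def effective_kernel_size(kernel_size, dilation):
    """Span, in input positions, of a dilated convolution kernel."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


for rates in ([1, 6, 12, 18], [1, 2, 4, 6]):
    spans = [effective_kernel_size(3, d) for d in rates]
    print(rates, "->", spans)
# [1, 6, 12, 18] -> [3, 13, 25, 37]
# [1, 2, 4, 6] -> [3, 5, 9, 13]
```

As an illustration, if the encoder reduced a 512x512 tile by a factor of 16 (an assumption, not stated in the text), the feature map would be 32x32, so a kernel spanning 37 positions would reach beyond it, whereas the largest adjusted kernel spans only 13.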
A variant of the Mean Squared Error (MSE) that incorporates an additional weight was adopted for the loss function. Its weight is primarily determined by the target value of the precipitation and is exponentially related to it. The aim of this modification is to assign higher weights to pixels with higher target precipitation values. Without this modification, the standard MSE tends to smooth out extreme values, which are of great interest in our applications.
The weight in the loss function also takes into account the latitude of each pixel, giving more importance to samples from northern and southern latitudes where heavy rainfall is less common. Moreover, the lower tropopause near the poles results in the attenuation of the IR brightness temperature, making it harder for the network to detect. Incorporating this weight in the loss function supports addressing the imbalance caused by the relative scarcity of these samples in our dataset.
Finally, a threshold is incorporated on this weight, which leads to the following loss function:
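$$L = \frac{1}{n} \sum_{i=1}^{n} W_i \, (Y_i - \hat{Y}_i)^2 \quad \text{with} \quad W_i = \begin{cases} w_i = (1 + l_i) \exp\left(\frac{Y_i}{\beta} + \gamma\right) & \text{if } w_i \le 20 \\ 20 & \text{otherwise} \end{cases}$$
where $Y_i$ is the target precipitation, $\hat{Y}_i$ is the estimated precipitation, $l_i$ is the normalized latitude, $\beta = 50$, and $\gamma = 1.9$.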
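A plain-Python sketch of this loss is given below for illustration. It reads the weight as (1 + l) exp(Y/β + γ) capped at 20, with β = 50 and γ = 1.9 as stated. How the latitude is normalized is not specified; the example assumes a value between 0 (equator) and 1 (highest latitude). The function names are hypothetical.

```python
import math

BETA = 50.0        # from the text
GAMMA = 1.9        # from the text
MAX_WEIGHT = 20.0  # cap on the weight, from the text


def pixel_weight(target, norm_lat):
    """Weight that grows exponentially with target rain and with latitude."""
    w = (1.0 + norm_lat) * math.exp(target / BETA + GAMMA)
    return min(w, MAX_WEIGHT)


def weighted_mse(targets, estimates, norm_lats):
    """Mean of the weighted squared errors over all pixels."""
    total = sum(
        pixel_weight(y, lat) * (y - y_hat) ** 2
        for y, y_hat, lat in zip(targets, estimates, norm_lats)
    )
    return total / len(targets)


# A 1 mm/h error on a heavy-rain pixel costs more than on a dry pixel.
print(weighted_mse([0.0], [1.0], [0.0]))    # about 6.69
print(weighted_mse([60.0], [59.0], [0.0]))  # 20.0 (weight capped)
```

Under this reading, the cap is reached at roughly 55 mm/h at the equator (l = 0) and at roughly 20 mm/h where l = 1, so the cap prevents a few extreme pixels from dominating the gradient.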
Figure 3 illustrates the experimental training framework, including all the various input features available.

2.2.2. Metrics

The various precipitation estimation models were evaluated using several metrics. The first metric is the Root Mean Squared Error (RMSE), which is suitable for our regressor model. However, the RMSE alone cannot inform about the model’s ability to effectively detect rain or accurately estimate heavy rainfall. Hence, post-processing is carried out on the rain estimates by classifying the rain rate $r$ (in mm/h) into four categories: No rain ($r < 1$), Light rain ($1 \le r < 5$), Medium rain ($5 \le r < 20$), and Heavy rain ($r \ge 20$). From these categories, the confusion matrix is computed, and for the three rain thresholds, the Probability of Detection (POD) and False Alarm Rate (FAR) are calculated. The equations for all the mentioned scores are as follows:
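$$\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (\hat{Y}_i - Y_i)^2}$$
$$\mathrm{POD} = \frac{\text{True Positive}}{\text{True Positive} + \text{False Negative}}$$
$$\mathrm{FAR} = \frac{\text{False Positive}}{\text{False Positive} + \text{True Negative}}$$
In the above equations:
  • $\hat{Y}_i$ represents the predicted values;
  • $Y_i$ represents the actual values;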
  • "True Positive" refers to the number of correctly predicted positive values;
  • "False Positive" refers to the number of incorrectly predicted positive values;
  • "True Negative" refers to the number of correctly predicted negative values;
  • "False Negative" refers to the number of incorrectly predicted negative values.
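The scores defined above can be computed as in the following sketch. It treats each rain threshold as an exceedance event (a pixel is "positive" when its rate is at or above the threshold), which is an assumption about how the three thresholds are applied; the function names are illustrative.

```python
import math

THRESHOLDS_MM_H = (1.0, 5.0, 20.0)  # light, medium, and heavy rain thresholds


def rmse(estimates, targets):
    """Root mean squared error over paired pixel values."""
    squared = [(e - t) ** 2 for e, t in zip(estimates, targets)]
    return math.sqrt(sum(squared) / len(squared))


def contingency(estimates, targets, threshold):
    """Count (TP, FP, TN, FN) for the event 'rate >= threshold'."""
    tp = fp = tn = fn = 0
    for e, t in zip(estimates, targets):
        predicted, observed = e >= threshold, t >= threshold
        if predicted and observed:
            tp += 1
        elif predicted:
            fp += 1
        elif observed:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def pod(tp, fn):
    """Probability of Detection: TP / (TP + FN)."""
    return tp / (tp + fn) if tp + fn else float("nan")


def far(fp, tn):
    """False Alarm Rate as defined above: FP / (FP + TN)."""
    return fp / (fp + tn) if fp + tn else float("nan")


estimates = [0.0, 2.0, 6.0, 25.0]
targets = [0.0, 0.5, 6.0, 10.0]
tp, fp, tn, fn = contingency(estimates, targets, THRESHOLDS_MM_H[0])
print(pod(tp, fn), far(fp, tn))  # 1.0 0.5
```

Note that the FAR used here divides by the number of observed negatives, so it measures the fraction of non-events that were wrongly flagged as rain.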
The obtained scores compare the precipitation estimates and targets pixel-wise and do not consider spatial uncertainty. In real-world scenarios, even if the model correctly detects a storm cell but locates it 2 kilometers away from its actual position, it would still be considered a good estimation, and forecasters would still issue alerts to local authorities. Unfortunately, contingency scores would penalize such cases as both false positives and false negatives. This phenomenon is known as the double penalty. To overcome this, we also examine the Fraction Skill Score (FSS), which accounts for spatial uncertainty. Using gridded neighborhoods, this method compares the forecast and observed rain frequencies using a variant of the Brier score. The Fractions Brier Score (FBS) is defined as:
$$\mathrm{FBS} = \frac{1}{N} \sum_{N} (P_{fcst} - P_{obs})^2$$
where $P_{fcst}$ and $P_{obs}$ are the fractional forecast and observed rain areas in each neighborhood, and $N$ is the number of neighborhoods in the domain. Computing a skill score with respect to the FBS for the perfectly mismatched case results in the positively oriented Fractions Skill Score (FSS):
$$\mathrm{FSS} = 1 - \frac{\mathrm{FBS}}{\frac{1}{N} \left[ \sum_{N} P_{fcst}^2 + \sum_{N} P_{obs}^2 \right]}$$
The FSS ranges from 0 for a complete mismatch to 1 for a perfect match. The FSS is computed on neighborhoods of 4 by 4 pixels, equivalent to 20 km wide at the equator, for each of the three rainfall thresholds defined (1, 5, 20 mm/h).
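The FSS computation can be sketched in a few lines. The example below assumes non-overlapping 4x4 blocks as the gridded neighborhoods and counts a pixel as rainy when its rate is at or above the threshold; both choices, like the function names, are assumptions of the sketch. Fields are plain lists of rows.

```python
def block_fractions(field, threshold, size=4):
    """Fraction of rainy pixels in each non-overlapping size x size block."""
    rows, cols = len(field), len(field[0])
    fractions = []
    for r0 in range(0, rows - size + 1, size):
        for c0 in range(0, cols - size + 1, size):
            hits = sum(
                field[r][c] >= threshold
                for r in range(r0, r0 + size)
                for c in range(c0, c0 + size)
            )
            fractions.append(hits / (size * size))
    return fractions


def fss(forecast, observed, threshold, size=4):
    """Fractions Skill Score: 1 - FBS / FBS of the perfectly mismatched case."""
    p_fcst = block_fractions(forecast, threshold, size)
    p_obs = block_fractions(observed, threshold, size)
    n = len(p_fcst)
    fbs = sum((f - o) ** 2 for f, o in zip(p_fcst, p_obs)) / n
    worst = (sum(f * f for f in p_fcst) + sum(o * o for o in p_obs)) / n
    return 1.0 - fbs / worst if worst > 0 else float("nan")


# Two 4x4 neighborhoods side by side: rain observed only in the left one.
observed = [[10.0] * 4 + [0.0] * 4 for _ in range(4)]
misplaced = [[0.0] * 4 + [10.0] * 4 for _ in range(4)]
print(fss(observed, observed, 5.0))   # 1.0
print(fss(misplaced, observed, 5.0))  # 0.0
```

Because only the rainy fraction of each block enters the score, a forecast that places rain on different pixels of the same neighborhood is not penalized, which is how the FSS avoids the double penalty.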

2.2.3. Experimental Protocol

In the first series of experiments (Section 3.1), the impact of input data on the performance of the model is examined. The FSS on the validation set is used to identify the optimal combination of input data. Specifically, the benefits of using a series of four GEO images as input, the effectiveness of various channel combinations, and the benefits of additional features are investigated. Each network is trained for 25 epochs, and the best checkpoint based on the validation set is saved. Adam [34] is used as the optimizer, with $\beta_1 = 0.9$, $\beta_2 = 0.999$, and a learning rate $\alpha = 0.0001$.
After determining the model with the most effective combination of input data, its performance against the GPMCO data is assessed on the test set (Section 3.2). The confusion matrix for the four categories of rain is examined and the variations in FSS, POD, and FAR across different latitudes are observed.
Subsequently, the required 1-hour cumulated rainfall estimation data is generated for the four defined radar domains to match our set of hourly cumulated radar data. Comparison of the model is performed against the six other operational products (Section 3.3). The RMSE, FSS, POD, and FAR are analysed to evaluate the performance of the various models.
Finally, forecasters from Météo-France carried out a rigorous double-blind review of fifteen extreme rainfall events from 2022 (Section 3.4). Their conclusions from comparing Espresso with the other products are outlined, and two specific cases are analysed in detail: one in Montpellier, located in the south of mainland France, and another on the island of Guadeloupe. This provides an opportunity to examine and interpret the subjective differences between Espresso and the six other products.

3. Results

3.1. Choice of Best Configuration

This section presents investigations regarding the influence of input features on the quality of the output from the neural network. The aim is to identify an optimal configuration to enhance the accuracy of our precipitation estimation model based on satellite observations. The investigations proceed methodically, starting with all available data and then progressively removing different input features.
First, we evaluated the effect of varying the number of GEO images used as input. Figure 4a) illustrates the performance of the model when varying the number of input images. Interestingly, using only the most recent image resulted in an improved FSS for precipitation estimation. As depicted in the figure, the model’s performance improved as historical images were removed, although the performance does not vary strictly monotonically with the number of images. This indicates that the most recent image provides the most relevant information for estimating current precipitation. Including additional images introduces either noise or redundant information.
Next, we investigated the influence of different channels on the model’s performance. Figure 4b) provides valuable insights into the significance of channel selection. Firstly, it demonstrates that combining the IR108 channel with WV062 significantly enhances the FSS score across all rain rate thresholds, compared to using either of these channels alone. Secondly, adding channels such as VIS006 and IR120 did not significantly improve the model’s FSS and, in fact, resulted in a degraded score. Similar to the historical series of images, the IR120 and VIS006 channels may only introduce noise or redundant information. These findings suggest that a combination of the IR108 and WV062 channels provides sufficient information for reliable precipitation estimates.
Further, we evaluated the significance of our additional descriptive features, including latitude, longitude, sun elevation, and date information. The inclusion of these features aimed to enhance the model’s capacity to generalize across various regions and climates. We trained the model three times: once with all additional features, once with only the latitude feature, and finally without any additional features. Figure 4c) shows that incorporating these additional features does not improve the model’s performance but actually degrades it.
These findings contradict our initial assumption that more features would help the network generalize to different situations and enhance its performance. Our hypothesis is that these additional sources of information are not relevant and the variability in the model’s performance is more due to intrinsic variability in the initialization of the weights and the training process than to the addition of features. Consequently, we determined the optimal configuration for the precipitation estimation model: our final configuration, named Espresso, focuses solely on the most relevant information, utilizing only the most recent GEO image as input, combined with the IR108 and WV062 channels. This is the configuration that is used in the subsequent experiments.

3.2. Evaluation of Espresso on the test set

Having established the optimal configuration for the Espresso model, we evaluated its performance against the test set of GPMCO data. The evaluation provides insights into the model’s accuracy and its ability to estimate precipitation effectively.
The evaluation begins with an example of rainfall estimation from Espresso. Figure 5 depicts the case of cyclone Emnati, which developed into a category 4 tropical cyclone when it passed north of La Réunion and Mauritius on 20 February 2022. The cyclone caused flooding, and wind gusts of 163 km/h were recorded at the Maido station in La Réunion. The figure shows the two channels used as input for the neural network (IR108 and WV062), the estimation from Espresso, and the GPMCO rainfall measurement. The comparison between Espresso and GPMCO reveals certain characteristics. The rainfall field appears smoother and more spread out than the GPMCO data, which can be attributed to the nature of the regression model trained with MSE. As a result, the spatial precision of Espresso is not as refined as that of GPMCO, and it overlooks some light rain in the north and south of the GPM swath. It also expands the patches of medium and heavy rainfall. However, Espresso effectively captures the structure of the cyclone, including the eye wall with its peak rainfall intensity and the surrounding rainbands. The cyclone is well-positioned, and the intense rainfall associated with the cyclone is accurately represented.
Secondly, Figure 6 shows the confusion matrix, which summarizes the model’s ability to classify rainfall intensities correctly. The model performs well at identifying the "No rain" category. However, it tends to underestimate the precipitation rate in the majority of cases, and more than half of the "Light rain" cases are classified as "No rain". Medium and heavy rainfall cases are overlooked less often: the model still detects some rain in the majority of these cases, even if the rate is underestimated.
To further assess the model’s performance, we calculated the Fraction Skill Score (FSS), Probability of Detection (POD), and False Alarm Rate (FAR) of Espresso on the test set. Figure 7 gives an overview of the FSS across various latitudes. The model shows its best performance near the equator, with higher FSS for all three rain thresholds. On the other hand, the FSS on the [-70°;-50°] and [50°;70°] bands of latitude range from poor to very poor for heavy rainfalls. The POD and FAR, available in Appendix B, support the same conclusions, even if the difference between POD near the poles and at the equator is less marked.
These results suggest that while the model is able to accurately detect and estimate precipitation in temperate and tropical regions, it struggles to do so at higher latitudes. Despite precautions to over-sample the dataset and weigh the loss in the higher latitudes, heavy rainfalls in these regions are still too scarce to allow the network to learn the different patterns of rain near the poles. In addition, at the poles, the tropopause is lower and the angle between the ground and the satellite’s sensors is greater, leading to a diminished contrast between precipitating and non-precipitating clouds. This makes the task of rain detection more challenging, even to the human eye. Moreover, it is worth noting that the overall FSS, POD, and FAR scores for each latitude band are relatively low. The significant variations in precipitation between adjacent pixels due to localized phenomena pose difficulties for a Deep Learning model to accurately reproduce, as seen in Figure 5.
These results, based on the 2022 GPMCO data, underscore Espresso’s ability to effectively estimate precipitation across a range of rainfall categories in temperate and tropical regions. Nearer to the poles it often overlooks rainfall, especially heavy rainfalls. Although the model cannot reproduce the fine details of the GPMCO data, as evident from Figure 5, it may still prove useful to forecasters in situations of extreme rainfall.

3.3. Comparison with other operational products

In order to thoroughly evaluate the Espresso model, we carried out a comparative analysis against six other operational precipitation estimation products: IMERG, GHE, QPE, PDIR-NOW, P-IN-SEVIRI, and GSMAP. This evaluation concentrated on 1,000 samples of 1-hour accumulated data from 2022 for each of our four radar domains. It is worth noting that QPE and P-IN-SEVIRI are generated in a geostationary space view, and are therefore not available across all the French radar domains.
Figure 8 presents the RMSE of each product across each radar domain. The overall RMSE values are higher in the tropical domains, which experience heavier rainfall than mainland France, leading to larger errors for all models. Espresso is comparable to the other models, with IMERG, GSMAP, and PDIR-NOW competing for the lowest RMSE. The error of Espresso against ground radar data is on par with the other models. However, RMSE does not provide information about the distributions of these errors.
Next, we examined the FSS, POD, and FAR. Figure 9 displays the FSS for the France domain for each model. The FSS for the other domains, along with the POD and FAR for each domain, can be found in Appendices C.1 and C.2.
Espresso demonstrated superior performance in the FR, NC, and RE domains, achieving higher scores in terms of FSS and POD compared to other models. IMERG trailed Espresso in performance, while the rest of the models exhibited comparable quality to each other but fell short of Espresso and IMERG. However, when all domains are taken into account, Espresso was outpaced by IMERG and occasionally GSMAP and GHE in terms of FAR for weak and moderate rainfall. Nonetheless, Espresso exhibited a lower FAR for heavy rainfall than IMERG and GSMAP, but on the NC and FR domains, Espresso was bested by GHE for heavy rainfall events.
In the AG domain, the GHE and QPE models outperformed IMERG in terms of FSS and POD. While Espresso maintained its superiority in terms of FSS, it was significantly outdone by GHE and QPE in the detection of moderate and heavy rainfall. Among global models, Espresso outperformed the rest in nearly all FSS and POD scores, particularly in heavy precipitation events.
These results establish that Espresso is better at detecting and localising rainfall than other global products, especially heavy rainfall. It demonstrated superior POD and FSS across all four domains with distinct climates. Simultaneously, its FAR is comparable to other products, indicating that Espresso does not overestimate rainfall and can be used in crisis management. The less pronounced difference with other products in terms of FAR and RMSE can be explained by the model’s intrinsic spreading of precipitation, which causes false alarms on the periphery of precipitation cells.
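The effect of spreading on the categorical scores can be illustrated with a toy example. The sketch below assumes the common contingency-table definitions, POD = hits / (hits + misses) and FAR = false alarms / (hits + false alarms). The function name, the threshold, and the one-dimensional transect values are illustrative assumptions, not data from this study.

```python
def pod_far(estimate, observation, threshold):
    """Probability of detection and false alarm rate for one rain threshold."""
    hits = misses = false_alarms = 0
    for est, obs in zip(estimate, observation):
        est_rain = est >= threshold
        obs_rain = obs >= threshold
        if est_rain and obs_rain:
            hits += 1
        elif obs_rain:
            misses += 1
        elif est_rain:
            false_alarms += 1
    # Scores are undefined when their denominator is empty.
    pod = hits / (hits + misses) if hits + misses else float("nan")
    far = false_alarms / (hits + false_alarms) if hits + false_alarms else float("nan")
    return pod, far


# Transect across a rain cell (hypothetical rates in mm/h).
observed = [0.0, 0.0, 5.0, 5.0, 0.0, 0.0]
tight_estimate = [0.0, 0.0, 5.0, 5.0, 0.0, 0.0]
# Same cell, spread by one pixel on each side.
spread_estimate = [0.0, 5.0, 5.0, 5.0, 5.0, 0.0]

print(pod_far(tight_estimate, observed, threshold=1.0))   # (1.0, 0.0)
print(pod_far(spread_estimate, observed, threshold=1.0))  # (1.0, 0.5)
```

The spread estimate still detects every rainy pixel, so its POD is unchanged, but the extra pixels on the periphery count as false alarms and raise the FAR.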
These findings position Espresso as a viable and reliable alternative to existing operational products for precipitation estimation. Espresso delivers real-time, accurate, and efficient precipitation estimates, comparable to or better than the widely recognized IMERG, without the associated data availability time delays.

3.4. Case study

To further assess the Espresso model’s performance and visually compare it to the other six operational products, Météo-France forecasters conducted a double-blind review involving 15 instances of extreme precipitation events. Where available, their assessments were made in reference to radar readings and rain gauge measurements. Evaluation criteria included the maximum rainfall captured by the radar and predicted by the various models, the spatial distribution and spread of rainfall, and the structural representation of the event.
Overall, forecasters exhibited a preference for Espresso, primarily due to its superior ability to localize events and the proximity of its estimates to actual rain gauge values. GHE was the next preferred model due to its ability to accurately locate rainfall, even though it somewhat underestimated rainfall amounts. IMERG, QPE, PDIR-NOW, and P-IN-SEVIRI were regarded similarly, with precipitation cells often mislocated and their intensity underestimated. GSMAP was the least preferred, as it frequently failed to detect precipitation events altogether. However, all products tended to produce structures larger than the actual events.
Below, the analysis of two specific cases is detailed: one in Montpellier, located in the south of mainland France, along the Mediterranean coast, and another on the island of Guadeloupe.

3.4.1. Case Study 1: Stationary Convective Storm in Montpellier

The first case study features a stationary convective storm that impacted Montpellier on September 6, 2022, leading to a meteorological warning due to flooding risks and potential river overflow. This storm caused substantial rainfall, accumulating up to 70 mm in a single afternoon. Figure 10 displays the 1-hour cumulative estimations from each of the six available models, as well as the ground radar’s estimation.
Upon analysis, it was observed that GSMAP failed to detect the storm, perceiving only light rainfall. While the other operational products successfully pinpointed the storm, they consistently underestimated the associated rainfall. In contrast, Espresso emerged as the superior model in this scenario, accurately identifying and locating the storm and providing reliable rainfall estimates.
These results underscore the robustness and effectiveness of the Espresso model in capturing the detailed features of convective storms and providing accurate precipitation estimates. Figure 10 also highlights Espresso’s excellent spatial resolution (5 km), which is comparable to that of GHE, PDIR-NOW, and P-IN-SEVIRI (HSAF), and notably finer than that of IMERG and GSMAP (10 km).

3.4.2. Case Study 2: Southeast Flow Over Guadeloupe island

Our second case study focuses on a southeast airflow over Guadeloupe island, characterized by warm and humid air. The combination of converging low-level winds, wind shear at higher altitudes, and the unique configuration of the island amplified the convective activity. Consequently, the region witnessed heavy rainfall, resulting in widespread flooding and the issuance of a meteorological warning. Record-breaking precipitation levels were noted, especially at the Raizet station, where 312 mm fell within a 24-hour period. This severe rainfall event caused substantial material damage and fatalities. Figure 11 displays the 1-hour cumulative estimations from each of the six available models, as well as the ground radar’s estimation.
Upon evaluating the performance of the operational products in this scenario, we found that both GSMAP and IMERG failed to detect the storm. While GHE, P-IN-SEVIRI (HSAF), and QPE were successful in pinpointing the storm’s location, they consistently underestimated its intensity. On the other hand, both Espresso and PDIR-NOW (PERSIANN) exhibited remarkable proficiency in storm localization and intensity estimation. Their ability to accurately locate and estimate extreme rainfall values highlights their potential for providing invaluable data for disaster management and response initiatives.
In conclusion, the Montpellier and Guadeloupe case studies consolidate Espresso’s position as a reliable and capable model for precipitation estimation. The model’s demonstrated accuracy in localizing and estimating the intensity of convective storms underscores its practical utility in real-time monitoring and response to extreme weather events.

3.5. Computational Resources

The training of each experiment was performed on four Nvidia Tesla V100 Graphics Processing Units (GPUs) and took approximately 4 hours for the neural network to converge. Over the course of this year-and-a-half-long research project, a total of 160 training experiments were conducted. This amounts to a total computation time of approximately 27 days, equivalent to an electricity consumption of 650 kWh.
On the other hand, the inference phase, which covers the entire globe, requires only around 5 minutes of computation time on a single Central Processing Unit (CPU).
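The training figures above can be cross-checked with simple arithmetic. The sketch below uses only the numbers quoted in this section. The implied average power values are derived quantities, and the per-GPU figure assumes that the whole 650 kWh is attributed to the four GPUs, which the text does not state.

```python
# Figures quoted in this section.
experiments = 160
hours_per_experiment = 4
gpus_per_experiment = 4
energy_kwh = 650

total_hours = experiments * hours_per_experiment
total_days = total_hours / 24

# Derived quantities (assumption: constant draw over the whole training time).
mean_power_kw = energy_kwh / total_hours
mean_power_per_gpu_w = 1000 * mean_power_kw / gpus_per_experiment

print(total_hours)                  # 640
print(round(total_days, 1))         # 26.7
print(round(mean_power_kw, 2))      # 1.02
print(round(mean_power_per_gpu_w))  # 254
```

The 640 hours of training correspond to just under 27 days, consistent with the total reported above.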

4. Discussion

Throughout this work, we have presented the development and evaluation of Espresso, a convolutional neural network architecture that leverages satellite imagery for global precipitation estimation. As a global model, Espresso overcomes the limitations of data availability delays inherent to operational models such as IMERG, offering real-time precipitation estimates while maintaining or even exceeding the performance of existing operational products. Furthermore, the relatively low computational cost of the inference phase and ease of deployment add to the model’s appeal, making it an attractive solution for operational use.
The model has been carefully designed to ensure that the high resolution of the input satellite data is preserved, contributing to the precise detection and estimation of precipitation events. In the various evaluations conducted, Espresso has demonstrated its ability to accurately capture precipitation patterns across the globe, particularly in temperate and tropical regions. It provides better POD and FSS, and similar FAR, when compared to other models, especially for heavy rainfall.
However, the model’s performance was found to be less robust in higher latitudes, an aspect that could be improved in future iterations. While the current approach to addressing the imbalance in the data using oversampling and weighting the loss has proven to be somewhat effective, other methods could be explored to further improve the model’s ability to learn in these regions. One potential solution is to incorporate data from periods before 2018 and after 2022.
Additionally, advances in deep learning architectures such as Vision Transformers [35] and diffusion models [36] could be used to develop a model that provides finer details than our DeepLabV3+ and lowers the False Alarm Rate of the model.
Furthermore, our attempt to enhance the model’s performance by incorporating additional features like topography, latitude, or season into additional channels has proven to be ineffective. Better results might be achieved by adopting an approach similar to the recent MetNet-3 [37], where the authors prefer to use topographical embeddings. This allows the network to autonomously discover relevant topographical information and store it in the embedding. This embedding is a trainable parameter, similar to techniques used in Natural Language Processing.
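To make the idea of a location embedding concrete, the following is a minimal sketch and not the MetNet-3 implementation. The class `GridEmbedding`, the grid size, the embedding dimension, and the nearest-cell lookup are all hypothetical choices made for illustration. In a deep learning framework, the table would be registered as a trainable weight and updated by backpropagation together with the rest of the network.

```python
import random


class GridEmbedding:
    """A latitude/longitude grid of vectors meant to be learned during training.

    Hypothetical helper for illustration only.
    """

    def __init__(self, n_lat, n_lon, dim, seed=0):
        rng = random.Random(seed)
        self.n_lat, self.n_lon, self.dim = n_lat, n_lon, dim
        # Small random initial values; training would adjust them like any weight.
        self.table = [[[rng.gauss(0.0, 0.02) for _ in range(dim)]
                       for _ in range(n_lon)]
                      for _ in range(n_lat)]

    def cell(self, lat, lon):
        """Map lat in [-90, 90] and lon in degrees east to a grid cell index."""
        i = min(int((lat + 90.0) / 180.0 * self.n_lat), self.n_lat - 1)
        j = min(int(((lon + 180.0) % 360.0) / 360.0 * self.n_lon), self.n_lon - 1)
        return i, j

    def lookup(self, lat, lon):
        """Return the embedding vector of the cell containing (lat, lon)."""
        i, j = self.cell(lat, lon)
        return self.table[i][j]


def with_embedding(pixel_channels, embedding, lat, lon):
    """Concatenate satellite channel values with the location embedding."""
    return list(pixel_channels) + list(embedding.lookup(lat, lon))


embedding = GridEmbedding(n_lat=18, n_lon=36, dim=4)
# Two normalized channel values (hypothetical) for a pixel near Guadeloupe.
features = with_embedding([0.31, 0.72], embedding, lat=16.2, lon=-61.5)

print(embedding.cell(16.2, -61.5))  # (10, 11)
print(len(features))                # 6
```

Unlike a fixed topography channel, the values stored in the table are not prescribed in advance, so the network is free to encode whatever location-dependent information reduces the training loss.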

5. Conclusions

In conclusion, this paper presents Espresso, a deep convolutional neural network designed for global precipitation estimation using satellite data. The model has demonstrated strong performance across various geographical regions, particularly in temperate and tropical zones. The ability of Espresso to detect and accurately estimate rainfall, especially heavy rainfall, establishes it as a reliable and competitive tool in the field of weather prediction and monitoring.
Despite some limitations in higher latitudes, the model demonstrates significant results and potential for further improvements. Future work could explore new approaches to address the imbalance in the data, incorporate additional data sources, or fine-tune the model parameters to enhance performance. As weather patterns continue to become increasingly complex due to climate change, the role of precise, real-time precipitation estimation models like Espresso becomes critical.
Espresso has been incorporated as an operational product at Météo-France, delivering high-quality, real-time global precipitation estimates every 30 minutes. These estimations are readily accessible to forecasters for monitoring French Overseas Territories, where ground radars may not be available, and for anticipating the movement of incoming precipitation before it becomes visible on radars. This tool strengthens Météo-France’s ability to respond to and manage the impacts of extreme weather events, thereby contributing to the protection of people and property across French territories.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author. The data are not publicly available due to restrictions from the original providers.

Acknowledgments

We would like to express our gratitude for the collaborative effort between the AI Lab of Météo-France and the Centre de Météorologie Spatiale, which was crucial to the success of this work. Specifically, we want to acknowledge the significant contributions of several individuals: Laurent Perier for his instrumental role in software development, particularly in data preprocessing; Laura Bouzid, Célestin Cadoret, Coralline Nicolas and Louis Soulard-Fisher for their diligent work in comparing Espresso with other operational products, carried out as part of their internships. We also extend our appreciation to Sylvain Le Moal, Jean-Baptiste Hernandez, Olivier Membrive, and Philippe Chambon for their valuable insights and advice, which enriched the quality of our research. In addition, we acknowledge the essential role of data providers in our work. GPMCO and IMERG data were provided by NASA, GEO data were provided by EUMETSAT, GSMaP data by Japan Aerospace Exploration Agency (JAXA). We thank NASA’s Earth Observing System Data and Information System (EOSDIS) and the AERIS/ICARE Data and Services Center for providing access to the data used in this study. We also express our gratitude to the providers of the four other precipitation estimation products evaluated in this study: GHE, PDIR-NOW, QPE, and P-IN-SEVIRI.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
GPMCO Global Precipitation Measurement Core Observatory
IR Infrared
GEO Geosynchronous-Earth-Orbiting
PMW Passive Microwave
LEO Low-Earth-Orbiting
TRMM Tropical Rainfall Measuring Mission
GHE Global Hydro Estimator
PDIR PERSIANN Dynamic Infrared-Rain Rate
QPE Quantitative Precipitation Estimation
GSMaP Global Satellite Mapping of Precipitation
IMERG Integrated Multi-satellitE Retrievals for GPM
CNN Convolutional Neural Network
GAN Generative Adversarial Network
cGAN Conditional Generative Adversarial Network
USA United States of America
MLP Multi-Layer Perceptron
MSE Mean Squared Error
RMSE Root Mean Squared Error
POD Probability of Detection
FAR False Alarm Rate
FSS Fraction Skill Score
FBS Fraction Brier Score
GPU Graphics Processing Unit
CPU Central Processing Unit

Appendix A. Materials and Methods

Figure A1. Distribution of the maximum of precipitation of our 21,000 GPMCO data tiles for the seven latitude bands.

Appendix B. POD and FAR of Espresso on the test set

Figure A2. POD of Espresso for each band of latitude and each threshold of rain.
Figure A3. FAR of Espresso for each band of latitude and each threshold of rain.

Appendix C. Comparison with other products

Appendix C.1. POD and FAR

Figure A4. POD of each model and each threshold of rain, on the mainland France (FR) domain, for 1 hour cumulated data.
Figure A5. FAR of each model and each threshold of rain, on the mainland France (FR) domain, for 1 hour cumulated data.
Figure A6. POD of each model and each threshold of rain, on the Guadeloupe (AG) domain, for 1 hour cumulated data.
Figure A7. FAR of each model and each threshold of rain, on the Guadeloupe (AG) domain, for 1 hour cumulated data.
Figure A8. POD of each model and each threshold of rain, on the Réunion island (RE) domain, for 1 hour cumulated data.
Figure A9. FAR of each model and each threshold of rain, on the Réunion island (RE) domain, for 1 hour cumulated data.
Figure A10. POD of each model and each threshold of rain, on the New Caledonia (NC) domain, for 1 hour cumulated data.
Figure A11. FAR of each model and each threshold of rain, on the New Caledonia (NC) domain, for 1 hour cumulated data.

Appendix C.2. FSS

Figure A12. FSS of each model and each threshold of rain, on the Guadeloupe (AG) domain, for 1 hour cumulated data.
Figure A13. FSS of each model and each threshold of rain, on the Réunion island (RE) domain, for 1 hour cumulated data.
Figure A14. FSS of each model and each threshold of rain, on the New Caledonia (NC) domain, for 1 hour cumulated data.

References

  1. Tropical Globe’s Database of Publicly Accessible Weather Radars. http://tropicalglobe.com/radar_database/list.shtml. Accessed: 2023-07-19.
  2. Prigent, C. Precipitation retrieval from space: An overview. Comptes Rendus Geoscience 2010, 342, 380–389.
  3. Adler, R.; Braun, S.; Stocker, E.; Marius, J. Tropical Rainfall Measuring Mission, TRMM, Senior Review Proposal. Technical report, Laboratory for Atmospheres, NASA Goddard Space Flight Centre, Greenbelt, MD, USA, 2007.
  4. Smith, E.A.; Asrar, G.; Furuhama, Y.; Ginati, A.; Mugnai, A.; Nakamura, K.; Zhang, W. International global precipitation measurement (GPM) program and mission: An overview. In Measuring precipitation from space: EURAINSAT and the future; 2007; pp. 611–653.
  5. NOAA, Office of Satellite And Product Operations. Global Hydro-Estimator - Algorithm Description. https://www.ospo.noaa.gov/Products/atmosphere/ghe/algo.html. Accessed: 2023-07-19.
  6. Nguyen, P.; Ombadi, M.; Gorooh, V.A.; Shearer, E.J.; Sadeghi, M.; Sorooshian, S.; Ralph, M.F. PERSIANN Dynamic Infrared–Rain Rate (PDIR-Now): A near-real-time, quasi-global satellite precipitation dataset. Journal of Hydrometeorology 2020, 21, 2893–2906.
  7. NOAA NESDIS Center for Satellite Applications and Research. GOES-R Advanced Baseline Imager (ABI) Algorithm Theoretical Basis Document For Rainfall Rate (QPE). https://www.star.nesdis.noaa.gov/goesr/documents/ATBDs/Enterprise/ATBD_Enterprise_Rainfall_Rate_v3_2020-07-10.pdf. Accessed: 2023-07-19.
  8. EUMETSAT HSAF. Algorithm Theoretical Baseline Document (ATBD) for product H03B – P-IN-GRU-SEVIRI. Precipitation rate at ground by GEO/IR supported by LEO/MW. Technical report, 2017.
  9. Kuligowski, R.J. The Self-Calibrating Multivariate Precipitation Retrieval (SCaMPR) for high-resolution, low-latency satellite-based rainfall estimates. In Satellite Rainfall Applications for Surface Hydrology; 2010; pp. 39–48.
  10. Turk, F.J.; Rohaly, G.; Hawkins, J.D.; Smith, E.A.; Grose, A.; Marzano, F.S.; Levizzani, V. Analysis and assimilation of rainfall from blended SSM/I, TRMM and geostationary satellite data. Proc. 10th Conf. Satellite Meteorology and Oceanography, 2000, pp. 66–69.
  11. Kubota, T.; Aonashi, K.; Ushio, T.; Shige, S.; Takayabu, Y.N.; Kachi, M.; Arai, Y.; Tashima, T.; Masaki, T.; Kawamoto, N.; Mega, T.; Yamamoto, M.K.; Hamada, A.; Yamaji, M.; Liu, G.; Oki, R. Global Satellite Mapping of Precipitation (GSMaP) products in the GPM era. In Satellite precipitation measurement; 2020.
  12. NASA Global Precipitation Measurement (GPM) Integrated Multi-satellitE Retrievals for GPM (IMERG). Algorithm Theoretical Basis Document (ATBD) Version 06. https://gpm.nasa.gov/sites/default/files/2020-05/IMERG_ATBD_V06.3.pdf, 2020. Accessed: 2023-07-19.
  13. Pritt, M.; Chern, G. Satellite image classification with deep learning. 2017 IEEE Applied Imagery Pattern Recognition Workshop (AIPR). IEEE, 2017, pp. 1–7.
  14. Jeppesen, J.H.; Jacobsen, R.H.; Inceoglu, F.; Toftegaard, T.S. A cloud detection algorithm for satellite imagery based on deep learning. Remote Sensing of Environment 2019, 229, 247–259.
  15. Neupane, B.; Horanont, T.; Aryal, J. Deep learning-based semantic segmentation of urban features in satellite images: A review and meta-analysis. Remote Sensing 2021, 13.
  16. Tao, Y.; Hsu, K.; Ihler, A.; Gao, X.; Sorooshian, S. A two-stage deep neural network framework for precipitation estimation from bispectral satellite information. Journal of Hydrometeorology 2018, 19, 393–408.
  17. Sadeghi, M.; Asanjan, A.A.; Faridzad, M.; Nguyen, P.; Hsu, K.; Sorooshian, S.; Braithwaite, D. PERSIANN-CNN: Precipitation estimation from remotely sensed information using artificial neural networks–convolutional neural networks. Journal of Hydrometeorology 2019, 20, 2273–2289.
  18. Wang, C.; Xu, J.; Tang, G.; Yang, Y.; Hong, Y. Infrared precipitation estimation using convolutional neural network. IEEE Transactions on Geoscience and Remote Sensing 2020, 58, 8612–8625.
  19. Hayatbini, N.; Kong, B.; Hsu, K.L.; Nguyen, P.; Sorooshian, S.; Stephens, G.; Ganguly, S. Conditional generative adversarial networks (cGANs) for near real-time precipitation estimation from multispectral GOES-16 satellite imageries—PERSIANN-cGAN. Remote Sensing 2019, 11.
  20. Chen, H.; Chandrasekar, V.; Cifelli, R.; Xie, P. A machine learning system for precipitation estimation using satellite and ground radar network observations. IEEE Transactions on Geoscience and Remote Sensing 2019, 58, 982–994.
  21. Wang, C.; Tang, G.; Gentine, P. PrecipGAN: Merging microwave and infrared data for satellite precipitation estimation using generative adversarial network. Geophysical Research Letters 2021, 48.
  22. Wang, C.; Tang, G.; Xiong, W.; Ma, Z.; Zhu, S. Infrared Precipitation Estimation using Convolutional neural network for FengYun satellites. Journal of Hydrology 2021, 603.
  23. Gao, Y.; Guan, J.; Zhang, F.; Wang, X.; Long, Z. Attention-unet-based near-real-time precipitation estimation from fengyun-4A satellite imageries. Remote Sensing 2022, 14, 2925.
  24. Olson, W. GPM DPR and GMI Combined Precipitation L2B 1.5 hours 5 km V07, 2022. Accessed: 2023-04-01, doi:10.5067/GPM/DPRGMI/CMB/2B/07.
  25. Huffman, G.; Stocker, E.; Bolvin, D.; Nelkin, E.; Tan, J. GPM IMERG Final Precipitation L3 Half Hourly 0.1 degree x 0.1 degree V06, 2019. Accessed: 2023-03-01.
  26. Agency, J.A.E. GSMaP(Hourly). 1998. Accessed: 2023-03-01.
  27. Group, G.R.A.W.; Office, G.R.P. NOAA GOES-R Series Advanced Baseline Imager (ABI) Level 2 Rainfall Rate / QPE, 2018. Accessed: 2023-03-01.
  28. NOAA Global Hydro Estimator (GHE). https://registry.opendata.aws/noaa-ghe. Accessed: 2023-03-01.
  29. Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks - Dynamic Infrared Rain Rate near real-time (PDIR-Now). https://chrsdata.eng.uci.edu/. Accessed: 2023-03-01.
  30. Precipitation rate at ground by GEO/IR supported by LEO/MW IODC, P-IN-SEVIRI-E (H63). https://hsaf.meteoam.it/. Accessed: 2023-03-01.
  31. Chen, L.C.; Zhu, Y.; Papandreou, G.; Schroff, F.; Adam, H. Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European conference on computer vision (ECCV), 2018, pp. 801–818.
  32. Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. Springer International Publishing, 2015, pp. 234–241.
  33. Rakshit, S. Multiclass semantic segmentation using DeepLabV3+. https://keras.io/examples/vision/deeplabv3_plus/, 2021. Accessed: 2023-03-01.
  34. Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization 2014. [arXiv:cs.LG/1412.6980].
  35. Hafiz, A.M.; Parah, S.A.; Bhat, R.U.A. Attention mechanisms and deep learning for machine vision: A survey of the state of the art 2021. [arXiv:cs.CV/2106.07550].
  36. Yang, L.; Zhang, Z.; Song, Y.; Hong, S.; Xu, R.; Zhao, Y.; Yang, M.H. Diffusion models: A comprehensive survey of methods and applications 2022. [arXiv:cs.CV/2209.00796].
  37. Andrychowicz, M.; Espeholt, L.; Li, D.; Merchant, S.; Merose, A.; Zyda, F.; Kalchbrenner, N. Deep Learning for Day Forecasts from Sparse Observations 2023. [arXiv:cs.LG/2306.06079].
Figure 1. Example of a precipitation tile and its associated GEO data on the northern coast of Madagascar: (a) precipitation data from GPMCO in mm/h; (b) normalized albedo of the visible 0.6 μm channel from MSG3; (c), (d), and (e) normalized brightness temperature for the 10.8, 12.0, and 6.2 μm channels, respectively.
Figure 2. Radar coverage of our four study areas.
Figure 3. The experimental framework.
Figure 4. Fractions skill scores for each experiment, as a function of the rainfall rate (RR) in mm/h. (a) Different numbers of input GEO images; (b) Number of GEO images set to one, variable GEO input channels; (c) With channels IR108 and WV62, variation of the additional input features.
Figure 5. Example of rainfall estimation from Espresso during the cyclone Emnati, north of La Réunion and Mauritius.
Figure 6. Confusion Matrix of the Espresso model for four categories of Rainfall Rate (RR). True Label lines sum to 1.
Figure 7. FSS of Espresso for each band of latitude and each threshold of rain.
Figure 8. Barplot of the RMSE of each model for each radar domain.
Figure 9. Fractions skill scores of each model and each threshold of rain, on the mainland France domain.
Figure 10. Comparison of the seven models for a convective storm in Montpellier, 2022-09-06.
Figure 11. Comparison of all six models for a case of convective activity in Guadeloupe, 2022-04-03.
Table 1. Satellite-based precipitation estimation operational products.
Name | Producer | Coverage | Resolution | Frequency | Availability delay
IMERG | NASA | World | 0.1° (∼10 km) | 30 minutes | 4 hours
GHE | NOAA | World limited to 65°S/65°N | 0.36° (∼4 km) | 1 hour | Real time
QPE | NOAA | GOES East space view | 2 km at nadir | 10 minutes | Real time
PDIR-NOW | University of California | World limited to 60°S/60°N | 0.04° (∼4.5 km) | 1 hour | Real time
P-IN-SEVIRI | EUMETSAT HSAF | MSG 0° space view | 3 km at nadir | 15 minutes | Real time
GSMAP | JAXA | World | 0.1° (∼10 km) | 30 minutes | Real time
Table 2. Geostationary satellites from which our data originated.
Name | Central longitude | Time frequency (min) | Size of images | Resolution at nadir
MSG4 | | 15 | 3712x3712 | 3 km
MSG1 | 41.5° | 15 | 3712x3712 | 3 km
Himawari8 | 140.7° | 10 | 5500x5500 | 2 km
GOES16 | -75° | 10 | 5424x5424 | 2 km
GOES17 | -137° | 15 | 5424x5424 | 2 km
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.