Preprint
Article

This version is not peer-reviewed.

Extreme Heat and Emergency Health Impacts in the US (2018-2025)

Submitted:

16 June 2026

Posted:

17 June 2026

You are already at the latest version

Abstract
Future climate projections suggest an increase in heat-related mortality and a decrease in cold-related deaths under warming scenarios. Understanding the health impacts of extreme heat, and their implications for healthcare demand is essential for assessing the future burden of climate-related illnesses. In this study, we examined the relationship between extreme heat events and Emergency Department Visits (EDVs) for heat-related illnesses (HRIs) across the United States from 2018 to 2025. Using data from the Centers for Disease Control and Prevention (CDC) Heat and Health Tracker and other relevant sources, we analyzed EDV rates standardized to 100,000 population. We aggregated daily into the 10 U.S. Health and Human Service Regions (HHSRs). We used 0.5∘×0.5∘ gridded maximum daily temperature data (aggregated to HHSRs with proportional area weighting) and daily maximum heat index extracted from the CDC data portal (estimated using the US National Weather Service methodology and aggregated to HHSRs using total population weighting) to characterize seasonal patterns and regional variability. The association between peak heat events and EDV time series was explored using log-linear mixed-effects models, which accounted for seasonal trends, climate variables, and socio-economic vulnerability factors. Regional variability in predictor-response relationships was captured through random effects. Model performance was evaluated using prediction error metrics and goodness-of-fit assessments. Maximum temperature and heat index were both significant predictors, with the heat index offering a slightly better fit. Associations were largely contemporaneous, with peak correlations at lag zero, underscoring the need for real-time response. EDVs increased several days before peak environmental conditions, consistent with early exposure effects. While temperature–EDV relationships varied regionally, heat index associations were more stable. This work underscores the urgent need for regionally adaptive public health strategies in the face of intensifying climate extremes and outlines future directions for research and policy to strengthen health systems’ preparedness in a warming world.
Keywords: 
;  ;  ;  

1. Introduction

Extreme heat events stand out as one of the most deadly weather-related hazards worldwide. Heat-related events caused the most death in the U.S. during 2023, followed by wildfires and tornadoes. The largest number of reported injuries during that year were caused by heat, tornadoes and winter weather (National Oceanic and Atmospheric Administration (NOAA), 2024).
During the period 2022-2025, 36 countries faced national breaking temperature records (Yale Climate Connections, 2025). Five million deaths were attributed to extreme heat and cold globally between 2000-2019 (Chen et al., 2024), and projections indicate that heat-related deaths will increase.
Climate change might decrease cold-related deaths at high latitudes but increases heat-related deaths specially in Tropical Africa and Asia (Climate Impact Lab and United Nations Development Programme, 2024).
Impacts of these events can extend beyond individual increased mortality and morbidity to challenging the resilience and response of the health care system.
There is a wide spectrum of heat-related illnesses which range from dehydration and heat exhaustion, to cardiovascular and respiratory complications, with an uneven effect on vulnerable populations like elderly, young-children and individuals with pre-existing conditions. Prolonged heat exposure can affect the human body at multiple levels including mild physiological stress including dehy-dration, to life-threatening conditions as heat-stroke.
Thiel et al. (2025) identified a broad spectrum of heat-related diseases including electrolyte imbalances, cardiovascular disease and mental health issues. Using recent surveillance data, Vaidyanathan et al. (2024) report substantial increases in heat-related emergency department visits across the United States, with pronounced regional and demographic variability, underscoring the growing public health burden of extreme heat.
Focusing on vulnerable populations, Liss et al. (2017) demonstrate that higher ambient temperatures and heatwave events are strongly associated with increased hospitalizations among older adults, with clear seasonal synchronization between temperature peaks and morbidity. At a national scale, Vaidyanathan et al. (2019) estimate exposure–response relationships between heat index and hospitalizations across 1,617 U.S. counties, revealing significant geographic heterogeneity and showing that health risks can occur even at moderately elevated heat levels. Similarly, Gronlund et al. (2014) find that extreme heat and heatwaves are associated with increased hospital admissions among the elderly, particularly for renal and respiratory conditions, highlighting specific pathways through which heat affects morbidity. Recent work by Xing et al. (2026) provides detailed empirical evidence from 460 communities in Victoria, Australia, demonstrating that heat exposure contributes substantially to both hospital admissions and emergency department visits, with measurable economic and health burdens. Their analysis further shows that a significant proportion of these impacts can be attributed specifically to human-induced climate change, highlighting the role of anthropogenic warming in amplifying heat-related health risks.
Understanding the impacts of extreme heat on health and health services demand would provide a better estimation of future climate-related illness burden. In this paper we focus on evaluating the synchronicity between ambient variables peak timing and emergency department visits peak timing in the large contiguous US states by Health and Human Service (HHS) regions building predictive models for Emergency Department Visits (EDVs) that account for regional impacts of environmental factors as well as geographical factors. In section 2 we provide a description of all data sources used in the analysis. In section 3 we describe the methodology including the different modeling approaches for seasonality quantification and predictive models for EDVs across the 10 HHS regions. Results are provided in section 4 including model diagnostics and predictive skills. Discussion and Conclusions are presented in section 5 with further discussions on the limitations of this approach and future research directions.

2. Data Description

Data for this research was extracted from different sources. Date pre-processing was necessary for consistency and compatibility of different temporal and spatial scales. This section describes the different data sources and data processing for environmental and vulnerability related variables, including ambient maximum temperature and heat index, percentage of forest area, percentage of elderly and non-white population, as well as the health related outcome of this research which is Emergency Department Visits.

2.1. Emergency Department Visits

The emergency department visit (EDV) data was obtained from the Centers for Disease Control and Prevention’s (CDC’s) National Environmental Public Health Tracking Network (https://ephtracking.cdc.gov/DataExplorer/) under the heat-related illness (HRI) content. Data is available at a daily rate from 1/1/2018 to 12/31/2025 and for each of the ten Health and Human Services (HHS) regions (see Figure 1). EDV was expressed as HRI-associated visits per 100,000 emergency department visits.
Because EDV data was only available by HHS region, the temperature and heat index data were aggregated to this level. Furthermore, because the EDV data did not go further back than the 2018 (other than 12/31/2017, which was excluded for simplicity), temperature data was limited to this range despite being available at prior years.

2.2. Maximum Temperature Data

The temperature data was obtained from the Physical Sciences Laboratory of the National Oceanic and Atmospheric Administration (NOAA) (https://psl.noaa.gov/data/gridded/data.cpc.globaltemp.html). The maximum surface temperature in Celsius at a daily rate from 1/1/2018 to 12/31/2025 was used. Because the data was gridded in 0.5 × 0.5 degree latitude and longitude squares, it needed to be aggregated to the HHS region level.
Shape-files from the United States Census Bureau (https://www.census.gov/geographies/mapping-files/time-series/geo/carto-boundary-file.html) were used at the 500 km resolution level. The daily temperature for each region was calculated by averaging the gridded values, weighted by the proportion of the area in the region to account for grid overlapping between two regions.

2.3. Heat Index

Heat Index (HI) data were obtained from the CDC’s National Environmental Public Health Tracking Network, specifically from the “HRI” section and the “Historical Temperature and Heat Index” subsection. These data are available at a daily resolution at the county level for the period January 1, 2018 through December 31, 2022, and were additionally aggregated to the HHS regional level.
The heat index integrates both, the air surface temperature and the relative humidity, and it is frequently used to measure heat exposure (Anderson et al., 2013). For calculating the Heat Index, CDC uses a National Weather Service’s modified version of the Rothfusz regression1.
H I = 42.379 + 2.04901523 T + 10.14333127 R H 0.22475541 T × R H 6.83783 × 10 3 T 2 5.481717 × 10 2 R H 2 + 1.22874 × 10 3 T 2 × R H + 8.5282 × 10 4 T × R H 2 1.99 × 10 6 T 2 × R H 2
where H I is heat index (°F); T is the air temperature (°F) and R H is the Relative humidity (%). The raw data are collected daily at the county level across all 3,143 US counties and sourced from the CDC website.
To align with the 10 HHS regions, we combined the data in two steps:
  • First, compute the average heat index for each state:
    HI state = 1 n c i = 1 n c HI county i
    where n c is the number of counties in the state.
  • Then, to better reflect the population distribution, we calculate the population-weighted average for each HHS region:
    HI region = j region HI state j × Pop j j region Pop j
where P o p j is the total population for state j and H I s t a t e j is the heat index for state j calculated from equation 2.

2.4. Vulnerability-Related Variables

Vulnerability-related data were obtained from the CDC’s National Environmental Public Health Tracking Network under the “HRI” (heat-related illnesss) ection, “Vulnerability and Preparedness: Heat” subsection. These variables capture demographic and environmental characteristics that may influence population susceptibility to HRI. Three variables were included in the analysis:
  • elderly_percentage: Percentage of the population aged 65 or older living alone. This variable is collected annually at the state level from 2009 to 2021.
  • nonwhite_percentage: Percentage of the population identifying as a race other than White. This variable is measured every decade at the county level from 2000 to 2010.
  • forest_coverage: Percentage of land area covered by forest. This variable is measured every four years at the state level from 2001 to 2021.
The spatial distribution of the three variables by state and aggregated to HHS regions is presented in Figure 2, Figure 3 and Figure 4 for selected years.
Elderly population is highly concentrated in the states of Iowa, Missouri, Nebraska and Kansas (HHS Region 7), while the highest proportion of forested areas concentrates in the North East states of Maine, New Hampshire, Vermont, Massachusets and Rhode Island (HHS Region 1). In the case of non-white population the highest concentration occurs in the South West states of California, Nevada and Arizona (HHS Region 9).

3. Methodology

A first step in the analysis was to quantify the seasonality of the main response variable (EDV) and the proposed predictors: maximum temperature and heat index. The aim was to compare the features of the seasonal cycles (peak time and amplitude) and their regional variability, as well as to assess the potential synchronicity among them.
The detection of a potential lag effect between EDV and temperature, and between EDV and heat index, was also assessed by calculating the sample cross-correlation function between the response and the predictors after removing the trend and seasonal components using a seasonal–trend decomposition approach.
Finally, predictive models for EDV were fitted by incorporating the overall observed seasonal effect of the response, along with the effects of the predictors, which included either heat index or temperature as well as vulnerability-related variables. Regional variability in the strength of this dependence was accounted for through the inclusion of additional random effects in the model. Sequential model comparison was conducted as additional predictors were added by evaluating model predictive accuracy using a training–testing data split, while model fit was assessed using the Bayesian Information Criterion (BIC).

3.1. Seasonality Quantification

Variables EDV, temperature and heat index are strongly seasonal across all HHS regions; therefore seasonality quantification using useful and data informed metrics becomes an important initial step to describe the temporal patterns of these variables. For EDV, temperature, and heat index, seasonality metrics were calculated following Naumova et al. (2022).
Let Y t j be the response variable (EDV) at time t for region j, and X t j be the temperature or heat index at time t and region j. A log transformation was applied to EDV since it is a count variable.
We initially model the response as a purely seasonal model with one harmonic:
Y t j = β 0 j + β s j sin ( ϕ t ) + β c j cos ( ϕ t ) + ε t j
where ϕ = 2 π 365.25 ; β 0 j , β s j and β c j are the unknown parameters for the one-harmonic periodic regression; and ε j t are random normal errors with zero mean and constant variance within region.
Model coefficients are estimated by region using maximum likelihood, giving us regression coefficient estimates, their standard errors and covariances for each region. From these estimates we calculate:
  • Phase angle ψ = arctan ( β s j / β c j )
  • Var ( ψ ) = β c j 2 σ s j 2 + β s j 2 σ c j 2 2 σ β c j β s j β s j β c j ( β c j 2 + β s j 2 ) 2
  • Peak timing P T = ψ / ϕ if β s j , β c j > 0 ( ψ + π ) / ϕ if β c j < 0 ( ψ + 2 π ) / ϕ otherwise
  • Var ( P T ) = Var ( ψ ) ϕ
  • Amplitude γ = β c j 2 + β s j 2
  • Var ( γ ) = β c j 2 σ s j 2 + β s j 2 σ c j 2 + 2 σ β c j β s j β s j β c j β c j 2 + β s j 2
  • Intensity θ = e γ
These metrics are then summarized by regions and represented graphically to assess spatial variability within their geographical setting.

3.2. Emergency Department Visits (EDV) Modeling

Enhancing effects of extreme air temperature and heat index are expected to produce amplified responses in the corresponding health-related outcomes. Under these assumptions, EDV may be subject to an overall seasonal effect that can be modified by localized seasonal effects, as well as by the impacts of temperature or heat index, which may also vary across regions due to local conditions.
Two sets of models are proposed to represent EDV outcomes, using either temperature or heat index as potential predictors. In total, six generalized linear mixed-effects models (GLMMs) were considered. Let X t j denote the temperature or heat index at time t for region j, and let μ t j = E ( Y t j ) denote the expected value of EDV. The link function is denoted by g ( · ) .
In these models, parameters b s and b c are the corresponding fixed effects regression coefficients for the one-harmonic sine and cosine functions, while β s j and β c j are corresponding random effects with values changing across regions. β x is the fixed effect regression coefficient for the external climatic predictors (maximum temperature or heat index) while β x j is the corresponding random effect with values changing across regions. Parameters β 0 j represents a random intercept or baseline value which changes across regions. The random effects are assumed to follow mean-zero distributions with variance components to be estimated from the data.
Two different sets of models are proposed to represent EDV outcomes using temperature or heat index as potential predictors. Six generalized linear mixed effects models (GLMM) were considered. Let X t j be the temperature or heat index at time t for region j. The models are:
1:
g ( μ t j ) = b s sin ( ϕ t ) + b c cos ( ϕ t ) + β 0 j
2:
g ( μ t j ) = b s sin ( ϕ t ) + b c cos ( ϕ t ) + β s j sin ( ϕ t ) + β c j cos ( ϕ t ) + β 0 j
3:
g ( μ t j ) = b s sin ( ϕ t ) + b c cos ( ϕ t ) + β 0 j + b x X t j
4:
g ( μ t j ) = b s sin ( ϕ t ) + b c cos ( ϕ t ) + β 0 j + b x X t j + β x j X t j
5:
g ( μ t j ) = b s sin ( ϕ t ) + b c cos ( ϕ t ) + β s j sin ( ϕ t ) + β c j cos ( ϕ t ) + β 0 j + b x X t j
6:
g ( μ t j ) = b s sin ( ϕ t ) + b c cos ( ϕ t ) + β s j sin ( ϕ t ) + β c j cos ( ϕ t ) + β 0 j + b x X t j + β x j X t j
where g ( . ) is the link function and μ t j = E [ E D V t j ] is the expected value of EDV for time t and region j.
Models 1 and 2 exclude the effect of an environmental predictor and are purely seasonal, while the remaining models 4 to 6 include the effect of an environmental variable (maximum temperature or heat index) on the EDV response. Models 2, 5 and 6 include both a fixed global seasonal effect and a random seasonal effect that varies across regions. Models 4 and 6 include both a fixed global effect of X over EDV, and a random effect of X over EDV that changes across regions. While model 1 considers a random intercept which modifies the baseline values of EDV from region to region, Model 2 also accounts for regional modifications of the seasonal effects over the global seasonal signal, by adding random effects on the sine and cosine components.
All models were compared in terms of predictability, by estimating the model parameters for a training set, and estimating the the mean squared error (MSE) on a testing set, and in terms of goodness of fit. Different train-test split data percentages were compared to evaluate the sensitivity of the prediction errors to the different split schemes.

4. Results

The seasonality analysis of the main time series in this research is presented in Section 4.1. As an initial step, the time series are treated as independent across the different HHS regions, and separate models are fitted for each region. Exploration about the delay between the environmental time series and the EDV response time series is presented in Section 4.2. Following this analysis, we present the EDV predictive models that include maximum temperature as the main external covariate in Section 4.3 (maximum temperature models), followed by the EDV predictive models that use the heat index as an external covariate in Section 4.4 (heat index models).

4.1. Seasonality Analysis

The one-harmonic seasonal models fitted to the observed time series of EDV, maximum temperature and heat index are shown in Figure 5, Figure 6 and Figure 7. The model (red lines) is able to reproduce the overall seasonal behavior for all time series at each HHS region, more specifically, the peak timing, but they are not able to reproduce the more extreme events (blue dots). Representation of regional differences in amplitude and inter-annual variability requires additional model components or covariates.
Seasonality metrics for EDV, temperature, and heat index are shown in Table A1, Table A2, and Table A3, respectively. These results show an average peak timing across regions of 194, 200.4 and 203.4 days from the beginning of the year for EDV, maximum temperature and heat index respectively. The EDV peak occurs earlier than the peak timings for both maximum temperature and heat index. There is a difference of nearly one week between the average peak timing of EDV and that of maximum temperature. This difference increases to nearly 10 days on average when comparing the average peak timing of EDV with that of the heat index.
Figure 8, Figure 9, Figure 10, and show maps of the estimated peak times for EDV, temperature, and heat index, respectively. The earliest peak timing for EDV occurs in HHS Region 5 (Illinois, Indiana, Michigan, Minnesota, Ohio, and Wisconsin), while the latest peak occurs in Region 9 (Arizona, California, and Nevada) among the continental states. This pattern is not necessarily synchronized with the earliest and latest peak times for maximum temperature and heat index.
An inspection of detailed regional differences between the response and the seasonal effects reveals that a maximum difference of two weeks occurs in Region 5 (Illinois, Indiana, Michigan, Minnesota, Ohio, Wisconsin) when comparing EDV and heat index peak timing, while a minimum difference of one-quarter of a day between peak times occurs in Region 6 (Arkansas, Louisiana, New Mexico, Oklahoma, Texas). Similar extreme differences in peak timings are observed between EDV and maximum temperature for the same regions. These regional differences are accounted for through random effects that represent the region-to-region variability of the seasonal cycle model representation.

4.2. Lag Detection Between EDV and Environmental Predictor Variables

A common way to quantify the time delay between two time series and their association strength is by calculating the sampling cross-correlation functions (CCF) and looking the lag values where the highest sample cross-correlation occurs (Runge et al., 2014). To minimize the effect of the autocorrelation of each time series on this calculation, we applied the STL decomposition (Seasonal–Trend decomposition using LOESS) to remove seasonal and non-seasonal trends in the data (Cleveland et al., 1990).
STL decompositions for EDV, temperature, and heat index are shown in Figure A1, Figure A2, and Figure A3, respectively.
Cross-correlations between the STL remainders for EDV and temperature and EDV and heat index are shown in Figure A4 and Figure A5, respectively.
In most cases, except for the CCF between EDV and maximum temperature residuals in Region 5, the maximum cross-correlation values occurred at lag = 0. This indicates that no significant time delay between the response and the climatic variable was detected, suggesting an immediate heat response. In view of these results, lagged predictors were not included in the proposed models described in the following section. Models that include maximum temperature as a predictor are referred to as temperature models and are discussed in Section 4.3, whereas models that include the heat index as a predictor are referred to as heat index models and are discussed in Section 4.4.

4.3. Temperature Models

Temperature models were evaluated using both the full data (to the end of 2025) and data limited to the heat index time range (until the end of 2022). To test the sensitivity of different train-test data splits, different percentages were considered for each data range. Figure 11 compares the MSEs for models 1 to 6, including temperature as a predictor in models 3 to 6.
As shown in Figure 11, models 3, 4, 5, and 6 had much lower MSEs than models 1 and 2. The lowest MSEs occurred at a 70-30 train-test split for both data ranges, with model 4 and model 6 being optimal for the data ending in 2025 and 2022, respectively. However, the difference in MSEs was very small, and all four models including temperature had a similar performance.
The best two models, models 4 and 6, were used to make predictions in the testing set. Figure 12 shows the predictions for these two models fo all HHS regions. Both models suggest distinct regional differences in the associations between EDV and maximum temperature.

4.4. Heat Index Models

Heat index models were evaluated using data that ends at the end of 2022, as shown in Figure 13. Similar to the temperature models, models 3, 4, 5, and 6 had much lower MSEs than models 1 and 2 (which did not include heat index as a predictor). Though the four models with heat index show similar predictive performance at a 70-30 train-test split, model 5 had the lowest MSE.
Figure 14 shows the predictions for the model with the best predictive performance (model 5). Results are shown for all HHS regions.
In contrast with the best predictive models selected in the temperature case, model 5 suggests a more homogeneous dependence of EDV on the environmental predictor (heat index) across regions, excluding a random effect for the dependence between the mean EDV and the heat index. An estimated regression coefficient b ^ x = 0.1893 implies an overall 20% increase in the mean EDVs rate per unit increase in the heat index.
Both temperature and heat index models are able to adequately capture the seasonal patterns of EDV. Extreme observations generally lie within the 95% prediction intervals but remain more difficult to reproduce, possibly due to the presence of overdispersion, which is not accounted for by the Poisson model.
Overdispersion can be accounted for by using the Negative Binomial 2 family as the probability distribution for the response. Model comparisons between the Poisson and Negative Binomial 2 (NB2) families were conducted. The results are presented in the following section.

4.5. Fitting Performance vs. Prediction Accuracy for Temperature and Heat Index Models

The Bayesian Information Criterion (BIC) is a model selection tool to evaluate how well a model captures the overall distribution of the observed EDV counts when it used within the context of model comparison. Lower values of BIC indicate better goodness of fit after penalizing for model complexity. In this case, the data exhibit clear overdispersion, meaning that the variance exceeds the mean. The Poisson model imposes equality between the mean and variance, which leads to an inadequate representation of the variability in the data, particularly in the tails. In contrast, the NB2 model relaxes this restriction by allowing the variance to exceed the mean, enabling it to better capture the observed dispersion. As a result, the NB2 model provides a substantially higher likelihood by fitting the full distribution more accurately. This improvement in likelihood outweighs the modest complexity penalty in BIC, yielding lower (better) BIC values as shown in Figure 15 and Figure 16 for temperature and heat index models respectively.
Mean Squared Error (MSE), on the other hand, evaluates how accurately a model predicts the conditional mean of EDV and does not incorporate information about the variance structure. From this perspective, the simpler Poisson model can have an advantage because it directly estimates the mean without introducing an additional dispersion parameter. This can result in more stable point predictions and slightly smaller prediction errors. In contrast, the NB2 model’s added flexibility allows for greater variability around the mean, which may increase the variability of point predictions and lead to a marginally higher MSE. Consequently, the Poisson model can achieve better predictive performance in terms of MSE, despite providing an inferior fit to the overall distribution. Figure 17 and Figure 18 show a comparison of the MSEs between the Poisson family and Negative Binomial 2 family for the temperature and heat index models respectively.
In this work, we focus on improving model predictability by treating the Poisson model as a preferable alternative to the NB2 model.

4.6. Intra-Class Correlation Coefficient

The intra-class correlation coefficient (ICC), also referred to as the variance partition coefficient, measures the proportion of total variability attributable to the random effects associated with the data clusters (i.e., region-to-region variability). Fixed effects influence the ICC indirectly by reducing the unexplained variance components used in its calculation.
To quantify the contribution of regional variability in emergency department visits (EDV), including seasonal components and covariate effects, we computed ICC values both before adjusting for model covariates (unadjusted ICC) and after including covariates such as seasonal effects and temperature or heat index (adjusted ICC).
Table 1 presents the ICC estimates for the best-performing temperature and heat index models under both the Poisson and Negative Binomial type 2 (NB2) families. For the Poisson models, unadjusted ICC values are relatively low (approximately 5–10%), indicating modest baseline clustering and that only a small fraction of the total variability is attributable to differences between regions prior to covariate adjustment.
In contrast, the adjusted ICC values for the Poisson models are very close to 1. This indicates that, after accounting for seasonal and meteorological covariates, nearly all of the remaining unexplained variability is attributable to between-region differences. In other words, the residual variation becomes highly structured at the regional level, with very little within-region variability remaining once covariates are included.
For the Negative Binomial models, adjusted ICC values remain substantially lower (approximately 0.10–0.14). This reflects the additional observation-level variability introduced by the NB2 dispersion parameter, which accounts for overdispersion beyond the Poisson assumption. The increased within-region heterogeneity inflates the total variance, thereby reducing the proportion attributable to between-region variation and leading to smaller ICC values even after covariate adjustment.

4.7. Residual Analysis

Residual analysis is a critical step in regression model diagnostics, used to detect model misspecification, including incorrect functional forms, distributional assumptions, or the presence of influential outliers. For generalized linear mixed models (GLMMs) with non-normal responses, a simulation-based residual approach is recommended over traditional Pearson and deviance residuals (Feng et al. (2020)). In this framework, residuals are defined as simulation-based quantile residuals taking values in the interval [ 0 , 1 ] . They represent the position of each observed value within the cumulative distribution of responses simulated from the fitted model, i.e., r i = F m o d e l ( y i )
This simulation-based quantile residual approach, implemented in the R package DHARMa (Hartig (2024)), was used for residual diagnostics. Figure 19 shows the DHARMa residual plots for the best temperature model (Model 5) fitted with a negative binomial (NB2) distribution. The left panel presents a quantile–quantile (QQ) plot against the expected uniform distribution, where each point represents a scaled residual and the red diagonal line indicates the theoretical expectation under correct model specification; systematic deviations from this line indicate potential model misspecification. The right panel displays residuals versus predicted values (rank-transformed), where the residuals should be uniformly distributed and centered around 0.5 with no discernible pattern under a well-specified model. A notable feature is the presence of several residuals equal to 1, indicating that the corresponding observed values lie at or beyond the upper tail of the simulated distribution (i.e., nearly all simulated responses are smaller than the observations), suggesting that the model may underestimate extreme values.
Analogous to Figure 19, Figure 20 presents the DHARMa residual plots for the best heat index model (Model 5) using the NB2 distribution family.
Analogous results are observed for the heat index model (Figure 20). The diagnostic tests yield consistent conclusions for both models. The Kolmogorov–Smirnov (KS) test for uniformity is significant, indicating that the residuals deviate from the expected uniform distribution. In contrast, the dispersion test is not significant, suggesting that both models adequately account for overdispersion. However, the outlier test is significant, indicating an excess number of observations falling outside the simulated range, which is consistent with a lack of fit in the upper tail of the distribution.
Taken together, these diagnostics suggest that, although the negative binomial models successfully address overdispersion, they fail to adequately capture extreme observations and possibly other structural features of the data. The significant deviation from uniformity further indicates residual misspecification, potentially arising from unmodeled nonlinear effects, temporal dependence, or missing covariates. While the NB2 specification provides an improvement over the Poisson model, additional model refinement is required to better represent the distributional characteristics of heat-related emergency department visit counts.

5. Discussion

This study examined the relationship between extreme heat indicators and emergency department visits (EDVs) for heat-related illnesses across the ten U.S. Health and Human Services regions over the period 2018–2025. By combining seasonal analysis with generalized linear mixed-effects models, we provide insights into the temporal dynamics, regional variability, and predictive performance of climate–health relationships at a national scale.
Our results confirm that both maximum temperature and heat index are important predictors of heat-related EDVs. Models incorporating these environmental variables consistently demonstrated lower prediction error compared to models based solely on seasonal components. Among the two indicators, models using the heat index achieved slightly better predictive performance and goodness of fit than those using temperature. This finding is consistent with the construction of the heat index, which integrates both temperature and relative humidity to better represent human heat exposure.
A key result of this analysis is the immediate heat effect on the response variable EDV. Cross-correlation analyses showed that the maximum correlation between EDVs and both temperature and heat index occurred at lag zero in most regions. This indicates that heat-related health impacts are largely contemporaneous with exposure, emphasizing the importance of real-time monitoring and rapid response to extreme heat conditions at this spatial and temporal scales. These results align with the results presented in Xing et al. (2026) where an immediate effect of heat response was observed and persisted for 21 days.
The seasonal structure of EDVs further highlights important dynamics in heat-related health outcomes. The average EDV peak occurs around day 194 of the year, while maximum temperature and heat index peak later, around days 200 and 203, respectively. The observed differences of approximately 6 days for temperature and 9 days for heat index, suggest that EDVs begin increasing before environmental variables reach their maximum values. This pattern aligns with previous findings reported in the literature, where increased heat-related morbidity precedes peak temperature conditions, likely reflecting early exposure effects and population vulnerability conditions (Vaidyanathan et al., 2019), (Liss et al., 2017).
Regional heterogeneity was also observed in the relationship between temperature and EDVs. Models incorporating random effects indicated that the strength of this association varies across regions, reflecting differences in environmental conditions and regional characteristics. In contrast, the best-performing models suggested that the relationship between heat index and EDVs is more consistent across regions, indicating that the heat index may capture more uniform physiological responses to heat exposure.
Model comparison revealed a trade-off between predictive accuracy and distributional fit. While Poisson models achieved lower mean squared error (MSE), indicating better predictive performance, negative binomial (NB2) models provided improved goodness of fit by accounting for overdispersion in the data. Residual diagnostics further indicated that, despite addressing overdispersion, both model classes exhibit some degree of misspecification, including deviations from uniformity and the presence of extreme observations outside expected modeled data ranges. These results suggest that additional model refinement, such as alternative distributional assumptions or more flexible model structures, may be necessary.
The intra-class correlation coefficient (ICC) analysis provides additional insight about the sources of EDVs’ variability. Unadjusted ICC values were relatively low (approximately 5–10% for the Poisson family and 4% for the NB2 family), indicating that region-to-region differences alone explain a limited portion of total sources of variability. However, after adjusting for seasonal and environmental covariates, most of the remaining unexplained variability is attributed to regional differences in the Poisson case. For the NB2 model case, additional variability is accounted for by overdispersion, and only a moderate fraction of the remaining unexplained variability (11%-14%) after fixed effects adjustment is attributed to regional differences.
First, the analysis is constrained by the availability of EDV data, which spans only from 2018 onward, limiting the evaluation of longer-term trends. Second, spatial aggregation at the HHS regional level may mask finer-scale variability and limit the detection of localized effects. Third, vulnerability-related variables were excluded due to lack of statistical significance in the models, which may reflect limitations in data resolution rather than absence of real effects. Considerations on different population strata as sex, age and socio-economic status might reveal additional differences in heat responses.

6. Conclusions

In summary, this study demonstrates that environmental heat indicators, particularly heat index, are valuable predictors of heat-related emergency department visits across the United States. Models incorporating temperature or heat index substantially outperform purely seasonal models, underscoring the importance of including environmental drivers in predictive frameworks.
The finding that EDV peaks occur earlier than the peaks of temperature and heat index has important public health implications. It suggests that interventions such as heat-health warning systems and preparedness strategies should be implemented prior to the highest levels of heat exposure, rather than in response to peak conditions.
From a methodological standpoint, mixed-effects modeling provides a useful framework for capturing both shared seasonal trends and regional variability. However, further work is needed to address model limitations related to overdispersion, residual structure, and extreme responses.
Future research should focus on improving spatial and temporal resolution of both health and environmental data, expanding the temporal coverage of EDV records, and enhancing access to integrated datasets. Improved data availability will be critical for better understanding lagged effects, vulnerability factors, and regional disparities in heat-related health outcomes.
Overall, this study contributes to the growing evidence linking extreme heat to increased healthcare demand and provides a basis for developing regionally adaptive public health strategies aimed at mitigating the impacts of a changing climate on population health.

Author Contributions

Conceptualization, L.B.; methodology, L.B., T.H., B.Z. and A.T.; software, T.H. and B.Z.; validation, L.B., T.H. and B.Z.; formal analysis, L.B., T.H., B.Z. and A.T.; investigation, L.B., T.H., B.Z. and A.T.; resources, L.B.; data curation, T.H., B.Z. and A.T.; writing—original draft preparation, L.B., T.H. and B.Z. ; writing—review and editing, L.B.; visualization, T.H. and B.Z.; supervision, L.B.; project administration, L.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Ethical review and approval were waived for this study since it is Not applicable.

Data Availability Statement

Data used in this study is available in the public domains: Centers for Disease Control and Prevention’s (CDC’s) National Environmental Public Health Tracking Network (https://ephtracking.cdc.gov/DataExplorer/) and the Physical Sciences Laboratory of the National Oceanic and Atmospheric Administration (NOAA) (https://psl.noaa.gov/data/gridded/data.cpc.globaltemp.html).

Acknowledgments

During the preparation of this manuscript, the author(s) used Microsoft Co-pilot M365 for the purposes of improving text clarity and grammar correctness. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
CDC Centers for Disease Control and Preventions
EDV Emergency Department Visits
HHSRs Health and Human Service Regions
HHS Health and Human Service
HI Heat Index
NOAA National Oceanic and Atmospheric Administration

Appendix A

Table A1. Seasonality metrics for EDV: P T : Peak timing (number of days relative to the start of the year); σ P T : P T Standard Error; γ : Amplitude (Degrees Celsius); σ γ 2 : γ Variance
Table A1. Seasonality metrics for EDV: P T : Peak timing (number of days relative to the start of the year); σ P T : P T Standard Error; γ : Amplitude (Degrees Celsius); σ γ 2 : γ Variance
Region Peak Timing ( P T ) σ P T Amplitude ( γ ) σ γ 2
Region 1 192.9 0.9617 4.596 0.0835
Region 2 191.8 0.8782 4.791 0.0812
Region 3 192.4 0.7399 3.964 0.0314
Region 4 196.2 0.5312 2.299 0.0025
Region 5 187.5 0.9239 3.451 0.0341
Region 6 197.5 0.5739 2.487 0.0036
Region 7 193.2 0.7998 3.131 0.0169
Region 8 194.3 0.5347 3.607 0.0116
Region 9 198.8 0.5682 2.887 0.0056
Region 10 195.4 1.338 4.479 0.1396
Table A2. Seasonality metrics for temperature: P T : Peak timing (number of days relative to the start of the year); σ P T : P T Standard Error; γ : Amplitude (Degrees Celsius); σ γ 2 : γ Variance
Table A2. Seasonality metrics for temperature: P T : Peak timing (number of days relative to the start of the year); σ P T : P T Standard Error; γ : Amplitude (Degrees Celsius); σ γ 2 : γ Variance
Region Peak Timing ( P T ) σ P T Amplitude ( γ ) σ γ 2
Region 1 202.5 0.4661 14.08 0.0127
Region 2 201.7 0.4995 13.17 0.0128
Region 3 199.1 0.5592 11.94 0.0132
Region 4 197.8 0.5717 9.314 0.0084
Region 5 200.5 0.4539 15.06 0.0138
Region 6 198.7 0.5521 10.64 0.0102
Region 7 200.8 0.5706 14.01 0.0189
Region 8 202.4 0.4612 15.29 0.0147
Region 9 203.0 0.4323 11.85 0.0078
Region 10 197.1 0.3158 15.23 0.0068
Table A3. Seasonality metrics for heat index: P T : Peak timing (number of days relative to the start of the year); σ P T : P T Standard Error; γ : Amplitude (Degrees Celsius); σ γ 2 : γ Variance
Table A3. Seasonality metrics for heat index: P T : Peak timing (number of days relative to the start of the year); σ P T : P T Standard Error; γ : Amplitude (Degrees Celsius); σ γ 2 : γ Variance
Region Peak Timing ( P T ) σ P T Amplitude ( γ ) σ γ 2
Region 1 207.4 0.5505 13.76 0.017
Region 2 204.7 0.5467 14.45 0.0185
Region 3 202.4 0.6004 13.69 0.02
Region 4 200.4 0.6351 10.98 0.0144
Region 5 201.7 0.5132 16.05 0.0201
Region 6 197.1 0.6368 12.24 0.018
Region 7 200.3 0.5618 16.39 0.0251
Region 8 203.1 0.4388 14.53 0.012
Region 9 207.9 0.5274 10.51 0.0091
Region 10 205.2 0.5231 11.58 0.0109

Appendix B

Figure A1. STL decomposition for EDV region 5
Figure A1. STL decomposition for EDV region 5
Preprints 218939 g0a1
Figure A2. STL decomposition for temperature region 5
Figure A2. STL decomposition for temperature region 5
Preprints 218939 g0a2
Figure A3. STL decomposition for heat index region 5
Figure A3. STL decomposition for heat index region 5
Preprints 218939 g0a3
Figure A4. Cross-correlations between EDV and temperature STL remainders
Figure A4. Cross-correlations between EDV and temperature STL remainders
Preprints 218939 g0a4
Figure A5. Cross-correlations between EDV and heat index STL remainders
Figure A5. Cross-correlations between EDV and heat index STL remainders
Preprints 218939 g0a5

References

  1. Anderson, G. B., Bell, M. L., Peng, R. D. (2013). Methods to Calculate the Heat Index as an Exposure Metric in Environmental Health Research. Environmental Health Perspectives, 121(10), 1111–1119. [CrossRef]
  2. Chen, K., de Schrijver, E., Sivaraj, S., Sera, F., Scovronick, N., Jiang, L., Roye, D., Lavigne, E., Kyselý, J., Urban, A., Schneider, A., Huber, V., Madureira, J., Mistry, M. N., Cvijanovic, I., Gasparrini, A., Vicedo-Cabrera, A. M. (2024). Impact of population aging on future temperature-related mortality at different global warming levels. Nature Communications, 15, 1796. [CrossRef]
  3. Cleveland, R. B., Cleveland, W. S., McRae, J. E., Terpenning, I. (1990). STL: A Seasonal-Trend Decomposition Procedure Based on Loess. Journal of Official Statistics, 6(1), 3–33.
  4. Climate Impact Lab and United Nations Development Programme. (2024). Human climate horizons. (Global climate risk and mortality projections report). [CrossRef]
  5. Feng, C., Li, L., Sadeghpour, A. (2020). A comparison of residual diagnosis tools for diagnosing regression models for count data. BMC Medical Research Methodology, 20(1), 175. [CrossRef]
  6. Gronlund, C. J., Zanobetti, A., Schwartz, J. D., Wellenius, G. A., O’Neill, M. S. (2014). Heat, heat waves, and hospital admissions among the elderly in the United States, 1992–2006. Environmental Health Perspectives, 122(11), 1187–1192. [CrossRef]
  7. Hartig, F. (2024). DHARMa: Residual Diagnostics for Hierarchical (Multi-Level / Mixed) Regression Models [Computer software manual]. Available online: http://florianhartig.github.io/DHARMa/ (accessed on). (R package version 0.4.7).
  8. Liss, A., Wu, R., Chui, K. K., Naumova, E. N. (2017). Heat-Related Hospitalizations in Older Adults: An Amplified Effect of the First Seasonal Heatwave. Scientific Reports, 7, 39581. [CrossRef]
  9. National Oceanic and Atmospheric Administration (NOAA). (2024). Summary of natural hazard statistics for 2024 in the united states. Available online: https://www.weather.gov/media/hazstat/sum24.pdf (accessed on).
  10. Naumova, E. N., Simpson, R. B., Zhou, B., Hartwick, M. A. (2022). Global seasonal and pandemic patterns in influenza: An application of longitudinal study designs. International Statistical Review, 90(S1), S82-S95. Available online: https://onlinelibrary.wiley.com/doi/abs/10.1111/insr.12529 (accessed on). [CrossRef]
  11. Runge, J., Petoukhov, V., Kurths, J. (2014). Quantifying the Strength and Delay of Climatic Interactions: The Ambiguities of Cross Correlation and a Novel Measure Based on Graphical Models. Journal of Climate, 27(2), 720–739. [CrossRef]
  12. Thiel, J., Seim, A., Stephan, B., Sedlmayr, M., Prochaska, E., Henke, E. (2025). The Spectrum of Heat-Related Diseases: A Meta-Review. International Journal of Public Health, 70. [CrossRef]
  13. Vaidyanathan, A., Gates, A., Brown, C., Prezzato, E., Bernstein, A. (2024). Heat-Related Emergency Department Visits — United States, May–September 2023. Morbidity and Mortality Weekly Report, 73(15), 324–329. [CrossRef]
  14. Vaidyanathan, A., Saha, S., Vicedo-Cabrera, A. M., Gasparrini, A., Abdurehman, N., Jordan, R., Hawkins, M., Hess, J., Elixhauser, A. (2019). Assessment of extreme heat and hospitalizations to inform early warning systems. Proceedings of the National Academy of Sciences, 116(12), 5420–5427. [CrossRef]
  15. Xing, Y., Xu, R., Xu, Z., Li, Z., Yang, Z., Zhang, Y., Huang, W., Yu, P., Li, S., Guo, Y. (2026). Mapping heat-related morbidity burden attributable to human-induced climate change across 460 communities of Victoria, Australia. Environmental Health. [CrossRef]
1
National Weather Service, “The Heat Index Equation,” https://www.wpc.ncep.noaa.gov/html/heatindex_equation.shtml
Figure 1. The ten Health and Human Services regions. Image taken from hhs.gov.
Figure 1. The ten Health and Human Services regions. Image taken from hhs.gov.
Preprints 218939 g001
Figure 2. Percentage of Elderly Population for different HHS regions (Year 2018)
Figure 2. Percentage of Elderly Population for different HHS regions (Year 2018)
Preprints 218939 g002
Figure 3. Percentage of Non-White Population (Year 2000)
Figure 3. Percentage of Non-White Population (Year 2000)
Preprints 218939 g003
Figure 4. Percentage of Forest area for different HHS regions (Year 2021)
Figure 4. Percentage of Forest area for different HHS regions (Year 2021)
Preprints 218939 g004
Figure 5. Seasonal models for EDV (per 100,000 population)
Figure 5. Seasonal models for EDV (per 100,000 population)
Preprints 218939 g005
Figure 6. Seasonal models for temperature (degrees Celsius)
Figure 6. Seasonal models for temperature (degrees Celsius)
Preprints 218939 g006
Figure 7. Seasonal models for heat index (degrees Celsius)
Figure 7. Seasonal models for heat index (degrees Celsius)
Preprints 218939 g007
Figure 8. Peak Timing for EDV (days from the beginning of the year)
Figure 8. Peak Timing for EDV (days from the beginning of the year)
Preprints 218939 g008
Figure 9. Peak Timing for Maximum Temperature (days from the beginning of the year)
Figure 9. Peak Timing for Maximum Temperature (days from the beginning of the year)
Preprints 218939 g009
Figure 10. Peak Timing for Heat Index (days from the beginning of the year
Figure 10. Peak Timing for Heat Index (days from the beginning of the year
Preprints 218939 g010
Figure 11. MSEs for temperature models at different train-test splits. Top: Data ending in 2025. Bottom: Data ending in 2022.
Figure 11. MSEs for temperature models at different train-test splits. Top: Data ending in 2025. Bottom: Data ending in 2022.
Preprints 218939 g011
Figure 12. Temperature model predictions using a 70-30 train-test split. Training data shown in blue and test data shown in red. The black line represents the model predictions and the orange lines represent the approximate 95% predictions intervals (±2 standard errors). Top: Data ending in 2025 using model 4. Bottom: Data ending in 2022 using model 6.
Figure 12. Temperature model predictions using a 70-30 train-test split. Training data shown in blue and test data shown in red. The black line represents the model predictions and the orange lines represent the approximate 95% predictions intervals (±2 standard errors). Top: Data ending in 2025 using model 4. Bottom: Data ending in 2022 using model 6.
Preprints 218939 g012
Figure 13. MSEs for heat index at different train-test splits. Data ends in 2022.
Figure 13. MSEs for heat index at different train-test splits. Data ends in 2022.
Preprints 218939 g013
Figure 14. Predictions for heat index models with the optimal model (model 5) and a 70-30 train-test split. Training data shown in blue and test data shown in red. The black line represents the model predictions and the orange lines represent the approximate 95% predictions intervals (±2 standard errors).
Figure 14. Predictions for heat index models with the optimal model (model 5) and a 70-30 train-test split. Training data shown in blue and test data shown in red. The black line represents the model predictions and the orange lines represent the approximate 95% predictions intervals (±2 standard errors).
Preprints 218939 g014
Figure 15. Temperature model comparisons using the Bayesian Information Criteria (BIC). Lower values indicate better model fit.
Figure 15. Temperature model comparisons using the Bayesian Information Criteria (BIC). Lower values indicate better model fit.
Preprints 218939 g015
Figure 16. Heat Index model comparisons using the Bayesian Information Criteria (BIC). Lower values indicate better model fit.
Figure 16. Heat Index model comparisons using the Bayesian Information Criteria (BIC). Lower values indicate better model fit.
Preprints 218939 g016
Figure 17. Temperature model predictive power measured by Mean Square Error(MSE) using the data period. Lower values indicate better predictive performance.
Figure 17. Temperature model predictive power measured by Mean Square Error(MSE) using the data period. Lower values indicate better predictive performance.
Preprints 218939 g017
Figure 18. Heat Index model predictive power measured by Mean Square Error(MSE). Lower values indicate better predictive performance.
Figure 18. Heat Index model predictive power measured by Mean Square Error(MSE). Lower values indicate better predictive performance.
Preprints 218939 g018
Figure 19. DHARMa residual diagnostics plots for the best temperature model with NB2 response: left panel: QQ/uniformity plot; right panel: scaled residuals vs. rank-transformed predicted values
Figure 19. DHARMa residual diagnostics plots for the best temperature model with NB2 response: left panel: QQ/uniformity plot; right panel: scaled residuals vs. rank-transformed predicted values
Preprints 218939 g019
Figure 20. DHARMa residual diagnostics plots for the best heat index model with NB2 response: left panel: QQ/uniformity plot; right panel: scaled residuals vs. rank-transformed predicted values.
Figure 20. DHARMa residual diagnostics plots for the best heat index model with NB2 response: left panel: QQ/uniformity plot; right panel: scaled residuals vs. rank-transformed predicted values.
Preprints 218939 g020
Table 1. Adjusted and unadjusted ICCs for best temperature and heat index models.
Table 1. Adjusted and unadjusted ICCs for best temperature and heat index models.
Covariate Adjusted ICC Unadjusted ICC
Model Family Poisson NB2 Poisson NB2
Temperature (2018-2022) 1.000 0.115 0.086 0.040
Temperature (2018-2025) 1.000 0.137 0.102 0.044
Heat index (2018-2022) 0.999 0.106 0.058 0.036
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated