Preprint
Article

This version is not peer-reviewed.

AI-Augmented Water Quality Event Response: The Role of Generative Models for Decision Support

A peer-reviewed article of this preprint also exists.

Submitted:

22 September 2025

Posted:

23 September 2025

You are already at the latest version

Abstract
The global water sector faces unprecedented challenges from climate change, rapid urbanisation, and ageing infrastructure, necessitating a shift towards proactive, digital strategies. Historically characterised as "data rich but information poor," the sector struggles with underutilised and siloed operational data. While traditional Machine Learning (ML) and Deep Learning (DL) models have provided foundations for smart water management, their effectiveness has been limited by the scarcity of high- fidelity, structured data. This paper explores and reviews the transformative potential of Generative Artificial Intelligence (GenAI) and Large Language Models (LLMs) to overcome these barriers, enabling a paradigm shift towards data-centric thinking. GenAI, particularly when augmented with Retrieval-Augmented Generation (RAG) and agentic AI, can create new content, facilitate natural language interaction, synthesise insights from vast unstructured data (of all types including text, images and video) and automate complex, multi-step workflows. Focusing on the critical area of drinking water quality, we demonstrate how these intelligent tools can move beyond reactive systems. A case study is presented which utilises regulatory reports to mine knowledge, providing GenAI-powered chatbots for accessible insights and improved water quality event management. This approach empowers water professionals with dynamic, trustworthy decision support, enhancing the safety and resilience of drinking water supplies by recalling past actions, generating novel insights and simulating response scenarios.
Keywords: 
;  ;  ;  ;  ;  ;  ;  ;  

1. Introduction

The global water sector is navigating a period of profound transformation, driven by the intersecting pressures of climate change, rapid urbanisation, population growth, increasing consumer expectations and ageing infrastructure the latter of which is not far from being at a critical level in developed western countries such as the UK [1]. These challenges place unprecedented strain on the management of water resources, demanding a paradigm shift from traditional, often reactive, operational models toward more proactive, efficient, and resilient strategies. Central to this transition is the concept of digitalisation, which promises to reshape how water utilities monitor, manage, and optimise their complex networks. However, the water sector has historically been characterised as ‘data rich but information poor (DRIP)’, a condition where vast amounts of operational data are collected but remain siloed and underutilised, hindering the extraction of actionable knowledge. Water companies are actually interested in information and knowledge, not raw data. Knowledge comes from understanding the information about a subject, and then using it to make decisions, form judgments/opinions, or make predictions.
For decades, the discipline of hydroinformatics has sought to bridge this gap by applying computational intelligence, information technology, and data science to solve complex water management problems. Hydroinformatics is concerned with the development and hydrological application of mathematical modelling, information technology, data science and Artificial Intelligence (AI) tools. It provides computer-based decision-support systems that are solutions for areas such as Smart Networks for leakage management. Early data-driven models, including Artificial Neural Networks (ANNs), Support Vector Machines (SVMs), and Random Forests (RFs), have been applied to various tasks such as prediction, classification and data mining demonstrating the value of machine learning in the water domain [1]. These tools have provided the foundation for smarter water management, enabling applications in leakage detection, asset condition assessment, and demand forecasting. Yet, these model-centric approaches often depend on high-fidelity, structured data, the scarcity of which can limit their effectiveness and scalability [2]. In addition, they remain hard-data centric, missing extensive softer—often linguistic—sources that embed high-value context and tacit expert knowledge which is now being lost to workforce ageing and retirement. With the rapid development of AI tools it has been suggested that a paradigm shift toward more inclusive data-centric thinking and away from model-centric approaches (based on internal computational architectures and numerical data only) is needed in the broader water industry [3].
Deep learning (DL) is when big data intersects with machine learning techniques which allow the tackling of problems that exceed human understanding [4]. Deep learning’s important innovation is to have ANNs learn features incrementally, attempting to model lower-level details (like letters) before attempting to acquire higher-level features and categories (like words). Deep learning excels at this sort of problem and has transformed the application of AI in the last decade allowing for the direct transition from data to action. Their usage has been facilitated by big data on top of next-generation formulations and algorithmic breakthroughs. Such Deep learning is at the heart of revolutionary technologies such as Generative AI.
The recent and rapid emergence of Generative Artificial Intelligence (GenAI), and particularly Large Language Models (LLMs), represents a fundamental technological inflection point with the potential to empower data-centric thinking and help overcome many traditional barriers [5]. Unlike earlier AI systems that primarily focused on quantitative analysis and prediction, generative models are capable of creating new, original content—including text, code, images, and synthetic data—based on patterns learned from existing information [6]. This capability, powered by architectures like the Transformer [7], allows for human-like natural language interaction, sophisticated reasoning [8], and the synthesis of insights from vast quantities of unstructured data, such as technical manuals, operational logs, and regulatory reports [5]. Whilst GenAI systems offer powerful capabilities, they are prone to so-called “hallucinations,” defined as instances where the model invents details not supported by data. Recognising this limitation is vital to mitigating risks in applied settings as we shall discuss. The potential of GenAI to transform knowledge-driven fields is reshaping industries, and the water sector is now beginning to explore its applications, with early adoption and pilot projects beginning to demonstrate the value of GenAI. Figure 1 shows the results of a comprehensive search (using OpenAlex and Crossref across journals, conferences, preprints, theses etc.) in publications that mention ‘generative AI’ or related terms (e.g. generative models, GANs) in the context of water resources and hydrology. The bar chart summarises the approximate counts per year (global coverage, all publication types). In 2021 there were only a handful of such works, but by 2025 the number had surged dramatically.
A chatbot is a program (powered by ML algorithms) within an app that interacts directly with users to help with simple tasks. ChatGPT allows the user to ask questions in natural language, to which the chatbot responds. Platforms such as OpenAI allow the creation of AI Assistants or specialised ‘GPTs’ for particular doman tasks. Prompt engineering is utilised to craft instructions that guide the AI to generate desired outputs. A key enabling technology for these advanced applications is Retrieval-Augmented Generation (RAG), which grounds an LLM’s responses in external, verified knowledge sources. By retrieving relevant information from a utility’s internal databases or technical documents before generating a response, RAG ensures that the AI’s output is accurate, up-to-date, and context-specific, thereby mitigating the risk of factual errors or "hallucinations" [9]. A significant evolution in this space is the development of agentic AI, where AI systems, or "agents," are designed to autonomously perceive their environment, make decisions, and execute multi-step tasks to achieve specific goals [10]. This moves beyond simple question-answering systems to create a "virtual workforce" capable of automating complex workflows, from data analysis and report generation to operational control. The concept of AI agents has been identified by Gartner as the top strategic technology trend for 2025 and holds immense promise for orchestrating the multifaceted operations of a smart water network. 89% of CIOs recently surveyed consider agent-based AI a strategic priority for enhancing decision-making and business process orchestration [11]. In the future, we may see the emergence of fully autonomous "AI scientists" capable of generating novel hypotheses, designing and executing experiments, and writing full scientific papers with minimal human intervention. See Figure 2.
After exploring the digital transformation of the water sector and some of the latest work in GenAI, this paper focuses on a critical, high-stakes application area within the water sector: the management of drinking water quality. The provision of safe, clean water is a paramount public health responsibility, and incidents such as contamination events, discolouration, or microbiological failures require rapid and effective decision-making to protect consumers and comply with strict regulatory standards. We then explore how LLMs, RAG, and agentic AI can develop in the future to create a new generation of intelligent tools. These tools will not only recall past actions but also generate novel insights, simulate response scenarios, draft communications, and orchestrate complex, multi-step workflows. Through a discussion of the state of the art and a case study focused on a GenAI-powered chatbot for water quality incident management, this paper will outline a vision for an AI-augmented future where water professionals are empowered with dynamic, intelligent, and trustworthy decision support systems to ensure the safety and resilience of our drinking water supplies.

2. Background: Digital Transformation and AI in the Water Sector

The “Fourth Industrial Revolution” and “The Second Machine Age” have been used to describe current efforts to digitise sectors, from government to health care to education to manufacturing to the water industry. Digitalisation is transforming the world we live in and new data will need to be integrated successfully by reimagining business processes. These new technologies have the potential to deliver significant positive outcomes in the water sector. Recent analyses suggest that by 2030, digital transformation technologies—especially AI and platform-based business models—could generate nearly US$20 trillion in additional global economic value [12], with as much as 70% of new value created over the next decade being digitally enabled [13].
Digitalisation has the potential to take water companies into the 21st century where almost everything can be measured. High-fidelity data is required as an essential building block for this transformation. Innovations such as PipeBots are designed to provide this necessary data by enabling a shift from limited, fixed-location (Eulerian) sensing to pervasive, Lagrangian (mobile) sensing within buried pipe networks, generating potentially massive volumes of real-time information [14]. This increased data frequency and geo-distribution provide the granularity required to produce actionable information, thereby enabling the development of site-specific, continually updated predictive models that support a move from reactive to truly proactive maintenance and system performance management. These virtual representations of water systems or “digital twins’ will enable situational awareness and/or near-real time hydraulic and quality monitoring, which has the potential to solve many of the business challenges faced by the industry, such as improving efficiency, resilience, leakage and predictive maintenance. Digital water is about setting the foundation for water companies to begin applying data science and related technologies (such as IoT, AI, blockchain and augmented reality) to amplify the power of data to optimise real-world decision-making across water networks.
Our society is founded on distributed infrastructure, dependent on it to deliver our fundamental needs; such as pipe networks for clean drinking water. In the UK this infrastructure is ageing and deteriorating and for water in particular we live off a legacy of past over-engineering. Water utilities are increasingly moving from traditional, reactive operational models toward more proactive, efficient, and data-driven strategies [1]. This shift, often termed digital transformation, involves integrating advanced technologies like the Internet of Things (IoT), cloud computing, and artificial intelligence (AI) to turn massive volumes of data into actionable knowledge. Water utilities collect vast amounts of data from sources like SCADA systems, GIS databases, asset records, and customer contacts. However, this data often remains in isolated "silos," stored in different formats across separate platforms, which hinders integrated analysis and decision-making. It has been estimated that some water utilities might use as little as 10% of the data they collect, highlighting a significant gap between data availability and its application. The journey toward a smarter water sector has been marked by the evolution of data-driven modeling techniques, not just collecting more data, but integrating it and applying advanced analytics to extract meaningful patterns and insights. To date, most digital efforts have centred on hard numerical data (SCADA time series, network telemetry etc), leaving out often text based operational evidence. Emerging GenAI/LLM/MLLM tools can now parse and align unstructured text and other inputs with the hard numeric data, extracting entities and events with provenance so decisions are richer, faster and more auditable.
For decades, researchers and utilities have applied Machine Learning (ML) models to address specific challenges. These "model-centric" approaches focused on leveraging algorithms to find patterns in historical data. Areas include:
  • Prediction and Forecasting: Artificial Neural Networks (ANNs), Support Vector Machines (SVMs), and Random Forests have been widely used for tasks like water demand forecasting, water quality prediction and stream flow forecasting.
  • Classification and Anomaly Detection: These models are also used for classification tasks, such as identifying pipes at high risk of failure or detecting anomalies in sensor data that could indicate a leak or contamination event.
  • Asset Management: In asset management, AI has been used for automated condition assessment. For example, computer vision combined with ML methods like Random Forest has been successfully applied to analyse CCTV footage of sewer pipes, automatically detecting faults like cracks and displaced joints with high accuracy [15].
The recent explosion in data volume, driven by the proliferation of IoT sensors and smart meters, has fueled the adoption of Deep Learning [4] , a subfield of ML that uses multi-layered neural networks [16,17]. Key architectures:
  • Convolutional Neural Networks (CNNs) are adept at capturing spatial patterns, making them ideal for analysing geospatial data like satellite imagery for land use classification or remote sensing of water bodies.
  • Recurrent Neural Networks (RNNs), and particularly their advanced variant Long Short-Term Memory (LSTM), are designed to handle sequential data, making them state-of-the-art for time-series forecasting of hydrological variables like rainfall, stream flow and water demand.
  • Hybrid Models: researchers are increasingly creating hybrid models that combine the strengths of different architectures. A CNN-LSTM model, for instance, can use CNN layers to extract spatial features from weather maps and LSTM layers to model the temporal evolution of a flood event.
While the aforementioned DL models primarily focus on analysis and prediction, the recent emergence of GenAI and Large Language Models (LLMs) represents a new paradigm. GenAI models, particularly LLMs like GPT-5 and Large Multimodal Models (MLLMs) that can process images and text, data, images and video are capable of creating new, original content. This capability will transform how the water sector can address data scarcity, manage knowledge, and interact with complex systems.
Applications range from enhancing operational efficiency [18] and predictive maintenance to improving customer engagement and knowledge management [5]. For instance, Multimodal Large Language Models (MLLMs) can unlock vast amounts of domain knowledge trapped in physical archives by accurately transcribing handwritten notes and complex forms from legacy documents [19]. This is particularly valuable for creating comprehensive historical records to train AI models for detecting rare events like pipeline leaks or water quality anomalies. GenAI is reshaping traditional engineering by enabling intuitive, natural language interactions with complex systems. LLMs can assist in generating solutions, analysing data, or writing code and be supplemented with technical manuals or integrated with analytical tools, providing context-specific information for knowledge management for areas such as asset management, predictive maintenance and demand management [20].
Some of the more recent academic research that demonstrates the possibilities of GenAI:
  • A powerful application of GenAI is in creating high-fidelity synthetic data to augment limited real-world datasets, which is crucial for training robust AI models. For example, a generative AI approach was developed for spatiotemporal imputation and demand prediction in water distribution systems, exploiting sensor data to reconstruct missing data and improve forecasting accuracy. [21] Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs) have been used to generate realistic time-series data for water demand, synthetic acoustic signals for leak detection, and patterns of water quality parameters to train anomaly detection systems. A recent study explored using LLMs to generate synthetic hydrological data (e.g., daily rainfall and reservoir levels) by providing prompts engineered with key statistical properties from a real dataset. Models trained on the augmented dataset showed improved forecasting accuracy [22]. Generative models like VAEs, GANs and diffusion models can be trained on digital twin data for data augmentations, hence generating high-fidelity synthetic water consumption samples already labeled to train supervised learning models and can simulate different scenarios such as changes in water demand, infrastructure failures, or weather conditions [23].
  • A multi-agent LLM framework has been presented to automate water-distribution optimisation [24]. An Orchestrating Agent coordinates knowledge, modelling (EPANET), and coding agents for hydraulic model calibration and pump-operation optimisation. Tested on Net2 and Anytown, the Coding agent performed most accurately across both benchmarks; natural-language reasoning lacked numerical precision, while tool integration improved reliability.
  • Projects like WaterER and DRACO are creating standardised benchmarks to rigorously evaluate the performance of different LLMs (e.g., GPT-4, Gemini, Llama) on domain-specific tasks. The Drinking water and wAstewater COgnitive assistant (DRACO) initiative [25] released a dataset for benchmarking open- and closed-source models and found that state-of-the-art open-weight models can match, and sometimes outperform, proprietary systems, which is promising for utilities seeking secure, on-premise deployments.
  • Researchers can use LLMs to perform AI driven literature mapping. For example, using an LLM and geocoding on 310,000 hydrology papers (1980–2023), a study mapped basin trends, collaborations, and hotspots [26]. Research has surged since the 1990s, shifting from groundwater/nutrients toward climate change and ecohydrology. Activity concentrates in North America and Europe, with rising China/South Asia attention but persistent gaps in extreme-rainfall regions.
  • HydroSuite-AI, an LLM-enhanced web application that generates code snippets and answers questions related to open-source hydrological libraries, lowering the entry barriers for researchers and practitioners in the field without extensive programming expertise [9].
  • A vast repository of knowledge in the water sector remains locked in non-machine-readable legacy documents like handwritten maintenance logs, scanned reports, and old maps. Traditional Optical Character Recognition (OCR) engines like Tesseract struggle with so called "dark data," especially with complex layouts and handwriting. A case study on Danish water utility documents demonstrated that a Multimodal Large Language Model (MLLM), Qwen-VL, dramatically outperformed Tesseract [18]. The MLLM reduced the Word Error Rate by nearly nine fold, accurately transcribing handwritten notes and correctly interpreting complex forms in a zero-shot setting.

3. Drinking Water Quality AI Analytics

3.1. Overview

The imperative to deliver safe, clean drinking water and manage wastewater effectively places water quality management at the forefront of the global water sector's priorities. A US study recorded 7.15 million water-borne illnesses annually, with > 600k emergency department visits, 118k hospitalisations, 6,630 deaths and annual economic cost of $3.33 billion [27]. Water quality is a multifaceted issue, influenced by an increasing array of pollutants, including micro-pollutants and trace contaminants (such as PFAS). Managing these threats is central to public health and environmental quality. Monitoring is inherently expensive and labor-intensive, spanning the entire water cycle from wastewater resource recovery facilities and combined sewer overflows (CSOs) to rivers, lakes, and oceans. Many variables crucial for process optimisation, such as microbial activity, cannot be directly measured, necessitating inference from indirect measurements and domain knowledge. Moreover, water quality is often subjected to multiple stressors (e.g., climate change, land use, management practices) that can exert complex synergistic and antagonistic effects, making prediction and management challenging.
Water quality in the UK's approximately 1,500 water treatment works is well understood, the challenge is to monitor water quality in the ~300,000km of pipe in the UK distribution network. Water quality data in the distribution network is obtained from manual sampling, portable water quality monitors and a relatively small but growing number of fixed water quality monitors. Customers and regulators are demanding ever greater water quality, however water utilities are unable to deliver because they have inadequate data and undestanding or insights into what is happening in their networks. This critical domain is fraught with complexities on the operational management side, stemming from DRIP [1], the presence of legacy systems, disparate and siloed data sources (e.g., SCADA, GIS, IoT sensors, asset management databases), and varied data formats often result in information silos, limiting real-time monitoring and hindering effective decision-making. Beyond the data layer, water systems function as high-surface-area physical, chemical, and biological reactors, where interacting processes—mixing, adsorption/desorption, redox transformations, corrosion and scaling, biofilm growth and sloughing, pathogen die-off/regrowth and natural organic matter dynamics—evolve across multiple spatial and temporal scales. The data fragmentation and process uncertainty create an operational environment where situational awareness is partial, model confidence can be fragile and risk-informed decisions must be made under persistent uncertainty. Water quality data (parameters such as water temperature, turbidity, conductivity, colour, pH, etc.) is not currently collected in a very consistent and automated manner across networks, and these sensor technologies (and their price) have moved little in the last decade. As the technology improves and the whole life cost of ownership falls, water quality instruments could be used for the provision of operational data and become as widespread as for flow metering. Water quality data analytics are relatively undeveloped because there are limited significant deployments of real-time water quality monitoring within networks [28,29]. Currently data/information access is often limited to secure remote platforms only at the premises of the service provider or data generator or on non-standard and expensive portable platforms. Furthermore, critical contextual information, such as the physical and topological relationships between measurements, often remains in human-interpretable formats (e.g., SCADA synoptics) rather than machine-readable ones. Future technologies promise enhanced sensors for natural biological/biochemical markers. Interpretation and analysis across multiple online parameters is expected to provide deeper understanding of water supply system state and asset condition. Mobile sensing platforms including robots and drones could be game changing for providing intelligence for applications including interpretation and mission planning [14].

3.2. Traditional Machine Learning and Deep Learning in Water Quality

In recent decades, data-driven models, particularly those based on machine learning and deep learning, have been deployed to address specific water quality challenges. These approaches aim to map inputs to outputs (for hard numerical data) without necessarily modeling the underlying physical processes in detail [30].
A range of application areas have been explored, many centering on prediction/ forecasting, in the sphere of anomaly detection from monitoring and for data mining. Areas concerned with distribution of potable water include:
  • Water Quality Parameter Prediction. Early neural networks were applied to water quality modeling [31,32]. More recently, Long Short-Term Memory (LSTM) networks have demonstrated success in predicting specific water quality parameters, such as dissolved oxygen (DO) levels, effectively capturing peaks and troughs, even during periods of low stream flow and data fluctuations [33]. Hybrid models, like the SOD-VGG-LSTM, have achieved high accuracy in short-term water quality prediction, particularly in capturing extreme values. An LSTM-based encoder-decoder model has also been shown to predict water quality variables with satisfactory accuracy after data denoising [34]. Further advancements include hybrid encoder-decoder BiLSTM models with attention mechanisms, which outperform state-of-the-art algorithms by efficiently handling noise, capturing long-term correlations, and performing dimensionality reduction [35]. Other techniques such as Ensemble decision tree ML algorithms have been applied, including for predicting low chlorine events [36].
  • Potability Prediction. DL models, including Multi-Layer Perceptrons (MLP), LSTM, and Gated Recurrent Units (GRU), have been comparatively analysed for water potability classification in rural areas, with CNNs achieving higher accuracy than other ML models like Random Forest, Decision Jungle, Naive Bayes, Logistic Regression and SVM. These predictions often rely on parameters such as pH, total dissolved solids, chloramines, sulfate and turbidity, with data often sourced from public datasets like Kaggle [22].
  • Contamination Event Detection. ML and DL models are widely used for water quality anomaly detection, addressing point, contextual, and collective anomalies [30,37,38]. A hybrid framework combining deep learning and extreme machine learning has been proposed for detecting anomalies in high-dimensional water quality data [39]. Most data-driven methods suffer drawbacks associated with high computational cost, imbalanced anomalous-to-normal data ratio resulting in high false positive rates and poor handling of missing data [22]. LSTM models have been specifically designed for monitoring water distribution systems, detecting anomalies based on changes over time in water indicators [40].
  • Remote Sensing and Vision-based AI. Remote sensing data enables ML to serve as an efficient alternative to traditional manual laboratory analysis, offering real-time detection feedback crucial for rapid contamination detection [21]. Computer vision techniques are also being employed for tasks such as algal bloom monitoring. Efforts are directed towards using advanced image description mechanisms to detect and analyse water pollution sources, facilitating prompt intervention and mitigation measures [41].
  • Data mining for data visualisation and knowledge discovery for water quality. Self-organising maps (SOM), a form of unsupervised artificial neural network (ANNs) [42], have been demonstrated in water distribution system data mining for microbiological and physico-chemical data at laboratory scale [43]; in clustering of water quality, hydraulic modelling and asset data for a single water supply zone [44]; for multiple DMAs [45]; in interpreting the risk of iron failures [46]; in relating water quality and age in drinking water [47] and in exploring the rate of discolouration material accumulation in drinking water [48]. Other approaches to data-driven visualisation/mapping and cluster analysis include using k-means (typically with Euclidean distance), hierarchical clustering, distribution models (such as the expectation-maximisation algorithm), fuzzy clustering and density-based models, e.g. DBSCAN [49] and Sammon’s projection [50]. Work has also developed techniques to explore correlations (using semblance analysis) between water quality and hydraulic data streams in order to assess asset deterioration and performance [28].
  • Model calibration and tuning. A combined accumulation–mobilisation model for distribution-system discolouration was calibrated and validated using particle swarm optimisation [51]. The formulation tracks pipe-wall deposit build-up and erosion with two empirical parameters, reproducing turbidity responses from field trials. Results enable risk-based planning of flushing and maintenance, forecasting discolouration propensity across assets and operating regimes.

3.3. Generative AI for Enhanced Water Quality Modelling

The advent of GenAI, LLMs and MLLMs offers a paradigm shift in how water quality challenges can be addressed, moving beyond mere analysis and prediction to content creation, advanced knowledge management and robust decision support. Crucially, approaches once confined to hard numerical data can now unlock and integrate “softer” evidence: operator logs, maintenance tickets, customer contacts, images/video and expert narratives via LLM-driven extraction and linking. This multimodal fusion strengthens signal detection and root-cause hypotheses by aligning qualitative cues with quantitative trends, reducing silos and improving interpretability. In practice, provenance tracking and uncertainty annotations helps keep the softer insights auditable while enhancing the value of the hard data. Some of the more recent academic research that applies GenAI in a potable water quality context:
  • Synthetic Data Generation to address scarcity. One of the significant limitations of traditional ML/DL models in water quality is the difficulty in obtaining large amounts of high-fidelity, domain-specific data, especially for rare events like contamination. Generative Adversarial Networks (GANs) are proving useful for creating synthetic data that mimics real measurements, thereby augmenting sparse datasets and improving model robustness [34].
  • Contamination detection. A case study involved a novel GAN architecture that generated expected normal water quality patterns across multiple monitoring sites in China’s Yantian network. By comparing real measurements against these GAN-generated baseline patterns, the system effectively detected contamination events, demonstrating the model's capability to capture complex spatiotemporal relationships with high detection performance and low false alarm rates [52].
  • Multimodal Monitoring and Visualization. MLLMs (e.g., GPT-4 Vision, Gemini, LLaVA, QWEN-VL) are being evaluated for hydrological applications such as water pollution management, demonstrating exceptional proficiency in interpreting visual data for water quality assessments. These models can integrate visual and textual data for real-time monitoring and analysis, enabling dynamic, real-time visualisations of complex water quality scenarios, including potential contaminant spread patterns [35].
  • AI-Assisted Information Retrieval. LLMs are being integrated into platforms to enable conversational queries about operational data. For instance, Klir’s "Boots" chatbot, powered by ChatGPT, can summarize lab results, flag anomalies in water quality readings, and assist with report generation to regulators by drawing on the utility’s secure internal database [54].
  • AI Assistants for professionals. LLMs can serve as ‘water expert models’ to assist professionals by answering questions, summarising compliance documents (enhanced decision support), and generating reports or code [55,56]. The WaterER benchmark suite has evaluated LLMs (e.g., GPT-4, Gemini, Llama3) for their ability to perform water engineering and research tasks, demonstrating their proficiency in generating precise research gaps for papers on ‘contaminants and related water quality monitoring and assessment’ and creating appropriate titles for drinking water treatment research. It is expected that the field of hydrology can benefit from LLMs, by addressing various challenges within physical processes [57]. WaterGPT is a domain-adapted large language model for hydrology [58]. The authors curated hydrology corpora, perform incremental pretraining and supervised fine-tuning, and enable multimodal inputs. WaterGPT supports knowledge-based Q&A, hydrological analysis, and decision support across water resources tasks, showing improved performance versus general LLMs on domain benchmarks and case studies reported. Another perspective on customising general LLMs into domain-adapted “WaterGPTs” for water and wastewater management appears in [59]. The authors outline methods—prompt engineering, knowledge and tool augmentation, and fine-tuning—discuss dataset curation and ethics, and propose benchmarking tasks and evaluation suites. A roadmap highlights reliability, safety, and practical use cases across stakeholders.
In summary, AI, particularly with the advancements in generative models, is rapidly transforming water quality analytics (and the water sector in general). From synthetic data generation and advanced knowledge management to enhanced decision support and real-time monitoring, these technologies offer unprecedented opportunities. However, navigating the challenges related to data quality, explainability, ethical governance, and workforce development will be crucial to fully realise AI's potential in ensuring the safety and sustainability of water resources.

4. Case Study: Drinking Water Quality Event Management

4.1. Introduction

Drinking water quality is paramount and is consistently ranked as the top service priority for water companies. Water treatment is delivered to a consistently high quality but this high grade product is then at risk during the long transit times through vast, ageing and deteriorating water supply systems. Treated water undergoes various changes whilst travelling through the ageing distribution system infrastructure and a link exists between the age of distributed water, persistent sporadic bacteriological failures and poor water quality. In drinking water supply systems, when a hazard occurs from water treatment works to consumers’ taps, it is considered an event. Events do occassionally occur and can be aesthetic, chemical and microbiological failures. Every water company in England and Wales must inform the Drinking Water Inspectorate (DWI) of all events that have affected, or are likely to affect drinking water quality, sufficiency of supplies and where there may be a risk to health. If quality fails, actions will be flagged for consideration, and certain sampling types required (3.52 million tests were conducted in England in 2022). For example, in 2023 a total of 533 events were reported to the DWI and by 2024 this had increased to 556 overall events, with 55% of these water quality events reported outside of the control of the water company [60]. The DWI evaluates the companies’ responses to the most significant events on an annual basis via documentary reporting. Existing solutions to the problem of tackling water quality events in water supply systems typically involve manual analysis of graphs, charts and following designated procedures and performing evaluation in conference calls where necessary. Priority will depend on the issue so when a non-critical event occurs a control centre operator would look at a set of criteria and procedures (see Figure 3 for example of life cycle of an event caused by a burst main). These approaches are insular to each individual water company with varying types of procedures, working practices and response plans. This process is time-consuming, prone to human error, and unlikely to optimally utilise past learning.

4.2. ACQUIRE Project

The ACQUIRE project [61] utilises and exploits DWI reports regarding water quality events submitted by water companies to mine the complex knowledge and derive understanding from the information buried within them. These reports form a valuable resource that can be explored and exploited for improved understanding, evaluation and learning for incident management solutions. By data mining many previous water quality event reports, valuable insights into the water quality events can be obtained facilitating better responses by effectively providing a ‘triage’ view of typical actions from sector wide responses. This leads to better water quality management, reduced risks, improved regulatory compliance, enhanced customer satisfaction and water industry reputation.
Previous work [62] presented a prototype and findings on how water companies could deal more effectively with drinking water quality events by using information from previous incidents. It explored the use of Case-Based Reasoning (CBR), as a useful tool for providing diagnosis and solutions which can be used for the interpretation of, and guidance on, water quality incidents in water distribution systems. CBR is a classic knowledge-based AI technique that relies on the reuse of past experience. Similar problems from the past have similar solutions and therefore deriving solutions to new problems can be effectively addressed by reusing (and adapting) past solutions. While promising, the early CBR prototype was limited by the need for expert scoring of every DWI finding; a limited historical case base; and a complex user interface which restricted its usage to expert or academic users. A subsequent study [63] expanded this foundation by creating a comprehensive database of over 2,300 historical events from 14 years of DWI reports and used deep learning models to automate the sentiment scoring of regulatory outcomes. Further, LLMs were used to create AI assistant ‘chatbots’ to provide accessible insights. Using GenAI can generate decision support insights, aiding efficient repair and response strategies and enhance emergency response in water systems by simulating novel scenarios beyond historical data, thereby improving readiness and adaptive capacity. But it is important to underscore the need for water professionals to blend AI knowledge with engineering principles [64].

4.3. RAG for Generic Response Plan

LLMs can be both commercial/ proprietary (examples: GPT 5, Claude, Gemini, Grok) and open source (examples: Llama 3, DeepSeek, Mistral). There are various benchmarks for rating the models, with various research groups deciding which benchmarks are suitable, so in practice the choice of model is application dependent. A number of open source and rapidly developing GenAI delivery platforms and LLMs were used to build and deploy AI Assistant bots with the RAG concept of tuning on appropriate documents (see Figure 4). The RAG approach grounds the LLM’s responses in factual, up-to-date information, with a curated and high provenance document corpus and carefully constructed prompts, mitigating the risk of “hallucinations” [5,65]. RAG enhances the results achieved in both accuracy and reliability by retrieving information from trusted sources. For example, the prompt fed to the LLM can be enhanced with more up-to-date or technical information.
Compared to manual analysis, these approaches offer several significant improvements. Firstly, enabling rapid identification of previously effective interventions for a given unfolding incident, as well as those to be avoided, providing valuable insights for control rooms and water quality scientists. It also aggregates and presents actionable information for response plans, improving decision-making and response effectiveness. Furthermore, the tools can facilitate strategic data mining and provide inter industry learning (including for training of new staff). It could particularly benefit smaller water companies by providing whole industry learning and helping to gradually refine response plans over time. Also, there are many water quality events that are not large enough to be reported (in 2024, there were a total of 70,507 consumer contacts in England regarding acceptability of drinking water – up 12% since 2022), however that companies must still address through good decision making. Change needs to be cross sector, by encouraging industry learning, promoting resilience and by informing impact plans [19,66,67]. Event response is often challenged by the rarity of such events, which limits organisational learning, and the gradual loss of institutional knowledge as experienced personnel retire [19]. A notable challenge is the loss of ‘institutional memory’ and ensuring that best possible practice is being effectively shared across the sector [1]. Since the reports are public originating from all companies they are effectively ‘open data’ engendering data sharing; alleviating fears about data security and confidentiality by creating a transparent supply chain of datasets for analysts. Figure 5 shows an example of generating a response plan using RAG – a detailed sixteen phase plan is produced starting with part 1 - Immediate Incident Response.
GenAI tools using such structured data with RAG can improve water quality management and help the water sector become more efficient and cost-effective, while protecting public health. LLMs can be fine-tuned as powerful query and reporting tools in water sector operations by capturing institutional knowledge.

5. Discussion and Future Work

5.1. Overview

2025 has marked the rise of AI systems that can learn, adapt, and act independently, revolutionising decision-making and automation. A key AI development is the distinction between generative AI and agentic AI. While generative AI, exemplified by tools like ChatGPT, functions as a sophisticated pattern-matching and completion system, reacting to specific prompts without genuine understanding of context or long-term objectives, agentic AI marks a fundamental evolution. Agentic AI systems operate with a degree of autonomy, capable of setting their own goals, formulating strategies to achieve them, and adapting their approach based on changing circumstances. This makes agentic AI more akin to a proactive colleague than a reactive assistant; for instance, while generative AI might draft an email on command, an agentic AI could proactively monitor an inbox, identify important messages, draft responses based on past communications, and even schedule follow-up meetings, adapting its approach as priorities change. Human-in-the-loop (HITL) and human-on-the-loop (HOTL) approaches represent crucial safeguards as AI autonomy increases. HITL systems require human oversight at key stages—approving decisions, validating outputs, or steering learning processes—ensuring accountability, transparency and ethical alignment. HOTL systems shift this role toward supervisory control, where humans intervene only if the AI deviates or risks emerge. Together, these paradigms preserve human judgment and trust, balancing efficiency with responsibility in what is likely to become an increasingly autonomous agentic AI environment. For example, a traditional AI tool might raise a burst alert based on abnormal flow signatures to the control room; a GenAI could synthesise telemetry and customer contact data to narrow down the location and an agentic AI could then recommend practical next steps, such as repair materials required, team mobilisation options, and traffic management considerations, all pending human approval.
AI is evolving beyond simple tools to become ‘agentic colleagues’, capable of taking initiative, anticipating needs, and collaborating. Anthropic predicts AI-powered virtual employees will begin operating within corporate networks by 2026, bringing new security challenges and opportunities. Agentic systems can suggest directions, adapt to context, and fill gaps, acting more like active partners across various functions. Operationally, agentic AI fosters ‘smarter scale,’ enabling smaller teams to achieve significant output by dramatically increasing efficiency rather than just growing headcount. This necessitates a fundamental rethinking of leadership and team structures, empowering humans to shift from manual tasks to higher-value work like strategy, empathy, and creativity, while AI agents handles repetitive duties.
The revolutionary nature of agentic AI stems from its architecture, which integrates advanced planning modules, memory systems, and decision-making frameworks, as well as calling external tools. These components enable agentic systems to maintain context and pursue objectives over extended periods, break down complex tasks, prioritise actions, and even self-correct when an approach is proving ineffective. We are observing an emerging convergence of generative and agentic capabilities within mainstream AI tools. OpenAI's introduction of scheduled tasks in ChatGPT was an early indicator, allowing the AI to perform semi-autonomous actions. Further developments, such as OpenAI’s ‘Operator’, ‘Agent Mode’ and ‘Deep Research’ (with similar functionality on alternative platforms such as Claude Artifacts) suggest ambitions for AI agents that can independently control computers, proactively search for information on the web, and navigate digital environments with minimal human intervention.
Virtual employees of the future could have their own memories, corporate roles, and network accounts with greater autonomy than current AI agents. Unlike task-specific agents, virtual employees will have broader capabilities across multiple corporate systems. Such AI identities will require companies to completely rethink their cybersecurity strategies and access management. In industries like water management, Agentic AI could optimise energy usage, manage networks, and predict maintenance needs all without human intervention.
The water sector is notoriously slow and resistant to adopt new technologies, partly due to existing ossified work flows. Coupled with an adverse data sharing culture, innovation adoption can be extremely slow and incremental. Further, despite the current buzz, AI technology can be perceived as a ‘threat to jobs’ (as with all sectors). Water utilities are beginning to deploy LLM-powered chatbots to provide customers with instant, context-aware support and engagement and to equip internal teams with tools that can quickly summarise compliance documents or retrieve maintenance procedures. Major industry bodies, including the American Water Works Association (AWWA) and the Water Environment Federation (WEF), have initiated research to establish best practices for GenAI, signaling a sector-wide move toward adoption. Forward-looking water utilities, including those in the UK, EU and North America, have started piloting generative AI and LLM solutions in real operations. A notable case is DC Water (Washington, D.C.’s water utility) which has actively encouraged its staff to use tools like ChatGPT for daily tasks such as writing reports, coding, and analysing data, establishing an internal community of practice to explore GenAI's potential. They have also applied them for improving customer engagement (e.g. chatbots for customer queries). This underscores that large utilities are beginning to deploy LLMs as knowledge management and decision support tools internally, albeit with cautious governance. Some utilities are also exploring custom LLMs – training smaller models on internal reports, maintenance logs, and design manuals so that staff can query these via chatbot and instantly retrieve answers with context saving staff time on tedious tasks. In essence, the technical trend is towards hybrid systems: pairing LLMs with structured data sources, or combining physics-based models with generative ML, to leverage the strengths of both.
A white paper [68] published by the UK Water Partnership (UKWP) highlights how AI is poised to revolutionise the water industry and describes several real-world AI/GenAI use cases being developed. It highlights that if AI is pointed at unstructured data without first improving the dataset standard, spurious results will likely follow. They recommend only pointing LLMs at curated data – by considering ‘fine-tuning’ capabilities, i.e., by a human resource that is capable of training models appropriately.
In 2025, the UK Water Industry Research (UKWIR) launched an AI-powered search tool using a customized large-language model (Llama) trained on their own library of reports to provide contextual search and generative AI summaries specific to water research [69]. The UKWIR’s tool is hosted onsite and trained exclusively on UKWIR’s own library of reports and tools.

5.2. Roadmap

Figure 6 provides a high level roadmap for considering when deploying GenAI in real world applications in the water sector.
  • Key areas:
  • Foundations and governance. Foundations and governance provide the strategic bedrock for GenAI deployment in the water sector. Establishing a clear business case is essential, ensuring that investment delivers measurable benefits in efficiency, resilience, or customer service. Alignment with regulators safeguards compliance, while ethical frameworks address bias, fairness, and transparency. Building trust in AI requires proactive communication and embedding responsible AI practices from the outset. Crucially, success rests on positioning AI as a partner to human expertise, enabling collaboration rather than substitution.
  • Data and infrastructure. Data readiness entails ensuring quality, availability, and interoperability across diverse sources such as SCADA, GIS, and asset management systems. Establishing targeted governance frameworks—focused on specific use cases—supports compliance without stifling innovation. Cloud, edge, or hybrid architectures must be evaluated for scalability, latency, and resilience, with cybersecurity protections built into every layer. With so much data available, it can be overwhelming to identify the most valuable insights for decisions. If data is machine readable in knowledge repositories then humans can be AI assisted to be more productive and effective. By addressing data fragmentation and prioritising metadata and standards, utilities can build robust pipelines that allow AI to process, contextualise, and integrate operational data into actionable intelligence.
  • Model development and deployment. Generative AI is emerging as a ‘game changer’ by democratising access to complex AI tools through natural language interactions, enabling intuitive communication with complex systems and reducing the steep learning curve associated with traditional software and advanced analytics [19]. Model development involves the systematic selection, adaptation and validation of generative AI tools for water applications. Choosing between commercial and open-source large language models requires balancing transparency, licensing and support. Fine-tuning models with sector-specific documentation enhances contextual accuracy. RAG offers an effective mechanism to reduce hallucinations and ground outputs in verified sources. Rigorous benchmarking and validation ensure robustness and reliability, while modular deployment strategies—whether on-premises, cloud-based, or via APIs—facilitate integration into operational workflows and maintain flexibility as foundation models evolve.
  • Use-cases and applications. The true value of GenAI lies in its application to priority use cases that deliver tangible benefits. When considering GenAI, keep an open mind – it is not all about chat functions. For example, incorporating AI into reporting/ dashboard tools, or exploring how automation and Agentic AI could be used to enhance business processes and decision-making. In water quality monitoring, AI can synthesise laboratory data, regulatory limits and historical trends to flag emerging risks. Asset management and predictive maintenance are strengthened by analysing sensor data to anticipate failures and optimise investment. Customer service can be enhanced through AI-enabled chatbots that provide clear responses to queries. Furthermore, regulatory compliance activities benefit from automated report generation, structured audit trails and streamlined document retrieval, collectively driving efficiency, transparency, and resilience.
  • Human-in-the-Loop. HITL/HOTL approaches ensure that AI deployment remains guided by expert judgment and domain-specific oversight. Rather than replacing operators, GenAI functions as a co-pilot, augmenting their capacity to interpret complex datasets and make informed decisions. Structured frameworks for human oversight safeguard against errors and reinforce accountability. Developing hybrid expertise—engineers proficient in both water systems and AI— will enable organisations to critically assess outputs. Fusion skills, particularly judgement integration, support the evaluation of AI suggestions for trustworthiness, and relevance, ensuring that GenAI systems complement, rather than supplant, human analytical and ethical capabilities. This collaborative intelligence, where human ingenuity combines with AI systems, consistently outperforms either humans or machines working alone and helps address and mitigate job fears as human engineering experience is still essential.
  • Change and adoption. Successful GenAI integration demands comprehensive change and adoption strategies. Users need to be involved in the process as the process: it should be about what people need from AI, not the technology itself. Integration and change management are key to ensuring any new AI solution is properly adopted and embedded in processes. Stakeholder engagement—spanning regulators, operators, and customers—fosters legitimacy and confidence. Upskilling programmes in AI literacy will equip staff to use tools effectively, reducing resistance and building organisational trust. Building systems that excel at preserving institutional knowledge by processing historical records and creating accessible repositories of operational expertise is particularly valuable during workforce transitions. Change management must also address cultural concerns, including fears of redundancy or technological disruption, by framing AI as an enabler of higher-order and more productive work. Incremental adoption through pilots builds confidence and delivers demonstrable value, while embedding AI into business processes requires strong leadership to champion innovation and sustain momentum across the organisation.
  • Scaling and future outlook. Scaling GenAI from pilots to sector-wide adoption requires careful planning and sustained monitoring. Early projects should focus on quick wins, delivering measurable improvements that secure organisational confidence. Continuous retraining and adaptation are necessary to maintain accuracy, particularly as new foundation models and regulatory requirements emerge. Over time, adoption can progress from reactive generative tools towards agentic AI systems capable of autonomous goal-setting and adaptive strategy.

5.3. Challenges and Future Developments

Generative AI is poised to play an expanding role in the water industry, but its successful widespread adoption will require navigating several challenges and innovating in key areas. Challenges relate not only to technical feasibility but also to regulatory, ethical, and environmental considerations. Addressing these issues will be essential for ensuring that AI fulfils its potential as a transformative force in water management.
A primary challenge lies in the development of domain-specific models. While the prospect of water-focused foundational models—trained on regulations, design standards, and operational datasets—is compelling, progress is hindered by data scarcity and fragmentation. Many utilities retain data in silos, non-English documentation is underrepresented, and quality standards are inconsistent across regions. Collaborative data-sharing initiatives, such as the EU’s WATERVERSE platform [70], alongside standardisation efforts, will be crucial to enabling the creation of robust and representative training corpora. Without such measures, model bias and limited generalisability remain persistent risks.
The application of generative AI for real-time decision support further illustrates the promise and complexity of this technological shift. Embedding models into treatment plant controls or distribution network monitoring could enable predictive optimisation of dosing regimes, emergency response and adaptive network management. However, scalability and reliability are critical barriers: models must function across city-wide sensor networks without sacrificing speed or accuracy. Moreover, the conservatism of the water sector, rooted in its public health responsibilities, demands rigorous validation. HITP/HOTL oversight will remain indispensable until automated systems demonstrate consistent trustworthiness. Governance through this oversight can be retained while automation executes guard-railed, pre-approved actions, with post-action audit trails to build trust and expand autonomy over time. Moving beyond alarm-driven operations, GenAI enables proactive management: forecasting emerging risks, recommending pre-emptive set-point adjustments and scheduling interventions before KPIs drift toward a breach. In a proactive mode, digital twins and prescriptive analytics could continually simulate “what-if” scenarios, ranking interventions by impact on compliance risk, resilience and costs.
Closely tied to these technical hurdles is the issue of explainability and trust. The black box nature of many AI systems, particularly ANNs is a long standing issue. The opacity of deep learning systems poses difficulties in regulated environments, where transparency and accountability are essential. Explainable AI (XAI) techniques will therefore play a pivotal role in translating complex model outputs into interpretable insights, enabling experts to validate predictions, trace contaminant pathways and design appropriate interventions. The absence of interpretability risks undermining regulatory approval, operator confidence and public trust.
The ethical, social, and regulatory dimensions of AI adoption further complicate implementation. A secure regulatory environment is crucial, as legal grey areas and ethical quandaries regarding data breaches, copyright infringement, or DEI rules can lead to cautious industries like the water sector to adopt a "wait-and-see" approach. Data privacy, accountability for AI-driven outcomes, and the prevention of inequitable service allocation are pressing concerns. If training data embeds structural biases, AI systems may inadvertently prioritise certain communities exacerbating disparities. Regulators are likely to introduce new requirements, including auditability, certification and ethical guidelines for AI deployment in critical water services. Proactive governance frameworks will be necessary to align innovation with principles of equity and safety.
Another emerging issue is the resource efficiency of AI systems themselves. Training and running large generative models consume substantial energy and water, particularly for cooling data centers. Estimates suggest that individual queries to state-of-the-art models already entail measurable water use, raising questions of sustainability. Adopting “green AI” principles—such as energy-efficient architectures, solar-powered data centers and intelligent scheduling of computation—will be vital for reconciling AI’s benefits with environmental stewardship. There are even opportunities for innovation at the water–compute nexus, for example through utilities hosting edge infrastructure cooled by treated wastewater.
Equally important are the human and workforce dimensions. Generative AI is unlikely to replace water professionals but it will augment their roles by automating routine tasks and providing decision support. This shift will require significant upskilling and a cultural reframing of AI from threat to collaborator. Structured training programmes, integration of AI into engineering curricula and communities of practice will be essential in preparing the workforce for collaborative intelligence environments.
Looking ahead, several developments can be anticipated. Strategic integration of AI will move beyond the hype cycle, driving innovation in business models and operational practices. The rise of agentic AI promises more autonomous workflows, capable of proactively managing networks and responding dynamically to changing conditions. Synthetic data generation offers a solution to the scarcity of high-fidelity training datasets, particularly for rare events. As enterprise AI becomes increasingly democratized through open-source initiatives, accessibility and adaptability will improve, reducing barriers to adoption across organizations of varying sizes.
In parallel, adaptive organisational cultures will be required to sustain these innovations. Institutions must embrace continuous learning, experimentation and regulatory engagement to safely integrate AI. Transparency, accountability, and ethical safeguards will remain foundational to ensuring public confidence in AI-enhanced water services.
In summary, the trajectory of generative AI in the water sector is characterised by profound potential tempered by equally significant challenges. The future will demand balanced progress: the development of domain-specific models underpinned by robust data infrastructures; real-time decision support coupled with validation and explainability; proactive regulatory frameworks to safeguard ethical use; and sustainable computing practices aligned with environmental goals. Above all, success will hinge on cultivating a skilled and adaptive workforce capable of co-creating solutions with AI. Together, these measures will define whether AI emerges not merely as a technological tool, but as a trusted collaborator in advancing the resilience, equity, and sustainability of global water management.

6. Conclusions

Water utilities are investing in new technologies that improve customer service, enhance efficiency and drive resilience. Technological advancements allow water companies to gather more information about their networks and assets than ever before. Extra data provides new opportunities for asset maintenance and event analytics. Digital water will bring automation and connect the sector to the Internet of Things. A smart water network is a fully integrated set of products, solutions and systems that enable water utilities to remotely and continuously monitor and diagnose problems, pre-emptively prioritise and manage maintenance issues and remotely control and optimise all aspects of the water distribution network using data-driven insights.
This digital re-imagination of the sector [71] will enable a broad spectrum of outcomes from improved efficiencies to optimised asset management across providers and potentially new business models such as consolidated multi-utility retail operations. While there is an increase of digital adoption in water, the sector still lags behind other industries in integrating new, smart technologies. Water companies can benefit from the lessons learned in other sectors (such as energy) and the established best practices and network infrastructures. As technology evolves, the price of smart devices decreases, the functionality increases, and by piggy-backing other sectors, an accelerated adoption rate for realisation of benefits in the water management ecosystem can be achieved.
The water sector is not generally at the forefront of the technology adoption curve and thus not often seen as an attractive industry when compared to other leading-edge industries for data science professionals. In contrast, new digital technologies are a hot topic, particularly as AI begin to proliferate into industrial application. This fact means that a career in the water industry generally is not a top priority for data scientist professionals and attracting the right type of talent can be difficult. Start-ups in the water sector can rarely compete financially with Google or OpenAI for hiring coders, graphic designers and technology engineers.
With AI becoming more prevalent across industries, there is a growing need to make it broadly available, accessible and applicable to engineers and scientists with varying specialisations. Engineers, not just data scientists, will drive the experimentation and adoption of AI in water industry applications. Complexity of larger datasets, embedded applications and bigger development teams will drive solution providers towards interoperability, greater collaboration, reduced reliance on IT departments and higher productivity workflows. The key to realising these benefits will be addressing the challenges head-on: ensuring data quality, upskilling the workforce, maintaining ethical guardrails, and sharing successes and failures openly across the water community. With thoughtful implementation, generative AI and LLMs could become invaluable tools in building a more resilient, efficient, and smart water future.
In conclusion, the water sector is at a pivotal moment. Innovators are disrupting the old business models and there is little place for complacency in the water industry, with threats such as decentralised and distributed technology arising. Disruptive innovation can be beneficial. There is no reason why water companies cannot learn from insurgents, engage with new thinking, and embrace innovation to update their business models to deliver new solutions that benefit all. The evolution from traditional data analytics to the sophisticated capabilities of generative and agentic AI offers unprecedented opportunities to build more resilient, efficient, and sustainable water systems. Realising this future will require a concerted effort to address the foundational challenges of data, skills, and governance, fostering a collaborative ecosystem where technology enhances, rather than replaces, human expertise.

Author Contributions

Stephen R. Mounce primarily wrote the manuscript, conducted the review and developed Generative AI tools in the case study. Richard B. Mounce contributed with design and developing tools in the case study and Joby B. Boxall contributed on concepts and water quality expertise. All authors have read and agreed to the published version of the manuscript.

Acknowledgments

This work was supported by the ACQUIRE project funded by the Ofwat Discovery Challenge Innovation Fund competition [61].

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mounce, S. R. (2020). Data Science Trends and Opportunities for Smart Water Utilities. In D. Han, S. Mounce, A. Scozzari, F. Soldovieri, & D. Solomatine (Eds.), ICT for Smart Water Systems: Measurements and Data Science. Handbook of Environmental Chemistry. Springer, Cham. [CrossRef]
  2. Bettin, A. (2023). Digital transformation for the water industry: how a data-driven business intelligence platform can improve operations. Water Practice & Technology, 18(7), 1599-1607. [CrossRef]
  3. Zolghadr-Asli, B., Ferdowsi, A.and Savić, D. (2024). A call for a fundamental shift from model-centric to data-centric approaches in hydroinformatics. Cambridge Prisms: Water, 2, e7, 1–4. [CrossRef]
  4. LeCun, Y., Bengio, Y., & Hinton, G. (2015). Deep learning. Nature, 521(7553), 436–444. [CrossRef]
  5. Taiwo, R., Yussif, A.-M., & Zayed, T. (2025). Making waves: Generative artificial intelligence in water distribution networks: Opportunities and challenges. Water Research X, 28, 100316. [CrossRef]
  6. Goodfellow, I. J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., & Bengio, Y. (2014). Generative Adversarial Networks. arXiv preprint arXiv:1406.2661. [CrossRef]
  7. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30, 5998–6008.
  8. Wala, J., & Rüppel, U. (2025). Leveraging Large Language Models for Synthetic Data Generation for AI Applications for Water Infrastructure Management. Paper presented at EG-ICE 2025. [CrossRef]
  9. Pursnani, V., Ramirez, C. E., Sermet, Y., & Demir, I. (2024). HydroSuite-AI: Facilitating Hydrological Research with LLM-Driven Code Assistance. EarthArXiv preprint.
  10. World Economic Forum & Capgemini. (2024). Navigating the AI Frontier: A Primer on the Evolution and Impact of AI Agents.
  11. Agentic Platforms and APIs: Global Market Outlook. (2025).
  12. International Data Corporation (IDC). (2024). The global impact of artificial intelligence on the economy and jobs. IDC. https://my.idc.com/getdoc.jsp?containerId=prUS52600524. Accessed: September 2025.
  13. World Economic Forum. (2023). Digital transition framework. World Economic Forum. https://www3.weforum.org/docs/WEF_Digital_Transition_Framework_2023.pdf. Accessed: September 2025.
  14. Mounce, S. R., Shepherd, W., Boxall, J. B., Horoshenkov, K. & Boyle, J. (2021). Autonomous robotics for water and sewer networks. Hydrolink, IAHR. No. 2/2021, pp.55-62.
  15. Li, Y., Wang, H., & Dang, L. M. (2022). Vision-Based Defect Inspection and Condition Assessment for Sewer Pipes: A Comprehensive Survey. Sensors, 22(7), 2722. [CrossRef]
  16. Kumar, P. T. and Mishra, K. (2024). Deep learning in hydrology and water resources disciplines: concepts, methods, applications, and research directions. Journal of Hydrology, 628, 130458. [CrossRef]
  17. Fu, G., Jin, Y., Sun, S., Yuan, Z., & Butler, D. (2022). The role of deep learning in urban water management: A critical review. Water Research, 223, 118973. [CrossRef]
  18. World Economic Forum & Accenture. (2025). AI in Action: Beyond Experimentation to Transform Industry.
  19. Ramachandran, A., Neergaard, T. F. B., Maier, A., & Bayer, S. (2025). Digitization Framework of Water Utility Documents for Digital Twins. Paper presented at CCWI 2025 - 21st Computing & Control for the Water Industry Conference.
  20. Sela, L., Housh, M., & Vairavamoorthy, K. (2025). Making waves: The potential of generative AI in water utility operations. Water Research, 272, 122935. [CrossRef]
  21. Velayudhan, N. K., Devidas, A. R. & Savić, D. A. (2025). Generative AI for spatio-temporal multivariate imputation and demand prediction in water distribution systems. Results in Engineering, 27, 106178. [CrossRef]
  22. Wala, J. & Rüppel, U. (2025). Leveraging large language models for synthetic data generation for AI applications for water infrastructure management. In A. Moreno-Rangel & B. Kumar (Eds.), EG-ICE 2025: AI-Driven Collaboration for Sustainable and Resilient Built Environments: Proceedings of the 32nd International Workshop on Intelligent Computing in Engineering (ISBN 978-1-914241-82-6). University of Strathclyde Publishing. [CrossRef]
  23. Taloma, R. J. L., Cuomo, F., Comminiello, D., & Pisani, P. (2025). Machine learning for smart water distribution systems: Exploring applications, challenges and future perspectives. Artificial Intelligence Review, 58, 120. [CrossRef]
  24. Wang, J., Fu, G., & Savić, D. A. (2025). Leveraging large language models for automating water distribution network optimization. Water Research. Advance online publication. https://www.sciencedirect.com/science/article/pii/S004313542501440X.
  25. Taormina, R., Kyritsakas, G., Sourlos, N., van der Werf, J. A., & Soodan, A. (2025). DRACO: A Large Language Model Initiative for Urban Water Systems. Paper presented at CCWI 2025 - 21st Computing & Control for the Water Industry Conference, Sheffield, UK.
  26. Miao, C., Hu, J., Moradkhani, H., & Destouni, G. (2024). Hydrological Research Evolution: A Large Language Model-Based Analysis of 310,000 Studies Published Globally Between 1980 and 2023. Water Resources Research, 60, e2024WR038077. [CrossRef]
  27. Collier, S. A., Deng, L., Adam, E. A., Benedict, K. M., Beshearse, E. M., Blackstock, A. J., … Beach, M. J. (2021). Estimate of burden and direct healthcare cost of infectious waterborne disease in the United States. Emerging Infectious Diseases, 27(1), 140–149. [CrossRef]
  28. Mounce, S. R., Gaffney, J. W., Boult, S. & Boxall, J. B. (2015). Automated data driven approaches to evaluating and interpreting water quality time series data from water distribution systems. ASCE Journal of Water Resources Planning and Management, Vol 141 (11), pp. 1-11. [CrossRef]
  29. Gleeson, K., Husband, S., & Boxall, J. B. (2025). Multi-parameter multi-sensor data fusion for drinking water distribution system water quality management. AQUA - Water Infrastructure, Ecosystems and Society, 74(7), 451–465. [CrossRef]
  30. Li, Z., Ma, W., Zhong, D., Ma, J., Zhang, Q., Yuan, Y., Liu, X., Wang, X., & Zou, K. (2024). Applications of machine learning in drinking water quality management: A critical review on water distribution system. Journal of Cleaner Production, 481, 144171. [CrossRef]
  31. Skipworth, P.J., Saul, A. & Machell, J. (1999). Predicting water quality in distribution using artificial neural networks, ICE J., Water Marit. Energy 136 (1) (1999) 1–8. [CrossRef]
  32. Singh, K. P., Basant, A., Malik, A., & Jain, G. (2009). Artificial neural network modeling of the river water quality—A case study. Ecology and Modelling, 220(5), 888–895. [CrossRef]
  33. Zhi, W., Feng, D., Tsai, W.-P., Sterle, G., Harpold, A., Shen, C., & Li, L. (2021). From hydrometeorology to river water quality: Can a deep learning model predict dissolved oxygen at the continental scale? Environmental Science & Technology, 55(4), 2357–2368. [CrossRef]
  34. Tripathy, K. P., & Mishra, A. K. (2024). Deep learning methods in hydrology and water resources: A critical review. Journal of Hydrology, 628, 130458. [CrossRef]
  35. Bi, J., Zhang, L., Yuan, H., & Zhang, J. (2023). Multi-indicator water quality prediction with attention-assisted bidirectional LSTM and encoder-decoder. Information Sciences, 625, 65–80. [CrossRef]
  36. Kyritsakas, G., Boxall, J., & Speight, V. (2024). A data-driven predictive model for disinfectant residual in drinking water storage tanks. AWWA Water Science, 6(3), Article e1376. [CrossRef]
  37. Eliades, D. G., Vrachimis, S. G., Moghaddam, A., Tzortzis, I., & Polycarpou, M. M. (2023). Contamination event diagnosis in drinking water networks: A review. Annual Reviews in Control, 55, 420–441. [CrossRef]
  38. Gong, J., Guo, X., Yan, X. & Hu, C. (2023). Review of urban drinking water contamination source identification methods. Energies, 16(2), 705. [CrossRef]
  39. Dogo, E. M., Nwulu, N. I., Twala, B., & Aigbavboa, C. (2019). A survey of machine learning methods applied to anomaly detection on drinking-water quality data. Urban Water Journal, 16(3), 235–248. [CrossRef]
  40. Qian, K., Jiang, J., Ding, Y., & Yang, S. (2020). Deep learning based anomaly detection in water distribution systems. In 2020 IEEE International Conference on Networking, Sensing and Control (ICNSC) (pp. 1–6). IEEE. [CrossRef]
  41. Kadiyala, L., Mermer, O., Samuel, D. J., Sermet, Y., & Demir, I. (2024). The Implementation of Multimodal Large Language Models for Hydrological Applications: A Comparative Study of GPT-4 Vision, Gemini, LLaVa, and Multimodal-GPT. Hydrology, 11(9), 148. [CrossRef]
  42. Kalteh, A. M., Hjorth, P., & Berndtsson, R. (2008). Review of the self-organizing map (SOM) approach in water resources: Analysis, modelling and application. Environmental Modelling & Software, 23(7), 835–845. [CrossRef]
  43. Mounce, S. R., Douterelo, I., Sharpe, R.& Boxall, J. B. (2012). A bio-hydroinformatics application of self-organizing map neural networks for assessing microbial and physico-chemical water quality in distribution systems, Proceedings of 10th international conference on hydroinformatics, Hamburg, Germany.
  44. Mounce, S. R., Sharpe, R., Speight, V., Holden, B. & Boxall, J. B. (2014). Knowledge discovery from large disparate corporate databases using self-organising maps to help ensure supply of high quality potable water. Proceedings of 11th international conference on hydroinformatics, New York, USA.
  45. Speight, V., Mounce, S. R. and Boxall, J. B. (2019). Identification of the causes of drinking water discolouration from machine learning analysis of historical datasets. Environmental Science: Water Research & Technology (Royal Society of Chemistry), Vol 5, pp 747 – 755. [CrossRef]
  46. Mounce, S. R., Ellis, K., Edwards, J., Speight, V., Jakomis, N. and Boxall, J. B. (2017). Ensemble decision tree models using RUSBoost for estimating risk of iron failure in drinking water distribution systems. Water Resources Management,Vol 31 (5), pp 727-738. [CrossRef]
  47. Blokker, E. J. M., Furnass, W. R., Machell, J., Mounce, S. R., Schaap, P. G. & Boxall, J. B. (2016). Relating water quality and age in drinking water distribution systems using self-organising maps. Environments 3(2):10. [CrossRef]
  48. Mounce, S. R., Blokker, E. J. M., Husband, S. P., Furnass, W. R., Schaap, P. G. & Boxall, J. B. (2016). Multivariate data mining for estimating the rate of discoloration material accumulation in drinking water distribution systems. IWA J Hydroinf 18(1):96–114. [CrossRef]
  49. Mounce, S. R., Machell, J. & Boxall, J. B., (2012). Water quality event detection and customer complaint clustering analysis in distribution systems. IWA Journal of Water Science and Technology: Water Supply, Vol. 12 (5), pp. 580-587. [CrossRef]
  50. Sammon, J. W. (1969). A nonlinear mapping for data structure analysis. IEEE Trans Comput,C-18(5):401–409. [CrossRef]
  51. Furnass, W. R., Mounce, S. R., Husband, P. S., Collins, R. P. & Boxall, J. B. (2019). Calibrating and validating a combined accumulation and mobilisation model for water distribution system discolouration using particle swarm optimisation. Journal of Smart Water, Springer (Open Access),Vol 4 (3), pp 1-24. [CrossRef]
  52. Li, Z., Liu, H., Zhang, C., & Fu, G. (2023). Generative adversarial networks for detecting contamination events in water distribution systems using multi-parameter, multi-site water quality monitoring. Environmental Science & Ecotechnology, 14, 100231. [CrossRef]
  53. Samuel, D. J., Sermet, Y., Cwiertny, D. & Demir, I. (2024). Integrating vision-based AI and large language models for real-time water pollution surveillance. Water Environment Research, 96(8), e11092. [CrossRef]
  54. (2023). WaterWorld. Retrieved from https://www.waterworld.com/smart-water-utility/press-release/14295040/klir-unveils-chatgpt-integration-for-its-water-utility-management-platform. Accessed: September 2025.
  55. Pursnani, V., Erazo Ramirez, C., Sermet, M. Y., & Demir, I. (2024). HydroSuite-AI: Facilitating hydrological research with LLM-driven code assistance. EarthArXiv. [CrossRef]
  56. Samuel, D. J., Sermet, M. Y., Mount, J., Vald, G., Cwiertny, D., & Demir, I. (2024). Application of Large Language Models in Developing Conversational Agents for Water Quality Education, Communication and Operations. EarthArXiv, 7056. [CrossRef]
  57. Pursnani, V., Sermet, Y., Kurt, M., & Demir, I. (2023). Performance of ChatGPT on the US fundamentals of engineering exam: Comprehensive assessment of proficiency and potential implications for professional environmental engineering practice. Computers and Education: Artificial Intelligence, 5, 100183. [CrossRef]
  58. Ren, Y., Zhang, T., Dong, X., Li, W., Wang, Z., He, J., Zhang, H., & Jiao, L. (2024). WaterGPT: Training a large language model to become a hydrology expert. Water, 16(21), 3075. [CrossRef]
  59. Xu, B., Wu, G., Li, Z., Xu, G., Zeng, H., Tong, R., & Ng, H. Y. (2025). Towards domain-adapted large language models for water and wastewater management: Methods, datasets and benchmarking. npj Clean Water, 8, 82. [CrossRef]
  60. DWI (2025). Drinking Water 2024: The Chief Inspector’s report for drinking water in England. Accessed: Sep 2025.
  61. Ofwat (2024). https://waterinnovation.challenges.org/news/discovery-winners/ Accessed: Sep 2025.
  62. Mounce, S. R., Mounce, R. B. and Boxall, J. B. (2016). Case-based reasoning to support decision making for managing drinking water quality events in distribution systems. Urban Water Journal, 13 (7), pp. 727–738. [CrossRef]
  63. Mounce, S. R., Mounce, R. B. & Boxall, J. B. (2025). Case-based reasoning and Generative AI decision support tools for managing drinking water quality events in distribution systems. Proceedings of the 21st International Computing and Control for the Water Industry (CCWI2025), Sheffield, UK, 1st to 3rd September 2025.
  64. Kyritsakas, G., Boxall, J.B. and Speight, V. (2023). A Big Data framework for actionable information to manage drinking water quality. Journal of Water Supply: Research and Technology-Aqua . [CrossRef]
  65. Leyden, T. & Hart-Davis, G. (2025). Agentic AI For Dummies®, Integrail Special Edition. John Wiley & Sons, Inc. Agentic Platforms and APIs: Global Market Outlook. (2025).
  66. Bresnahan, J., Gauthier, C., Gunkel, M., Hafskjold, H., Järvinen, M., Lindberg, K., Makropoulos, C., Neergaard, T. F. B., Niemi, P., Novara, V., O’Connor, M., Savic, D., Spreng, L., Taormina, R., Tondera, K., Urich, C., & Virtanen, V. (2022). IWA Trends and challenges in the water sector: A selection of insights from IWA Specialist Groups 2020–2021. Urban Water Journal, 19(5), 458–483.
  67. Daniel, I., Ajami, N. K., Castelletti, A., Savic, D., Stewart, R. A., & Cominola, A. (2023). A survey of water utilities’ digital transformation: drivers, impacts, and enabling technologies. npj Clean Water, 6(51). [CrossRef]
  68. The UK Water Partnership & Aiimi. (2025). AI Within Reach (White Paper). The UK Water Partnership. https://www.theukwaterpartnership.org/publications/ukwp-and-aiimi-ai-white-paper.
  69. Water Magazine. (2025). AI tool transforms access to UKWIR research. Water Magazine. https://www.watermagazine.co.uk/2025/08/05/ai-tool-transforms-access-to-ukwir-research/ Accessed: September 2025.
  70. Torello, M., Seshan, S., van der Meulen, S. & Vamvakeridou-Lyroudia, L. (2024). WATERVERSE: Strategies in stakeholder engagement for the digitalization of a water data management ecosystem. Engineering Proceedings, 69(1), 209. [CrossRef]
  71. Chaudhary, M. & Pandey, S. (2025). Water Sector 5.0: Harnessing digital transformation opportunities and mitigating risks. Water Resources Management. Advance online publication. [CrossRef]
Figure 1. Rise of publications on generative AI in water resources/hydrology (2021–2025). Data gathered via OpenAlex/Crossref API searches (using year filters and keyword queries). 2025 is partial (up to Q3).
Figure 1. Rise of publications on generative AI in water resources/hydrology (2021–2025). Data gathered via OpenAlex/Crossref API searches (using year filters and keyword queries). 2025 is partial (up to Q3).
Preprints 177728 g001
Figure 2. The evolution of AI: from static search to autonomous agents.
Figure 2. The evolution of AI: from static search to autonomous agents.
Preprints 177728 g002
Figure 3. Phases dealing with a significant water quality event.
Figure 3. Phases dealing with a significant water quality event.
Preprints 177728 g003
Figure 4. RAG workflow for application.
Figure 4. RAG workflow for application.
Preprints 177728 g004
Figure 5. Using the AI assistant to generate a response plan.
Figure 5. Using the AI assistant to generate a response plan.
Preprints 177728 g005
Figure 6. Roadmap for GenAI adoption in the water industry.
Figure 6. Roadmap for GenAI adoption in the water industry.
Preprints 177728 g006
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated