2.1. Bibliometric Analysis
This analysis aims to identify the thematic relevance over the years in the literature, based on the number of publications and their distribution per year. It is essential to note that the results presented in this section are for qualitative analysis only. The database used was the Web of Science, and different queries were used in the advanced search to plot the results. The choice of this database is due to its selectivity and data quality, since the Web of Science makes use of metrics to filter out articles with low impact [
10]. Thus, since the focus is on the qualitative analysis, the macro tendencies are expected to be the same for different databases. The analysis presented in this section refers to the values found on March 27th, 2026.
To analyze the interest in control performance monitoring and assessment, for all kinds of controllers, the query
ALL=("control* performance monitoring" OR "control* performance assessment" OR "control loop performance assessment") was used, returning 495 results and the distribution presented in
Figure 1. The results indicate an increase in publication activity from the 1990s to the 2010s, with the latter representing the period of highest average output. This reflects the increase in monitoring and control performance. In the current decade, the average number of publications remains higher than in the early 2000s, but it is lower than the peak observed in the 2010s. However, the current decade is still incomplete.
By adding the query
AND ("model predictive control*" OR "MPC" OR "NMPC") to the previous one to restrict the results only to model predictive controllers, the results drop to 79 publications, as can be observed in
Figure 2. A similar decade-wise trend is observed when restricting the analysis to MPC-based approaches, with an increase in publication activity from the 2000s to the 2010s. However, the overall number of publications is significantly lower, reflecting the more specific scope of MPC within the broader field. Thus, there is still room to conduct further assessment studies on the performance assessment of this kind of controller, which may reflect the challenge in finding methodologies for monitoring model predictive controllers.
To analyze the presence of machine learning (ML) techniques in research about control performance monitoring, the query
ALL=(("machine learning" OR "artificial intelligence" OR "neural network") AND ("control* performance monitoring" OR "control* performance assessment" OR "control loop performance assessment")) was used. Just 33 publications were found, with the distribution presented in
Figure 3. From those publications found, only 4 include
"model predictive control*" OR "advanced control*" OR "MPC" OR "NMPC" in their text.
However, when the number of publications for machine learning applied to fault detection and diagnosis is analyzed, by using the query
ALL=(("machine learning" OR "artificial intelligence“ OR “neural network”) AND ("fault detection and diagnosis" OR "fault detection" OR "fault diagnosis")), the results showed a total of 25,717 publications.
Figure 4 shows that there is also a crescent tendency in the number of publications in this field, also reflecting the popularity of the field and the interest in using ML tools to detect and diagnose faults in different applications.
The integration of control performance assessment and FDD is also uncommon in the literature. By using the query
ALL = (("fault detection and diagnosis" OR "fault detection" OR "fault diagnosis") AND ("control* performance monitoring" OR "control* performance assessment")), only 38 articles were found, with the distribution presented in
Figure 5. From those results, only 10 include
"predictive control*" in their text. This shows that there is potential to explore FDD methodologies for MPC performance assessment, which is still underexplored.
The results presented so far show that, even though there is an increased interest in studying predictive controllers and ML applied to FDD, there are still few studies applied to the performance assessment of predictive controllers, especially using ML algorithms. According to Özdemir and Yıldırım [
11], many ML algorithms have been used for decades without referring to ML explicitly. Venkatasubramanian et al. [
12], for instance, discussed the use of artificial neural networks for fault detection and diagnosis without referring to it as a machine learning methodology.
Hence, variations in the terminology used to describe machine learning methods could potentially influence the results of the bibliometric analysis. However, when a more wide query was implemented (ALL=(("machine learning" OR "artificial intelligence" OR "neural network" OR "support vector machine" OR "random forest" OR "boosting") AND ("control* performance monitoring" OR "control* loop performance monitoring" OR "closed*loop performance monitoring" OR "control* performance assessment" OR "control* loop performance assessment" OR "closed*loop performance assessment"))), the number of publications increased just from 33 to 36.
This evidences lack of investigation into the technical feasibility of using ML algorithms applied to the performance assessment of predictive controllers. There may be some reasons why there are so few studies about using ML algorithms to monitor and assess predictive controllers. Firstly, there is the challenge of labeling and distinguishing the "normal" dataset from the "abnormal" one. When there is an instrumentation problem, it is more evident and easier to take corrective actions than when there is an inappropriate tuning setting in an MPC, due to its complexity.
Moreover, when the lack of performance in an MPC is detected, it may be more interesting for the industry to correct it right away instead of identifying the issue. This is reflected in the number of publications mentioning adaptive control, which was more than 220 results only in 2025 (ALL=("model predictive control*" AND "adaptive control*")).
Even though there is a high interest in adaptive controllers, it is still important to evaluate the technical feasibility of using ML algorithms applied to the performance assessment of predictive controllers. For this reason, this work was elaborated to fill some of those gaps presented in the literature, by both exploring MPC performance degradation as a type of fault and by using ML models for the detection and diagnosis of root causes. In the next sections, a more detailed analysis was made of the methodologies available for monitoring and assessing predictive controllers.
2.3. Methodologies for Identification and Diagnosis
Although the methodologies described so far help determine whether the control system is healthy or underperforming, they usually do not provide the root cause of performance degradation. For this reason, some metrics were developed to provide a diagnosis of the process conditions. The main issue with diagnosis is the lack of a unique characteristic that determines whether the control system is under unmeasured and unpredictable disturbance, with a process fault, an instrumentation fault, a plant-model mismatch (PMM), or inappropriate tuning settings.
Even though instrumentation problems also cause MPC degradation, the key challenge in this type of controller is identifying issues in its internal configuration. The model is the crucial aspect of a predictive controller and affects its prediction accuracy. Due to the sake of computational efficiency, linear models are preferred in industrial applications, even though chemical processes are inherently nonlinear. However, changes in operational regions or process dynamics may lead to loss of model representability and, consequently, to degradation of MPC performance.
Once the model is degraded, it is required to update the model by performing a plant re-identification. However, this procedure requires intrusive plant tests, which have economic repercussions. Therefore, it is desired to identify only the part or the subsystem of the plant where a significant mismatch occurs. In addition, it is preferred to perform the re-identification when no other procedure is possible to improve MPC performance, since other solutions may limit the intervention required in the process. The most relevant issue in model assessment is determining whether the bad performance comes from a PMM or an unmeasured disturbance. Both cause similar effects in the process output. Nevertheless, the corrective action for an unmeasured disturbance might be less invasive than for a PMM. Thus, methodologies focused on identifying PMM were developed.
Badwe et al. [
24] propose a methodology based on the analysis of partial correlations between the model residuals and manipulated variables for the detection and isolation of PMM. The main purpose of this approach is to evaluate the correlation between the manipulated variables and the prediction error, isolating the effect of the closed-loop. In an open-loop scenario, the manipulated variables (MVs) are not affected by unmeasured disturbances, so they should not be correlated. A correlation between them when closed-loop effects are removed means that the prediction error is caused by the MVs, and, consequently, there is a PMM. Equation
8 shows the relationship between the prediction error
e and the uncorrelated manipulated variable
u, where
y and
are the plant and the model controlled variables, respectively, while
and
v are the PMM and the Gaussian disturbances acting on the process.
However, even though the Badwe et al. [
24] methodology uses only closed-loop operating data for the analysis and does not require intrusive tests on the plant, the system data must be excited. In the absence of target changes, it is not possible to determine the extent of mismatch, which is the main limitation of this methodology.
Botelho et al. [
25] developed another methodology to distinguish PMM from unmeasured disturbances. It is a statistical approach based on a nominal output estimation. Equation
9 represents how the nominal controlled variable
is estimated from the controlled variable
y, where
is the nominal output sensitivity transfer matrix and
is the simulated controlled variable of the nominal model.
The main purpose of Equation
9 is also to decouple PMM from controller configuration, since the effect of the PMM in the controlled system depends on the controller tuning. The relation between the nominal outputs and the nominal prediction errors
is then evaluated through the Pearson correlation between their statistical distribution coefficients
Z in a moving window, according to Equation
10. The statistical distribution coefficients considered are the skewness and kurtosis coefficients.
A high correlation between both variables indicates the presence of PMM. Otherwise, unmeasured disturbances are the cause of prediction errors. The main challenge in implementing this methodology is its dependency on the nominal output sensitivity transfer matrix , which industrial MPCs do not give explicitly.
Giraldo et al. [
26] have made an analysis of the Botelho et al. [
25] and the Badwe et al. [
24] methodologies. The critic relies on the limitations of those methods for real-time applications, given their relatively high computational cost. Furthermore, the assumptions underlying those methods, which rely on the feasibility of achieving an ideal model, limit their applicability in practical scenarios. Giraldo et al. [
26] have proposed an algorithm to distinguish PMM from unmeasured disturbances, trying to mitigate the limitations of the previous methods. However, the method focused on the Filtered Smith Predictor and is also based on the sensitivity transfer function
. Moreover, the methodologies for identifying PMM are focused on distinguishing it from an unmeasured disturbance, requiring other methods to discard other sources of performance loss.
2.4. Industrial Methodologies
Even though the methods discussed so far are very important, since they give evidence of the possibility of detecting and diagnosing poor performance in MPC, they are still not being used in industries. In this section, the methods that Aspen Technology, an industrial software company, implements to monitor advanced controllers are explored.
Aspen Watch Performance Monitor is Aspen Tech’s software for monitoring control loops. Its metrics are grouped into libraries. In version 14.2, the Basic library has 57 metrics, while the Extended library has 86. There are also a library focused on crude oil distillation and a library for quality analysis.
The Extended library has three main groups of metrics: the General’s, the Manipulated Variables’, and the Controlled Variables’ metrics. The General’s metrics evaluate the control system as a whole. This group includes metrics such as the percentage of time the control is on, the percentage of active MVs and CVs, and the percentage of CV constraint violations. The groups of MV and CV metrics include metrics about target deviation, the variable’s standard deviation, constraint violations, oscillation indices, the percentage each variable is on, prediction error, which is the difference between the model prediction and the variable’s measurement, and so on.
Many of the metrics implemented have a literature background. Regarding the variable oscillation indication, Aspen documentation does not supply this metric equation, only the information that the more oscillating the variable is, the closer the metric is to 1. In the literature, some metrics have been developed to detect oscillation in closed-loop systems. Thornhill and Hägglund [
27] have proposed an oscillation detector based on the IAE of the system, and it is also in a range
. Thus, it is likely that Aspen Watch follows a similar approach. The variable’s standard deviation, on the other hand, especially regarding the dependent variable, is aligned with the Harris Index. Even though the software does not use the Harris Index directly to monitor the control-loop, nor use the MVC theory as a benchmark, it does use the variability of the variables as metrics to be evaluated.
A single metric rarely provides a precise diagnosis. Consequently, Aspen Watch has an extensive range of metrics to draw information from the process data. While these metrics offer clues regarding the potential causes of diverging values, the number of possibilities they present means they do not eliminate the need for a deep investigation into the control loop.
2.5. Machine Learning Methodologies
Many researchers have already applied ML algorithms in the chemical engineering context, not only on abnormality detection, but also on signal processing and process modeling [
11]. Most applications in the chemical engineering context already focus on modeling, optimization, control, and monitoring [
28]. As mentioned before, control performance monitoring may also be part of the FDD spectrum. Thus, ML techniques already have a solid base in the academic context. As more of this kind of methodology is studied, it becomes more reliable and feasible for use in industrial applications.
Early detection and diagnosis of process faults while the plant is still operating in a controllable region can help avoid abnormal event progression and reduce productivity loss. Venkatasubramanian et al. [
12,
29,
30], in their review about FDD, already considered neural networks as a non-statistical method for feature extraction.
One of the conclusions of the survey conducted by Bauer et al. [
3] is that, for control-loop monitoring, the simpler the method, the more respondents find it useful. The respondents also asked for better guidance on corrective actions to be taken. de Campos et al. [
31] affirms that classic metrics used by Petrobras are not good to diagnose the root cause of the poor performance. It is necessary to use multiple techniques combined, together with a specialized team, to evaluate and correct control performance issues, and still, there are a lot of challenges to evaluate and diagnose effectively the multivariable model predictive controller performance.
Under this scenario, ML techniques have a great potential to diagnose control performance. As it was shown in
Section 2.1, research about ML has become popular in the last years. Different methods and models have been developed to address different types of applications, especially in chemical engineering [
11,
28,
32]. This evidence shows that ML models have great potential and are already under development in the PSE area. However, the use of ML tools restrictively for control-loops performance assessment is still under-evaluated. As shown in
Section 2.1, only few studies have been conducted to apply the use of ML tools in control performance monitoring, despite its potential. By using the classification task approach, it is possible to assess and diagnose control performance at once, instead of using combined methodologies such as the ones shown in the previous sections.
Most studies on the use of ML techniques to monitor controller performance have considered classic and SISO controllers. Pillay and Govender [
33] analyzed the use of a Multi-Class Support Vector Machine (SVM) classifier to detect and diagnose tuning issues in a proportional-integral (PI) controller. Grelewicz et al. [
34,
35] focused on developing an ML model that can distinguish between acceptable and poor PID performance, independent of the process, and without any additional learning stage.
In studies aimed at MPC, the key object of this study, neural networks have been used to identify the performance index [
36], based on the Minimum Variance Control benchmark. The approach was to estimate the minimum variance of the process for the Harris index. The models evaluated were a radial basis function network and a Laguerre network instead of using the ARMA model proposed by the original methodology [
8]. Even though this approach showed a great capacity to estimate the Harris Index of closed-loops, it did not explore the NN capacity to identify the degradation of the MPC performance.
Wang et al. [
37] also used neural networks, but directly as a classification problem. The approach was to identify MPC performance degradation due to noise variance change, model mismatch, control variables constraint saturation, and manipulated variables constraint saturation. The process evaluated was a two-tank liquid level, and the neural network used was an MLP, being compared to a SVM model. The inputs used were the manipulated and controlled variables, the control effort, and the control errors. Xu et al. [
38] evaluated the use of SVM as well, but with a Mahalanobis distance performance index as input. The proposal was to eliminate the correlation between process variables before the classification procedure. The evaluated systems were the Wood-Berry distillation column process and the two-tank liquid level process, with the same classification task performed by Wang et al. [
37]. Both studies explored the capacity of MLP to identify the loss of performance of an MPC, but the classes used, especially regarding constraint saturation, could be easily extracted from process data, without the need for a ML model. Furthermore, the classes explored do not give a precise diagnosis of the closed-loop condition, except for the model mismatch.
Loquasto and Seborg [
39] conducted a study that might be one of the most relevant in this thematic. They also employed the Wood-Berry process as a case study of a methodology using neural networks to monitor the performance of MPC systems. In this approach, MLP independent classifiers were used in cascade to detect and diagnose the MPC condition. The first MLP is responsible for classifying whether the process is in a normal or abnormal condition. If an abnormality is detected, the second MLP is responsible for detecting the presence of an abnormal disturbance. Finally, the third model is used for diagnosing PMM. The single four-class classifier (normal, disturbance, plant change, and both disturbance and plant change) methodology was also evaluated. Once a model-plant mismatch is detected, another MLP classifier is used to diagnose the specific sub-models that are inaccurate, also considering binary classifications. The MLP inputs are the manipulated and controlled variables, the control effort, the control errors, and the one-step prediction errors. The results presented showed that, in general, all models performed well. The disadvantage of using the multi-class classification is that each sample can belong to only one of the classes determined, while the multi-binary models can be arranged to allow the classification of two problems simultaneously. This study explored different arrangements of ML models and diagnoses focused on PMM. However, they used only one type of ML model and did not explore the possibility of the abnormality being from process instrumentation, which may cause similar effects in the closed-loop, or from the MPC tuning, which might require retuning, especially after operational changes. Furthermore, the process studied - the Wood-Berry process - is linear and has relatively few variables.
Some applications are more specific to each MPC degradation reason. Lu et al. [
40] have focused their study only on PMM. The ML tool evaluated was the SVM to discriminate PMM from noise model change. The problem was a binary classification, and the case study was a SISO system. The classification is based on the Finite Impulse Response (FIR) coefficients, which requires an identification procedure before the classification task.
Dambros et al. [
41], on the other hand, focused on oscillation detection. The ML algorithm evaluated was a deep feedforward network, and data in the frequency domain was used as input. The ML tool was trained and validated with artificial data before being tested with industrial data. Rabba et al. [
42] also proposed the use of an ML algorithm to detect oscillation in a closed-loop process. The ML model used was a XGBoost, and the process was the Tennessee Eastman. A previous step of feature extraction is performed to select the features with a high degree of importance and correlation to be used as inputs. Still focusing on oscillation detection, Akavalappil et al. [
43] proposed the use of a CNN using the process variables (PV) and the controller output (OP) signals as input.
The main contribution of all those studies together is their indication that ML tools have the potential to be applied to predictive controller performance monitoring. However, there is a lack of studies that evaluate whether ML can be applied to diagnose the closed-loop condition as a whole, considering different possibilities of the source of fault. Also, there is a lack of studies that expand the evaluation to nonlinear systems or explore their use in complex systems for more direct diagnosis, including process and controller source of degradation. Furthermore, other gaps include the lack of complex processes as case studies, the application of commercial controllers, and the evaluation of deep neural networks.