Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Tuning of PID Controllers using Reinforcement Learning for Nonlinear Systems Control

Version 1 : Received: 15 March 2024 / Approved: 15 March 2024 / Online: 15 March 2024 (12:12:48 CET)

How to cite: Bujgoi, G.; Sendrescu, D. Tuning of PID Controllers using Reinforcement Learning for Nonlinear Systems Control. Preprints 2024, 2024030914. https://doi.org/10.20944/preprints202403.0914.v1 Bujgoi, G.; Sendrescu, D. Tuning of PID Controllers using Reinforcement Learning for Nonlinear Systems Control. Preprints 2024, 2024030914. https://doi.org/10.20944/preprints202403.0914.v1

Abstract

The numerical implementation of the controllers allows the use of very complex algorithms with ease. However, in practice, due to its proven advantages, the PID controller (and its variants) is widely used in industrial control systems as well as in many other applications that require continuous control. Most of the methods for tuning the parameters of PID controllers are based on time-invariant linear models of the processes, which in practice can lead to poor performance of the control system. The paper presents an application of reinforcement learning algorithms in tuning of PID controllers for the control of some classes of continuous nonlinear systems. Tuning the parameters of PID controllers is done with the help of Twin Delayed Deep Deterministic Policy Gradients (TD3) algorithm which presents a series of advantages compared to other similar methods from Machine Learning dedicated to continuous state and action spaces. TD3 algorithm is an off-policy Actor-Critic based method and was used as it does not require a system model. The presented technique is applied for control of a biotechnological system which has a strongly nonlinear dynamic. The proposed tuning method is compared to the classical tuning methods of PID controllers. The performance of the tuning method based on the TD3 algorithm is demonstrated through simulation illustrating the effectiveness of the proposed methodology.

Keywords

learning-based control; nonlinear systems control; PID controller; bioprocess

Subject

Engineering, Control and Systems Engineering

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.