1. Introduction
This paper explores time-optimal control in systems subject to viscous friction, an aspect vital across various domains including robotics and economic systems. It particularly examines the application of Pontryagin's Maximum Principle (PMP) to provide the fundamental understanding necessary for mastering time-optimal control strategies [1,2,3,4]. The literature cited includes a broad range of sources, from theoretical frameworks to practical applications, highlighting both the complexities of optimal control processes and strategies for enhancing their time efficiency.
Incorporating the damping term significantly increases the complexity of solving the optimal control problem and of understanding the dynamics of the system. Despite this complexity, including the damping term is vital for developing methods to experimentally determine modal characteristics such as eigenmodes, eigenfrequencies, and generalized masses. References [5,6,7] specifically address the behavior of the damped system for computational and, more importantly, experimental analysis purposes. It is well known that transient simulation of systems with friction requires excessive computational power due to the nonlinear constitutive laws and the high stiffnesses involved. In [8], the authors proposed control laws for friction dampers that maximize energy dissipation in an instantaneous sense by modulating the normal force at the friction interface. Besides optimization of the mechanical design and various types of passive damping treatments, active structural vibration control concepts are an efficient means of reducing unwanted vibrations [9]. The conclusion from this broad survey is that the system model and the friction model are fundamentally coupled and cannot be chosen independently.
Viscous dampers work by converting mechanical energy of motion (kinetic energy) into heat through viscous fluids. As part of the damping process, they oppose relative motion through fluid resistance, effectively controlling the speed and motion of connected components. Viscous dampers are essential for managing dynamic systems where control of movement and stability is necessary, making them indispensable in many high-stakes environments such as automotive engineering and structural design. These dampers are increasingly sophisticated, incorporating technologies such as electrorheological and magnetorheological fluids, which allow for variable stiffness and damping properties. This adaptability enhances their ability to mitigate vibrations across various earthquake intensities [10]. By integrating dampers into the structural design using mathematical models, engineers can significantly improve a building's ability to absorb and dissipate energy during earthquakes; this includes detailed calculation of damping coefficients and their impact on the building's overall dynamic response to seismic events [11]. It is noteworthy that optimizing this type of damper (the friction damper) remains a relatively unexplored subject worldwide, which highlights the innovative nature of our paper and serves as the driving motivation for our research.
Within the sphere of optimal control, the time-varying harmonic oscillator garners particular interest for its ability to reach designated energy levels effectively. Systems of the form (1), linear with respect to their variables and with bounded control on the right-hand side, often call for a bang-bang control strategy. The oscillations in such a system differ significantly both from the natural oscillations of a system described by an equation with constant coefficients and from the forced oscillations due to an external force that depends only on time. The bang-bang approach toggles the system's excitation between two extremes at precisely calculated switching instants, which are essential as they mark the moments of control adjustment. These instants are represented by a switching curve within the state space, directing the oscillator's management for any given combination of position and velocity. An extensive examination of time-optimality for both undamped and damped harmonic oscillators, including simulations that illustrate its practicality, is given in [13,14]. A complex nonlinear system under state feedback control with a time delay, corresponding to two coupled nonlinear oscillators with parametric excitation, is investigated in [12] by an asymptotic perturbation method based on Fourier expansion and time rescaling. Given that the present investigation focuses on optimal control of the coefficients of the equation, the problem assumes a bilinear form. In engineering, particularly in nonlinear dynamics, parametric excitation is used to control vibrations in complex mechanical systems. The pendulum with periodically varying length, which is also treated as a simple model of a child's swing, is investigated in [15]. Simulations were performed in [16] on a double obstacle pendulum system to investigate, through numerical experiments, the effects of various parameters, including the positions and number of obstacle pins and the initial release angles, on the pendulum's motion. The pendulum with a vertically oscillating support and the pendulum with periodically varying length were considered as two forced dissipative pendulum systems, with a view to comparing their behaviour [17]. A varying-length pendulum is studied in [18], where its oscillations are damped using a conveniently generated Coriolis force. By applying the homotopy analysis method to the governing equation of the pendulum, a closed-form approximate solution was obtained in [19].
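To make the bang-bang idea concrete, the following sketch simulates a parametrically excited oscillator whose stiffness coefficient toggles between two extreme values on the sign of the product of position and velocity, pumping the amplitude up like a swing. The model $\ddot{x} + k\dot{x} + u(t)x = 0$, the bounds $U_1$, $U_2$, and the switching rule are illustrative assumptions, not the system studied in this paper.

```python
# Illustrative sketch (not this paper's exact system): a bilinear oscillator
#   x'' + k x' + u(t) x = 0,   u(t) in {U1, U2},
# driven by a naive bang-bang rule that toggles the stiffness on the sign
# of x*x' and thereby pumps the amplitude, like pushing a swing.
import numpy as np
from scipy.integrate import solve_ivp

K = 0.1            # assumed viscous friction coefficient
U1, U2 = 0.5, 2.0  # assumed admissible bounds of the controlled coefficient

def rhs(t, y):
    x, v = y
    # soft spring while moving away from equilibrium, stiff while returning
    u = U1 if x * v > 0 else U2
    return [v, -K * v - u * x]

sol = solve_ivp(rhs, (0.0, 30.0), [1.0, 0.0], max_step=1e-3)
print("amplitude grew from 1.0 to %.3f" % np.max(np.abs(sol.y[0])))
```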
Damping results in prolonged oscillations until equilibrium is achieved, and adjusting the damping coefficient can expedite the damping process. Time-optimal control problems, known for their inverse character, are prone to instability [20], which challenges traditional analytical approaches and necessitates regularization of solutions. To complement the analytical solution, numerical methods are employed, offering a tangible presentation of the results. This research derives an analytical solution for the control function and the optimal duration of the process across a wide range of parameters; it also introduces bang-bang relay-type controls and defines the system's reachability set.
Moreover, the paper underscores the critical role of time-optimal control in contemporary industrial and technological settings, stressing the need for durable solutions in which time efficiency is pivotal to the sustainability of robotic systems [21].
To summarize, we address the time-optimal control problem, which must primarily be solved analytically due to its inverse nature. This research asks whether the time-optimal process exhibits periodicity and whether the control function is symmetric over its period; the findings confirm the former and refute the latter. The focus on controlling a coefficient of the system opens new avenues for inquiry, providing both confirming and challenging insights into established assumptions.
The rest of this paper is organized as follows. Section 2 contains the formulation of the optimal control problem. Section 3 contains a preliminary study of the controlled system and reveals some of its properties. Section 4 examines the local properties of the problem and applies the maximum principle (PMP) to a single semi-oscillation. Section 5 completes the analysis and establishes the global properties of the optimal solution. Section 6 presents the main result of the study, a step-by-step optimization algorithm for solving the problem, together with numerical examples and a discussion of the results obtained. Section 7 summarizes the paper.
3. General Properties of the Controlled System (2)
With any permissible control, the trajectory of the controlled system (2) oscillates around the starting coordinate, with successive intervals of monotonic increase and decrease (Figure 1). The amplitude and duration of each oscillation can vary, depending on the chosen (typically discontinuous) control function. Indeed, if at some moment in time the velocity vanishes while the coordinate does not, it follows from the differential equation of problem (2) that the second derivative of the trajectory is nonzero at that moment. Given that the trajectory and its derivative are continuous, the sign of the second derivative is preserved in a small vicinity of this point, except possibly at a finite number of discontinuity points of the control function. This implies that the trajectory has a point of local maximum where the coordinate is positive and a point of local minimum where it is negative.
From the boundary conditions, it follows that the velocities at the initial and final moments of time equal zero, a situation that occurs only at the extreme points of the oscillatory process. These moments in time are denoted by $t_0 < t_1 < \dots < t_N$ (Figure 1), and the time intervals $[t_{i-1}, t_i]$ are referred to as semi-oscillations. From this, it is inferred that the optimal trajectory comprises a whole number $N$ of semi-oscillations, $N$ being even when the boundary values $A$ and $B$ have the same sign and odd when their signs are opposite.
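The segmentation into semi-oscillations is easy to reproduce numerically: detect the instants where the velocity vanishes and split the trajectory there. The sketch below does this for an assumed damped oscillator with a constant coefficient; the model and its parameters are placeholders, not the paper's system (2).

```python
# Sketch: split a simulated trajectory of an assumed damped oscillator
#   x'' + k x' + u x = 0   (u held constant here)
# into semi-oscillations at the instants t_i where the velocity vanishes.
import numpy as np
from scipy.integrate import solve_ivp

K, U = 0.1, 1.0   # assumed friction coefficient and stiffness

def rhs(t, y):
    x, v = y
    return [v, -K * v - U * x]

def extreme(t, y):        # event: velocity crosses zero -> extreme point of x
    return y[1]

sol = solve_ivp(rhs, (0.0, 40.0), [1.0, 0.0], events=extreme, max_step=1e-2)
t_i = sol.t_events[0]
print("extreme points t_i      :", np.round(t_i[:5], 3))
print("semi-oscillation lengths:", np.round(np.diff(t_i[:5]), 3))
```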
To investigate the full optimal control problem, let us divide the trajectory into separate semi-oscillations and first solve the problem for one semi-oscillation $[t_{i-1}, t_i]$. We denote by $x_i = x(t_i)$ the values of the trajectory at the extreme points ($x_0 = A$, $x_N = B$). This leads to $N$ subproblems of the form (3) for $i = 1, \dots, N$.
Utilizing the linearity and homogeneity of the differential equation allows the variable to be normalized by dividing it by its initial value $x_{i-1}$. It is also taken into account that the coefficient of friction is independent of time $t$, meaning the initial moment of each subproblem can be taken as zero. This approach transforms all subproblems (3) for $i = 1, \dots, N$ into a unified auxiliary mini-problem of optimal control (4). Given problem (2) and knowing the numbers $x_{i-1}$ and $x_i$, the optimal time of the corresponding semi-oscillation is accurately represented by the optimal time of problem (4), and the optimal trajectories and control of the auxiliary problem (4) coincide with the optimal trajectories and control of problem (2) over the interval $[t_{i-1}, t_i]$ [1]. It will be demonstrated below that the optimal process breaks down into individual equal time intervals, calculated using analytical formulas.
Furthermore, for convenience in solving (4), the normalized variable and time will simply be denoted by $x$ and $t$ in what follows.
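The normalization argument rests on linearity and homogeneity: scaling the initial value scales the whole trajectory. The sketch below verifies this numerically for an assumed linear homogeneous model with a time-varying coefficient; the model and the particular coefficient schedule are placeholders, not the paper's formulas.

```python
# Sketch: scale invariance behind the normalization x -> x / x(t_{i-1}).
# For an assumed linear homogeneous model x'' + k x' + u(t) x = 0, the
# trajectory starting from x0 = A is exactly A times the one from x0 = 1,
# so every subproblem reduces to the same unit-start mini-problem.
import numpy as np
from scipy.integrate import solve_ivp

K = 0.1
def u(t):                         # arbitrary admissible coefficient schedule
    return 2.0 if np.sin(3.0 * t) > 0 else 0.5

def rhs(t, y):
    x, v = y
    return [v, -K * v - u(t) * x]

A, grid = 3.0, np.linspace(0.0, 10.0, 201)
xa = solve_ivp(rhs, (0, 10), [A, 0.0], t_eval=grid, max_step=1e-3).y[0]
x1 = solve_ivp(rhs, (0, 10), [1.0, 0.0], t_eval=grid, max_step=1e-3).y[0]
print("max |x_A(t) - A * x_1(t)| =", np.max(np.abs(xa - A * x1)))   # ~ 0
```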
4. Solution of the Optimal Control Problem for a Single Semi-Oscillation
In the previous section, it was demonstrated how the initial problem (2) can be resolved by first solving an auxiliary problem (5) and finding the dependence of the optimal time $T$ on the terminal value $C$. Here, the additional condition expresses the monotonicity of the trajectory, which corresponds to one semi-oscillation.
First, the question of controllability will be examined, and the range of values of $C$ for which problem (5) has a solution will be determined. The following notation is introduced for the extreme attainable terminal values. The value of largest magnitude is attained with the control (6), because under this control the acceleration is maximal while speed is being gained and the deceleration is minimal afterwards. Similarly, the value of smallest magnitude is reached analogously with the control (7). Solving the differential equation with the boundary conditions of system (5) under control (6) or (7), one obtains the two-sided estimate (8), where the constants appearing in it are given by (9).
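The reachable range of terminal values can also be estimated numerically by integrating an assumed model under the two extreme switching controls and stopping at the next extreme point of the trajectory. Everything below (the model, the bounds, the switch at the zero crossing) is an assumption made for illustration, not an implementation of formulas (6)-(9).

```python
# Sketch: estimate the terminal values reachable in one semi-oscillation of
# the ASSUMED model x'' + k x' + u x = 0, x(0) = 1, x'(0) = 0, under the two
# extreme controls that switch at the zero crossing of x.
import numpy as np
from scipy.integrate import solve_ivp

K, U1, U2 = 0.1, 0.5, 2.0   # assumed friction and control bounds

def run(u_before, u_after):
    def rhs(t, y):
        x, v = y
        # hypothetical extreme control: one value while x > 0, the other after
        return [v, -K * v - (u_before if x > 0 else u_after) * x]
    def extreme(t, y):                  # end of the semi-oscillation: v = 0
        return y[1]
    extreme.terminal, extreme.direction = True, 1   # v returns from - to +
    sol = solve_ivp(rhs, (0.0, 100.0), [1.0, 0.0],
                    events=extreme, max_step=1e-3)
    return sol.y[0, -1]

print("farthest terminal value :", run(U2, U1))  # stiff first, soft after
print("nearest terminal value  :", run(U1, U2))  # soft first, stiff after
```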
To apply the PMP [1], we introduce phase variables and rewrite (5) in the form of a system of first-order differential equations (10). Now let the terminal value $C$ satisfy condition (8), which ensures the controllability of the system. We write the Pontryagin function $H$ and denote its upper bound over the admissible controls. If a trajectory, a control, and a terminal time constitute a solution of the optimal control problem (10), then the following three conditions are satisfied:
I) There exist continuous functions $\psi_1$ and $\psi_2$, never simultaneously equal to zero, which are solutions of the adjoint system (11).
II) At every moment of time, the maximum condition (12) is satisfied.
III) At every moment of time, the upper bound of the Pontryagin function along the optimal process is nonnegative.
From condition (12) for the maximum of the function $H$, the optimal control is obtained in the form (13). Let us show that the case of singular control in formula (13), namely the case in which the product determining the sign of the control vanishes on a time interval of nonzero length, is impossible. Assume the opposite: there exists a time interval on which this product is identically zero. On such an interval, the value of the optimal control could not be determined from the maximum condition. Given the continuity of the functions $\psi_2$ and $x$, either $\psi_2$ must vanish identically on some subinterval, or $x$ must vanish identically over a certain period of time. If $\psi_2 \equiv 0$, then, by the second equation of the adjoint system (11), $\psi_1$ must also be identically zero; both adjoint variables would then vanish simultaneously, contradicting condition I) of the maximum principle. In the scenario where $x \equiv 0$, the velocity would vanish identically as well. Such a case is impossible, since the controlled system cannot remain in the zero state between nonzero boundary values: the term of the differential equation of (5) that contains the control would also equal zero, so no control could act on the system there.
This reasoning leads to the following statement.
Lemma 1. The optimal control takes only two values, $1$ and the lower admissible bound, dictated by the sign of the product $\psi_2 x$. Treating the case where this product equals zero as immaterial is justified by the fact that the control value at a single point, or at a finite number of points, has no impact on the trajectory of the controlled system.
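As an illustration of this bang-bang structure, the following sketch integrates the state together with the adjoint variables and selects the control pointwise by maximizing the Hamiltonian. The model, the Hamiltonian $H = \psi_1 x_2 + \psi_2(-k x_2 - u x_1)$, the bounds $U_1$, $U_2$, and the guessed initial adjoint values are all assumptions made for illustration; they are not the paper's formulas (10)-(13).

```python
# Sketch: forward integration of state + adjoint with the pointwise-maximizing
# control, for the ASSUMED system x1' = x2, x2' = -k x2 - u x1 and Hamiltonian
#   H = p1*x2 + p2*(-k*x2 - u*x1).
# Maximizing H over u in [U1, U2] yields the bang-bang law
#   u = U1 if p2*x1 > 0 else U2   (the sign-of-product rule of Lemma 1).
import numpy as np
from scipy.integrate import solve_ivp

K, U1, U2 = 0.1, 0.5, 2.0

def rhs(t, z):
    x1, x2, p1, p2 = z
    u = U1 if p2 * x1 > 0 else U2        # maximize the term -u*p2*x1
    return [x2,
            -K * x2 - u * x1,
            u * p2,                      # p1' = -dH/dx1
            -p1 + K * p2]                # p2' = -dH/dx2

z0 = [1.0, 0.0, 0.3, -1.0]               # guessed initial adjoint values
sol = solve_ivp(rhs, (0.0, 10.0), z0, max_step=1e-3)
print("x(T) = %.4f   x'(T) = %.4f" % (sol.y[0, -1], sol.y[1, -1]))
```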
Now consider condition III). It is of greatest interest at the moments $t = 0$ and $t = T$. At $t = 0$, the condition is expressed as (14); at $t = T$, it becomes (15). Given the boundary conditions, under which the velocities vanish at the initial and final moments, and considering that the controlled coefficient is always positive while the coordinate values at the endpoints are nonzero, the following additional conditions (16) on the signs of the adjoint variable at the endpoints are derived from (14) and (15).
We now explore the potential form of the optimal control and the number of its switches. It is already known that the value of the optimal control is determined by the sign of the product $\psi_2 x$. The trajectory $x(t)$, due to its monotonic nature, crosses zero exactly once; this moment in time is denoted by $t_*$. Thus, the control may change its value only at the point $t_*$ and at the points where the sign of the adjoint variable $\psi_2$ changes. If both $\psi_2$ and $x$ change sign simultaneously at $t_*$, then the control value remains unchanged.
First, consider a time interval on which the control takes its upper value. Then the general solution $x(t)$ of the differential equation of problem (4) and $\psi_2(t)$ of the adjoint system take the specific form (17), where the four constants of integration must be determined from the boundary conditions on the interval of constant control. The value of the adjoint variable $\psi_1$ is not of interest, as it does not enter formula (13).
Now consider a time interval during which the control takes its lower value. Similarly, one obtains the form (18), where the constants of integration are likewise determined from the boundary conditions.
We now show that the adjoint variable $\psi_2$ vanishes at most twice within the interval $[0, T]$, at most once on each side of $t_*$. Indeed, suppose $\psi_2$ had two zeros within one of the subintervals $[0, t_*]$ or $[t_*, T]$. Between these zeros the control value would not change, which leads to a contradiction with formulas (17) and (18): the distance between consecutive zeros of the function $\psi_2$ given by these formulas exceeds the maximum length of an interval of constant sign and monotonicity of the function $x(t)$.
Thus, it is proven that
Lemma 2. In problem (5), the optimal control can have no more than one switch in each of the intervals $[0, t_*]$ and $[t_*, T]$.
The function $\psi_2$ has a continuous derivative (as the right-hand side of the second equation of the adjoint system (11) is continuous) and vanishes no more than twice within the interval $[0, T]$; moreover, both zeros cannot lie within the same subinterval $[0, t_*]$ or $[t_*, T]$. This leads to 10 different cases (Figure 2) of sign changes of the function $\psi_2$ over the interval $[0, T]$. Dashed gray lines on the graph indicate scenarios that contradict the PMP, while solid red lines indicate cases in which no contradiction with the PMP is found. A detailed analysis of these cases is provided below.
If $\psi_2$ vanishes at the point $t_*$ itself, then $\psi_2$ changes sign there together with the trajectory, so the product $\psi_2 x$ keeps its sign for $t \neq t_*$, leading to cases 1) and 2). In case 1), a constant control equal to 1 is maintained throughout the entire time interval. Case 2) is not possible, as the resulting signs of $\psi_2$ at the endpoints do not satisfy condition (16).
If $\psi_2$ vanishes twice within the interval $(0, T)$, there exist one zero in $(0, t_*)$ and one in $(t_*, T)$, leading to cases 3) and 4). These cases contradict condition (16), since the values of $\psi_2$ at the two endpoints have the same sign.
If $\psi_2$ vanishes exactly once, at a point of the interval $(0, t_*)$, and does not vanish elsewhere, cases 5) and 6) are obtained. Case 5) is impossible because it violates condition (16).
If $\psi_2$ vanishes exactly once, at a point of the interval $(t_*, T)$, and does not vanish elsewhere, cases 7) and 8) emerge. Case 8) is not feasible for the same reason.
Finally, if $\psi_2$ does not vanish within the interval at all, cases 9) and 10) are considered; each of them is possible under the corresponding sign of $\psi_2$.
After analyzing cases 1)-10), the following statement is established.
Lemma 3. The optimal (bang-bang) control in problem (5) can be of one of the five types presented in Figure 3.
Note that all types of control satisfying the maximum principle (illustrated in Figure 3) differ in the length of the segment on which the control takes its lower value and in its placement relative to the point $t_*$.
Introduce the parameter $s$, the switching moment of the control. The values of $C$ and $T$ can then be uniquely determined from the equation and three of the boundary conditions of problem (5) (excluding the condition on the terminal value) by substituting the corresponding control. This results in the determination of the end time $T$ and the terminal value $C$ as functions of the unknown parameter $s$.
To control type 3 (illustrated in Figure 3) corresponds the value $s = t_*$ (the switch coincides with the zero crossing, so the control is effectively constant), and to control type 1 the smallest value of $s$. For control type 5, the largest value of $s$ is obtained as the longest possible duration of motion under constant control, with the corresponding moments of time derived from formula (18) and the boundary conditions of problem (5). Similarly, formula (18) yields the smallest value of $s$. Controls of types 2 and 4 correspond to intermediate values of $s$ within the two intervals between these extreme values and $t_*$.
Knowing the switching moment of the control and having the analytical solutions (formulas (17) and (18)), the end time $T$ and the terminal trajectory value $C$ can be explicitly calculated as functions of the parameter $s$.
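This construction can be imitated numerically. The sketch below sweeps a single switching time $s$ for an assumed model and records the resulting pair of terminal value $C$ and duration $T$ of one semi-oscillation; it is a simplified one-switch stand-in for the families of Figure 3, not an implementation of formulas (17)-(22).

```python
# Sketch: trace a T(C)-type dependence numerically by sweeping a single
# switching time s for the ASSUMED model x'' + k x' + u x = 0, x(0) = 1,
# x'(0) = 0, with u = U2 on [0, s) and u = U1 afterwards.
import numpy as np
from scipy.integrate import solve_ivp

K, U1, U2 = 0.1, 0.5, 2.0

def semi_oscillation(s):
    def rhs(t, y):
        x, v = y
        return [v, -K * v - (U2 if t < s else U1) * x]
    def extreme(t, y):                  # stop at the next extreme point
        return y[1]
    extreme.terminal, extreme.direction = True, 1
    sol = solve_ivp(rhs, (0.0, 50.0), [1.0, 0.0],
                    events=extreme, max_step=1e-3)
    return sol.y[0, -1], sol.t[-1]      # terminal value C and duration T

for s in np.linspace(0.0, 3.0, 7):
    C, T = semi_oscillation(s)
    print("s = %.2f  ->  C = %+.4f,  T = %.4f" % (s, C, T))
```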
Consider the case $s < t_*$. Then, on the initial interval of time up to $s$, the control is constant, and from formula (17) and the initial conditions of problem (5) the constants of integration are found, which determines the trajectory up to the switching moment. Subsequently, on the interval between $s$ and $t_*$, the control takes its other value, and the solution follows from formula (18) and the continuity of the trajectory and its velocity at the point $s$. Finally, on the interval from $t_*$ to $T$, formula (17) and the continuity of the solution at $t_*$ yield the expression (19) for the trajectory. From formula (19) and the condition that the velocity vanishes at the final moment, the end moment of time is obtained in the form (20). Simplifying expressions (19)-(20), one ultimately obtains, for $s < t_*$, the formulas (21). Conducting analogous calculations for the case $s > t_*$, one obtains (22).
Note that formulas (21) and (22) parametrically define a certain curve depicting the dependence of the end time on the terminal value $C$ when controls satisfying the maximum principle are used. The parametric formulation of the function allows the first two derivatives of $T$ with respect to the variable $C$ to be calculated. Thus, the following properties of the function $T(C)$ are established.
Lemma 4. Formulas (21) and (22):
uniquely determine the function $T(C)$, defined for all admissible terminal values $C$;
the function $T(C)$ is continuous on this range;
the function $T(C)$ is differentiable in the interior of the range; at the endpoints of the interval the derivative becomes infinite, while at the point corresponding to the switch-free value of the parameter the derivative equals zero (denote this point by $C_0$);
the function $T(C)$ decreases on the interval on one side of $C_0$ and increases on the other;
the second derivative of the function $T(C)$ is negative on both of these intervals, which signifies that the function is concave down throughout its range.
Remark 1. Note that the constancy of the sign of the second derivative was established by symbolic computation using the Wolfram symbolic mathematics system.
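As a numerical counterpart to that symbolic check, the sign of $d^2T/dC^2$ for a parametrically given curve $(C(s), T(s))$ can be probed with central finite differences via the chain rule for parametric derivatives. The toy pair $C(s)$, $T(s)$ below is a hypothetical stand-in, since formulas (21) and (22) are not reproduced here.

```python
# Sketch: checking the sign of d^2T/dC^2 for a parametrically defined curve
# (C(s), T(s)) by central finite differences. The chain rule gives
#   dT/dC = T'(s)/C'(s),   d2T/dC2 = (T''*C' - T'*C'') / C'(s)**3.
# The toy pair below is a stand-in, NOT formulas (21)-(22).
import numpy as np

C = lambda s: -np.cosh(s)                  # hypothetical terminal value
T = lambda s: np.pi + np.log(np.cosh(s))   # hypothetical duration

def d2T_dC2(s, h=1e-4):
    dC  = (C(s + h) - C(s - h)) / (2 * h)
    dT  = (T(s + h) - T(s - h)) / (2 * h)
    d2C = (C(s + h) - 2 * C(s) + C(s - h)) / h ** 2
    d2T = (T(s + h) - 2 * T(s) + T(s - h)) / h ** 2
    return (d2T * dC - dT * d2C) / dC ** 3  # parametric second derivative

for s in [0.5, 1.0, 1.5, 2.0]:
    print("s = %.1f   d2T/dC2 = %+.5f" % (s, d2T_dC2(s)))   # negative: concave
```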
Investigating the properties of the function $T(C)$, it was found that each permissible terminal value $C$ corresponds to a unique control satisfying the PMP. Therefore, the following statement holds.
Lemma 5. The function $T(C)$, defined by formulas (21) and (22), determines the optimal time in problem (5).
Example 1. For a fixed set of the system parameters, the admissible range of terminal values and the corresponding optimal times are computed from (21). Figure 4 illustrates the graph of the function $T(C)$, demonstrating how the optimal time varies with the terminal value $C$ within the admissible range.
It has been demonstrated that each value of $s$ corresponds unequivocally to a specific optimal control and optimal trajectory, leading to a particular terminal point. Different optimal trajectories, corresponding to the various types of controls, are presented in Figure 5. Controls of types 1 and 5 correspond to trajectories reaching the extreme points of the reachability set. Control of type 2 corresponds to the upper branch of the $T(C)$ curve (the left branch in Figure 4). Control of type 3, which has no switches, corresponds to the trajectory with the minimum possible time. Control of type 4 corresponds to the lower branch of the $T(C)$ curve (the right branch in Figure 4). The trajectories are constructed for the parameter values used in Figure 4, but the general character of the picture does not change for other parameter values.
5. Solution of the General Time-Optimal Problem
We now apply the results of the previous section to solve the original problem (2). Let us first explore the question of controllability and determine under what boundary conditions $A$ and $B$ the system is controllable. Using the estimate (8), we obtain an estimate for the attainable boundary values depending on the number of semi-oscillations $N$.
Thus, the following lemma holds.
Lemma 6. The system is controllable if and only if there exists an even natural number $N$ (for $AB > 0$) or an odd natural number $N$ (for $AB < 0$) such that the ratio of the boundary values satisfies the corresponding $N$-fold estimate.
Since the extreme amplitude ratios of a single semi-oscillation compound over successive semi-oscillations, the attainable range widens without bound as $N \to \infty$ whenever the largest single-semi-oscillation ratio exceeds unity. Therefore, the system will be controllable for any nonzero values of $A$ and $B$ provided that this condition holds; utilizing formula (9), the corresponding inequality on the system parameters can be written out explicitly.
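For intuition, a toy search for the smallest admissible $N$ is sketched below; the per-semi-oscillation ratio bounds are hypothetical stand-ins for the true bounds derived from (8) and (9).

```python
# Sketch: smallest number of semi-oscillations N needed for controllability
# (cf. Lemma 6), assuming each semi-oscillation can change the amplitude by
# a factor anywhere in [R_MIN, R_MAX] and flips the sign of the extreme.
# The factors below are hypothetical stand-ins for the bounds (8)-(9).

R_MIN, R_MAX = 0.7, 1.4   # assumed extreme amplitude ratios per semi-osc.

def smallest_N(A, B, n_max=200):
    ratio = abs(B / A)
    # parity: even N keeps the sign of the extreme value, odd N flips it
    start = 2 if A * B > 0 else 1
    for N in range(start, n_max, 2):
        if R_MIN ** N <= ratio <= R_MAX ** N:
            return N
    return None   # not controllable within n_max semi-oscillations

print(smallest_N(1.0, -0.2))   # odd N: end point of opposite sign
print(smallest_N(1.0, 5.0))    # even N: end point of the same sign
```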
Having resolved the question of controllability, we now return to the original optimal control problem (2). Given a solution of the optimal control problem (2), consider two consecutive semi-oscillations of its trajectory. This segment of the optimal trajectory satisfies boundary conditions of the same form as the original problem and must therefore itself be optimal. Using the results of the previous section and normalizing the variable, the time for this segment can be expressed through the optimal-time function of the two amplitude ratios.
Fix $x_{i-1}$ and $x_{i+1}$ (noting that they have the same sign) and minimize the last expression with respect to the intermediate value $x_i$. Denoting $q = x_{i+1}/x_{i-1}$ and introducing the new variable $z = x_i/x_{i-1}$, the time for the segment is expressed by the function $T(z) + T(q/z)$, where $T(\cdot)$ is defined parametrically by formulas (21) and (22). Let $q$ and $z$ belong to the domain of definition of the function $T$. Differentiating this function with respect to $z$, it is easy to notice that the derivative vanishes at the point $z = \sqrt{q}$. We then compute the second derivative at this point.
Given that the function $T$ decreases and is concave down on the relevant branch, all terms in the resulting expression are positive, and the point found is a point of minimum. At the boundary points of the domain of definition, the function is not differentiable, but in this case there exists a unique control (either (6) or (7)) leading the controlled system to its extreme position. For the remaining values of $z$, the positivity of the expression (26) follows from lengthy algebraic manipulations using the parametric representation of the function $T$ by formulas (21) and (22).
We have shown that the numbers $x_{i-1}$, $x_i$, $x_{i+1}$ form a geometric progression. Applying this reasoning along the entire trajectory, we obtain the following statement.
Lemma 7. The extreme values $x_i$ of the optimal process satisfy the geometric-progression condition $x_i^2 = x_{i-1} x_{i+1}$, where the number of semi-oscillations is determined as the smallest $N$ satisfying Lemma 6.
Since the ratio of successive extreme values is constant along the optimal trajectory, the optimal control on each segment is the same. Hence, if the number of semi-oscillations required to reach the end point exceeds one, the optimal control is a periodic function whose period is one semi-oscillation.
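A small numerical check of the geometric-progression property: if, as established for $T(C)$, the minimal semi-oscillation time depends only on the amplitude ratio $q$, and $g(u) = f(e^u)$ is strictly convex, then minimizing $f(m/a) + f(b/m)$ over the intermediate amplitude $m$ gives $m = \sqrt{ab}$. The function $f$ below is a hypothetical convex stand-in, not the paper's $T(C)$.

```python
# Sketch: the intermediate amplitude that minimizes the total time of two
# consecutive semi-oscillations lands at the geometric mean (cf. Lemma 7).
# f(q) is a HYPOTHETICAL stand-in for the optimal semi-oscillation time as a
# function of the amplitude ratio q; any f with g(u) = f(exp(u)) strictly
# convex yields the same minimizer.
import numpy as np
from scipy.optimize import minimize_scalar

f = lambda q: np.pi + 0.5 * np.log(q) ** 2

a, b = 1.0, 0.25                       # amplitudes at the ends of the segment
total = lambda m: f(m / a) + f(b / m)  # time through intermediate amplitude m
res = minimize_scalar(total, bounds=(0.05, 0.95), method="bounded")
print("numerical minimizer m*   = %.4f" % res.x)
print("geometric mean sqrt(a*b) = %.4f" % np.sqrt(a * b))
```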