Preprint
Article

This version is not peer-reviewed.

A Formal Proof of Castigliano Theorem and a Related Generalization Including a Non-Linear Case

Submitted:

15 November 2025

Posted:

18 November 2025

You are already at the latest version

Abstract
This short communication develops a formal proof of Castilgiano Theorem in a elasticity context. The results are base on standard tools of applied functional analysis and calculus of variations. It is worth mentioning such results here presented may be easily extended to a non-linear elasticity context. Finally, in the last section we present a numerical example in order to illustrate the results applicability.
Keywords: 
;  ;  ;  ;  

1. Introduction

In this article version we present some corrections and improvements concerning the previous versions [1,2].
In this article version we also include a new extension to a non-linear elasticity case.
In the next section we present the mathematical formalism of a result in elasticity theory known as the Castigliano’s Theorem.
We also present a generalization of such an theorem and its connection with the principle of virtual work in a elasticity theory context.
Furthermore, in this article version we also include a new extension to a non-linear elasticity case.
The results are obtained through an application of basic tools of functional analysis and calculus variations to a solid mechanics theory.
Our main reference in solid mechanics is [3]. Similar results have been presented in section 40 of preprint [4].
For basic topics on the applied functional analysis and calculus of variations, please see [5,6].
For the Sobolev spaces involved, we would cite [7].
Remark 1.1.
In this text we have adopted the Einstein convention of summing up repeated indices, unless otherwise indicated.

2. A Formal Proof of Castigliano Theorem

Let Ω R 3 be an open, bounded and connected set with a regular (Lipischitzian) boundary denoted by Ω .
In a context of linear elasticity, consider the functional J : V R where
J ( u ) = E i n u i , f i L 2 j = 1 N u i ( x j ) P i j ,
u = ( u 1 , u 2 , u 3 ) W 0 1 , 2 ( Ω ; R 3 ) V , f = ( f 1 , f 2 , f 3 ) L 2 ( Ω ; R 3 ) , Y = Y * = L 2 ( Ω ; R 3 ) , and
P i j R , i { 1 , 2 , 3 } , j { 1 , , N }
for some N N .
Here we have denoted
E i n = 1 2 Ω H i j k l e i j ( u ) e k l ( u ) d x ,
e i j ( u ) = 1 2 u i x j + u j x i .
Moreover H i j k l is a fourth order positive definite and constant tensor.
Observe that the variation of J in u i give us the following Euler-Lagrange equation
( H i j k l e k l ( u ) ) , j f i j = 1 N P i j δ ( x j ) = 0 , in Ω .
Symbolically such a system stands for
J ( u ) u i = 0 , i { 1 , 2 , 3 } ,
so that
( E i n u i , f i L 2 j = 1 N u i ( x j ) P i j ) u l = 0 , l { 1 , 2 , 3 } .
We denote u V solution of (9) by u = u ( f , P ) , so that multiplying the concerning extremal equation by u i and integrating by parts, we get
H 1 ( u ( f , P ) , f , P ) = 2 E i n ( u ( f , P ) ) u i ( f , P ) , f i L 2 j = 1 N u i ( x j , f , P ) P i j = 0 , f Y * , P R 3 N .
Therefore
d d P l s H 1 ( u ( f , P ) , f , P ) = 0 ,
so that
2 d E i n d P l s d d P l s u i ( f , P ) , f i L 2 + j = 1 N u i ( x j , f , p ) P i j = 0 ,
where we recall that, from the Implicit Function Theorem, we have
d E i n d P l s = E i n u k u k P l s .
Hence, from this and (12), we obtain
d E i n d P l s + d d P l s E i n u i ( f , P ) , f i L 2 j = 1 N u i ( x j , f , p ) P i j = 0 ,
so that
d E i n d P l s + ( E i n u i , f i L 2 j = 1 N u i ( x j ) P i j ) u k u k P l s P l s u i , f i L 2 + j = 1 N u i ( x j ) P i j = 0 .
From this, recalling that
( E i n u i , f i L 2 j = 1 N u i ( x j ) P i j ) u k = 0 , k { 1 , 2 , 3 }
we obtain
d E i n d P l s u l ( x s ) = 0 ,
so that
u l ( x s ) = d E i n d P l s = d d P l s 1 2 Ω H i j k l e i j ( u ( f , P ) ) e k l ( u ( f , P ) ) d x ,
l { 1 , 2 , 3 } , s { 1 , , N } .
With such results in mind, we have proven the following theorem.
Theorem 2.1
(Castigliano). Considering the notations and definitions in this section, we have
u l ( x s ) = d E i n d P l s = d d P l s 1 2 Ω H i j k l e i j ( u ( f , P ) ) e k l ( u ( f , P ) ) d x ,
l { 1 , 2 , 3 } , s { 1 , , N } .

2.1. A Generalization of Castigliano Theorem

In this subsection, denoting by δ ( x k ) a standard Dirac delta function in a distributional sense, we present a more general version of the Castigliano theorem.
Considering the context of last section, we recall that
H 1 ( u ( f , P ) , f , P ) = 2 E i n ( u ( f , P ) ) u i ( f , P ) , f i L 2 j = 1 N u i ( x j , f , P ) P i j = 0 , f Y * , P R 3 N .
Therefore, here denoting the Gâteaux derivative of H 1 related to f l by
d d f l H 1 ( u ( f , P ) , f , P ) ,
from the extremal equation
d d f l H 1 ( u ( f , P ) , f , P ) = 0 ,
for x k Ω such that
x k x j , j { 1 , , N } ,
we have, in an appropriate distributional sense,
d d f l H 1 ( u ( f , P ) , f , P ) , δ ( x k ) L 2 = 0 ,
so that
2 d d f l E i n ( u ( f , P ) ) , δ ( x k ) L 2 d d f l u i ( f , P ) , f i L 2 + j = 1 N u i ( x j , f , P ) P i j , δ ( x k ) L 2 = 0 ,
that is
d d f l E i n ( u ( f , P ) ) , δ ( x k ) L 2 + u s E i n ( u ( f , P ) ) u i ( f , P ) , f i L 2 j = 1 N u i ( x j , f , P ) P i j u s d f l , δ ( x k ) L 2 f l u i ( f , P ) , f i L 2 j = 1 N u i ( x j , f , P ) P i j , δ x ( x k ) L 2 = 0 .
From such results, we may obtain
d d f l E i n ( u ( f , P ) ) , δ ( x k ) L 2 u l ( x ) , δ ( x k ) L 2 = 0 ,
so that
d d f l E i n ( u ( f , P ) ) , δ ( x k ) L 2 u l ( x k ) = 0 ,
that is
u l ( x k ) = d d f l E i n ( u ( f , P ) ) , δ ( x k ) L 2 ,
l { 1 , 2 , 3 } , x k Ω such that x k x j , j { 1 , , N } .
With such results in mind, we have proven the following theorem.
Theorem 2.2
(The Generalized Castigliano Theorem). Considering the notations and definitions in this section, here again denoting the Gâteaux derivative of H 1 related to f i by
d d f i H 1 ( u ( f , P ) , f , P ) ,
we have
u i ( x k ) = d d f i E i n ( u ( f , P ) ) , δ ( x k ) L 2 ,
i { 1 , 2 , 3 } , x k Ω such that x k x j , j { 1 , , N } .

2.2. The Virtual Work Principle

Considering the definitions, results and statements of the previous section and subsection, we may easily prove the following theorem.
Theorem 2.3
(The virtual work principle). Let x l Ω such that x l x j , j { 1 , , N } .
For a virtual constant load P ^ R on x l at the direction of u k ( x l ) , define now J : V R where
J ( u ) = E i n u i , f i L 2 j = 1 N u i ( x j ) P i j P ^ u k ( x l ) .
Under such hypotheses,
u k ( x l ) = d E i n ( u ( f , P , P ^ ) ) d P ^ P ^ = 0 ,
k { 1 , 2 , 3 } , x l Ω such that x l x j .
Proof. 
The proof is exactly the same as in the Castigliano Theorem in the previous section except by setting the virtual load P ^ = 0 in the end of this calculation and will not be repeated. □

3. The Castigliano Theorem for a Non-Linear Elasticity Case

Let Ω R 3 be an open, bounded and connected set with a regular (Lipischitzian) boundary denoted by Ω .
In a context of non-linear elasticity, consider the functional J : V R where
J ( u ) = E i n u i , f i L 2 j = 1 N u i ( x j ) P i j ,
u = ( u 1 , u 2 , u 3 ) W 0 1 , 2 ( Ω ; R 3 ) V , f = ( f 1 , f 2 , f 3 ) L 2 ( Ω ; R 3 ) , Y = Y * = L 2 ( Ω ; R 3 ) , and
P i j R , i { 1 , 2 , 3 } , j { 1 , , N }
for some N N .
Here we have denoted
E i n = 1 2 Ω H i j k l e i j ( u ) e k l ( u ) d x ,
e i j ( u ) = 1 2 u i x j + u j x i + u m , i u m , j .
Moreover H i j k l is a fourth order positive definite and constant tensor.
Observe that the variation of J in u i give us the following Euler-Lagrange equation
( H i j k l e k l ( u ) ) , j ( H i m k l e k l ( u ) u m , j ) , j f i j = 1 N P i j δ ( x j ) = 0 , in Ω .
Symbolically such a system stands for
J ( u ) u i = 0 , i { 1 , 2 , 3 } ,
so that
( E i n u i , f i L 2 j = 1 N u i ( x j ) P i j ) u l = 0 , l { 1 , 2 , 3 } .
We denote u V solution of (9) by u = u ( f , P ) , so that multiplying the concerning extremal equation by u i and integrating by parts, we get
H 1 ( u ( f , P ) , f , P ) = 2 E i n ( u ( f , P ) ) + 1 2 σ i j ( u ( f , P ) ) ( u m ( f , P ) ) , i ( u m ( f , P ) ) , j L 2 u i ( f , P ) , f i L 2 j = 1 N u i ( x j , f , P ) P i j = 0 , f Y * , P R 3 N ,
where
σ i j ( u ) = H i j k l e k l ( u ) , i , j { 1 , 2 , 3 } .
Therefore
d d P l s H 1 ( u ( f , P ) , f , P ) = 0 ,
so that
2 d E i n d P l s + d d P l s 1 2 σ i j ( u ( f , P ) ) ( u m ( f , P ) ) , i ( u m ( f , P ) ) , j L 2 d d P l s u i ( f , P ) , f i L 2 + j = 1 N u i ( x j , f , P ) P i j = 0 ,
where we recall that, from the Implicit Function Theorem, we have
d E i n d P l s = E i n u k u k P l s .
Hence, from this and (12), we obtain
d E i n d P l s + d d P l s 1 2 σ i j ( u ( f , P ) ) ( u m ( f , P ) ) , i ( u m ( f , P ) ) , j L 2 + d d P l s E i n u i ( f , P ) , f i L 2 j = 1 N u i ( x j , f , P ) P i j = 0 ,
so that
d E i n d P l s + d d P l s 1 2 σ i j ( u ( f , P ) ) ( u m ( f , P ) ) , i ( u m ( f , P ) ) , j L 2 + ( E i n u i , f i L 2 j = 1 N u i ( x j ) P i j ) u k u k P l s P l s u i , f i L 2 + j = 1 N u i ( x j ) P i j = 0 .
From this, recalling that
( E i n u i , f i L 2 j = 1 N u i ( x j ) P i j ) u k = 0 , k { 1 , 2 , 3 }
we obtain
d E i n d P l s + d d P l s 1 2 σ i j ( u ( f , P ) ) ( u m ( f , P ) ) , i ( u m ( f , P ) ) , j L 2 u l ( x s ) = 0 ,
so that
u l ( x s ) = d E i n d P l s + d d P l s 1 2 σ i j ( u ( f , P ) ) ( u m ( f , P ) ) , i ( u m ( f , P ) ) , j L 2 = d d P l s 1 2 Ω H i j k l e i j ( u ( f , P ) ) e k l ( u ( f , P ) ) d x + d d P l s 1 2 σ i j ( u ( f , P ) ) ( u m ( f , P ) ) , i ( u m ( f , P ) ) , j L 2 ,
l { 1 , 2 , 3 } , s { 1 , , N } .
With such results in mind, we have proven the following theorem.
Theorem 3.1.
Considering the notations and definitions in this section, in particular for
E i n = 1 2 Ω H i j k l e i j ( u ) e k l ( u ) d x
and
e i j ( u ) = 1 2 u i x j + u j x i + u m , i u m , j ,
we have
u l ( x s ) = d E i n d P l s + d d P l s 1 2 σ i j ( u ( f , P ) ) ( u m ( f , P ) ) , i ( u m ( f , P ) ) , j L 2 = d d P l s 1 2 Ω H i j k l e i j ( u ( f , P ) ) e k l ( u ( f , P ) ) d x + d d P l s 1 2 σ i j ( u ( f , P ) ) ( u m ( f , P ) ) , i ( u m ( f , P ) ) , j L 2 ,
l { 1 , 2 , 3 } , s { 1 , , N } .

3.1. A Numerical Example Related to the Castigliano Theorem

Let Ω = [ 0 , 1 ] R be the axis of a straight beam with a rectangular cross section of dimensions b × h , where units in this subsections refer to the international system.
Let
I = b h 3 12
and denote by E > 0 the Young modulus for a steel beam.
Assume such a beam is subject to a vertical load P > 0 uniformly distributed on Ω .
Assume also the beam is clamped at x = 0 and simply supported at x = 1 .
Denoting by w V = W 2 , 2 ( Ω ) the vertical field of displacements results from the action of P, the related boundary conditions are given by
w ( 0 ) = w ( 1 ) = 0 , w , x ( 0 ) = 0 , w , x x ( 1 ) = 0 .
The total beam energy is defined by J : V R , where
J ( w ) = E I 2 Ω w , x x 2 d x Ω P w d x .
In order to apply the results of the previous section, we free the rotations of the beam at x = 0 , considering a moment load M on x = 0 with general work
M w , x ( 0 ) .
Hence, we define
V 1 = { w W 2 , 2 ( Ω ) : w ( 0 ) = w ( 1 ) = w , x x ( 1 ) = 0 } ,
and define J 1 : V 1 R , by
J 1 ( w ) = E I 2 Ω w , x x 2 d x Ω P w d x M w , x ( 0 ) .
We emphasize again, through the methods of the previous section, we intend to obtain the value of M which corresponds to
w , x ( 0 ) = 0 .
Let φ W 2 , 2 . The variation
δ J 1 ( w ; φ ) ,
stands for
δ J 1 ( w ; φ ) = E I Ω w , x x φ , x x d x Ω P φ d x M φ , x ( 0 ) .
Here assuming, w is smooth enough, integrating by parts and recalling that φ V 1 , we obtain
δ J 1 ( w ; φ ) = E I Ω w , x x x x φ d x E I w , x x ( 0 ) φ , x ( 0 ) Ω P φ d x M φ , x ( 0 ) ,
so that the extremal condition
δ J 1 ( w ; φ ) = 0 , φ V 1 ,
provide us the following natural boundary condition
M = E I w , x x ( 0 )
and the equation
E I w , x x x x P = 0 , in Ω .
Here we recall the remaining essential boundary conditions,
w ( 0 ) = w ( 1 ) = w , x x ( 1 ) = 0 .
A particular solution of such an equation
E I w , x x x x P = 0 , in Ω
stands for
w p ( x ) = a x 4 ,
where
a = P 4 ! E I .
The concerning general solution stands for
w ( x ) = w p ( x ) + b x 3 + c x 2 + d x + e .
The boundary condition w ( 0 ) = 0 implies that e = 0 .
The boundary condition
M = E I w , x x ( 0 ) ,
stands for
M = E I ( 2 c ) ,
so that
c = M 2 E I .
Moreover, from
w , x x ( 1 ) = 0 ,
we obtain
12 a + 6 b + 2 c = 0 .
From
w ( 1 ) = 0
we have
a + b + c + d = 0 .
From these last two equations, we obtain
b = c 3 2 a b ( c ) ,
and
d = a 2 3 c .
From the Castigliano Theorem, M R must be such that
w x ( 0 ) = d E i n d M = 0 ,
where
E i n = E I 2 Ω w , x x 2 d x .
Observe that
d E i n d M = d E i n d c d c d M = 0 ,
so that in fact, it suffices to obtain,
d E i n d c = 0 .
Moreover, observe that
E i n = E I 2 Ω w , x x 2 d x = E I 2 0 1 ( 12 a x 2 + 6 b ( c ) x + 2 c ) 2 d x = E I 2 0 1 ( 144 a 2 x 4 + 36 b ( c ) 2 x 2 + 4 c 2 + 144 a b ( c ) x 3 + 48 a c x 2 + 24 x b ( c ) c ) d x = E I 2 144 a 2 x 5 5 + 36 b ( c ) 2 x 3 3 + 4 c 2 x + 144 a b ( c ) x 4 4 + 48 a c x 3 3 + 24 b ( c ) c x 2 2 0 1 = E I 2 144 a 2 5 + 36 b ( c ) 2 3 + 4 c 2 + 144 a b ( c ) 4 + 48 a c 3 + 24 c b ( c ) 2 .
Hence, we must have
d E i n d c = E i n b d b d c + E i n c = 0
so that,
12 ( 2 ) b ( c ) d b ( c ) d c + 8 c + 36 a d b ( c ) d c + 16 a + 12 b ( c ) + 12 c d b ( c ) d c = 0 .
Recalling that
d b ( c ) d c = 1 / 3 ,
we have got
8 b ( c ) + 8 c 12 a + 16 a + 12 b ( c ) 4 c = 0 .
From such a result and recalling that
b ( c ) = c 3 2 a ,
we have got
8 c 3 + 16 a 12 a + 16 a 4 c 24 a 4 c = 0 ,
so that
8 c 3 4 a = 0 ,
which has a solution
c = 3 a 2 .
Thus,
c = 3 P ( 2 ) 4 ! E I ,
so that
M = 2 c E I = 3 P 4 ! .

3.2. Checking This Last Result for M by Solving the Concerning Ordinary Differential Equation

In this section we check the result obtained for M by solving the following ODE,
E I w , x x x x P = 0 , in Ω = [ 0 , 1 ] ,
with the boundary conditions,
w ( 0 ) = w ( 1 ) = w , x ( 0 ) = w , x x ( 1 ) = 0 .
We recall the general solution stands for
w ( x ) = a x 4 + b x 3 + c x 2 + d x + e ,
where
a = P 4 ! E I .
From w ( 0 ) = 0 , we obtain e = 0 .
From w , x ( 0 ) = 0 , we obtain d = 0 .
From w ( 1 ) = 0 , we have
a + b + c = 0 .
From w x x ( 1 ) = 0 , we have
12 a + 6 b + 2 c = 0 .
From such results, we obtain
b = 5 a 2 ,
and
c = 3 a 2
Observe that
M ( x ) = E I w , x x ,
so that in particular
M = M ( 0 ) = E I w , x x ( 0 ) = E I ( 2 c ) = 3 P 4 ! .
This value for M here obtained coincide with the one obtained in the previous subsection, as expected.
The objective of this section is complete.

4. Conclusion

In this article, we have presented a formal proof Castigliano Theorem in a elasticity theory context.
We have also presented a generalization of such a result and a numerical example to exemplify its applicability.

Conflicts of Interest

The author declares no conflict of interest concerning this article.

References

  1. Botelho, F. A Formal Proof of Castigliano Theorem and a Related Generalization. Preprints 2025, 2025080573. [Google Scholar] [CrossRef]
  2. Botelho, F. A Formal Proof of Castigliano Theorem and a Related Generalization. Preprints 2025, 2025080573. [Google Scholar] [CrossRef]
  3. Beer F, Johnston E, DeWolf J and Mazurek D. Mechanics of Materials, 7th edition, Mc Graw Hill - Asia, 2015.
  4. Botelho FS. Duality Principles and Numerical Procedures for a Large Class of Non-Convex Models in the Calculus of Variations. Preprints 2023, 2023020051. [CrossRef]
  5. Botelho F. Functional Analysis and Applied Optimization in Banach Spaces, (Springer Switzerland, 2014).
  6. Botelho FS. Functional analysis, calculus of variations and numerical methods for models in physics and engineering. CRC Taylor and Francis, Florida - USA, 2020.
  7. Adams RA and Fournier JF. Sobolev Spaces, 2nd edn. (Elsevier, New York, 2003).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated