Submitted:
09 May 2026
Posted:
09 May 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Methodology and Training Setups
2.1. Network Architectures
- 1.
- Vanilla MLP: The original MLP with no additional structure.
- 2.
- Vanilla MLP (match): A widened MLP matching the parameter count of Transponder-NS, used as a parameter-matched baseline.
- 3.
- CMG layer-wise: An MLP where the standard tanh activation is replaced by explicit CMG-GLLF. All neurons in the same hidden layer share one CMG curve, adding only two trainable parameters per hidden layer: the inflection rate and the deviate inflection point I.
- 4.
- Transponder-NS: An MLP with transponder modulation using both node-wise and scalar-wise modulators. The modulator hidden dimension is 8. Scalar-wise modulation provides layer-level control, while node-wise modulation provides channel-level adjustment of hidden representations.
- 5.
- Transponder-S: An MLP with transponder modulation using scalar-wise modulation only.
- 6.
- CMG-TNS: An MLP combining layer-wise CMG activation with Transponder-NS modulation.
- 7.
- CMG-TS: An MLP combining layer-wise CMG activation with Transponder-S modulation.
- 8.
- PirateNet: A physics-informed residual adaptive network using global U and V modulators and adaptive residual connections controlled by a trainable parameter initialized to zero [4].
2.2. Optimization Process
| Task | Vanilla MLP structure | Adam iters | L-BFGS iters | Learning rates |
|---|---|---|---|---|
| Burgers | [2, 20, 20, 20, 1] | 15,000 | 15,000 | 2e-3, 1e-3, 5e-4 |
| Allen-Cahn | [2, 20, 20, 20, 1] | 40,000 | 15,000 | 2e-3, 1e-3, 5e-4 |
| Diffusion-Reaction | [2, 30, 30, 30, 1] | 20,000 | None | 2e-3, 1e-3, 5e-4 |
3. Results
3.1. Derivation of the Explicit CMG-Newton Formulation
3.2. Empirical Validation of CMG-Explicit
| Method | Mean relative L2 error |
|---|---|
| CMG Node-wise | 6.34E-03 |
| CMG Layer-wise | 8.83E-03 |
| Tanh | 2.10E-02 |
3.3. Evaluation of Transponder and CMG on Different PINN Tasks
3.4. Explainability of CMG and Transponder Design
4. Conclusion
A. Technical Appendices
A.1. Detailed Derivation of the CMG-Explicit Formulation
Normalization.
Logistic-phase expression.
Inverse problem.
One-step Newton approximation.
Final expression.
References
- Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-Informed Neural Networks: A Deep Learning Framework for Solving Forward and Inverse Problems Involving Nonlinear Partial Differential Equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
- Cuomo, S.; Di Cola, V.S.; Giampaolo, F.; Rozza, G.; Raissi, M.; Piccialli, F. Scientific Machine Learning Through Physics-Informed Neural Networks: Where We Are and What’s Next. J. Sci. Comput. 2022, 92, 88. [Google Scholar] [CrossRef]
- Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics-Informed Machine Learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
- Wang, S.; Li, B.; Chen, Y.; Perdikaris, P. PirateNets: Physics-Informed Deep Learning with Residual Adaptive Networks. J. Mach. Learn. Res. 2024, 25, 1–51. [Google Scholar]
- Rishi, J.; Gafoor, A.; Kumar, S.; Subramani, D. On the Training Efficiency of Shallow Architectures for Physics Informed Neural Networks. In Proceedings of the Computational Science – ICCS 2024; Springer, 2024; Vol. 14834, Lecture Notes in Computer Science; pp. 363–377. [Google Scholar] [CrossRef]
- Hu, W.F.; Shih, Y.J.; Lin, T.S.; Lai, M.C. A Shallow Physics-Informed Neural Network for Solving Partial Differential Equations on Static and Evolving Surfaces. Comput. Methods Appl. Mech. Eng. 2024, 418, 116486. [Google Scholar] [CrossRef]
- Gu, W.; Zhang, Y.; Muscoloni, A.; Cannistraci, C.V. A Generalized Logistic-Logit Function and Its Application to Multi-Layer Perceptron and Neuron Segmentation, 2025. Preprint. [CrossRef]
- Zhang, Y.; Gu, W.; Hu, W.; Li, J.; Cannistraci, C.V. From Transformer to TRANSPONDER: Introducing Contextual Modulation Training for Residual Learning in LLMs, 2025. Preprint, version 2. 30 September 2025. [Google Scholar] [CrossRef]
- Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. In Proceedings of the International Conference on Learning Representations, 2015; p. 1412.6980. [Google Scholar] [CrossRef]
- Nocedal, J. Updating Quasi-Newton Matrices with Limited Storage. Math. Comput. 1980, 35, 773–782. [Google Scholar] [CrossRef]
- Lu, L.; Meng, X.; Mao, Z.; Karniadakis, G.E. DeepXDE: A Deep Learning Library for Solving Differential Equations. SIAM Rev. 2021, 63, 208–228. [Google Scholar] [CrossRef]
- Kelley, C.T. Solving Nonlinear Equations with Newton’s Method; Society for Industrial and Applied Mathematics: Philadelphia, PA, 2003. [Google Scholar] [CrossRef]

| Method | Mean relative L2 error |
|---|---|
| CMG-Explicit | 6.34E-03 |
| CMG-Implicit | 1.54E-01 |
| Tanh | 2.10E-02 |
| Method | Burgers | Allen-Cahn | Diff-Reaction | Mean Rank | Worst Rank | #Param (B/AC) | #Param (DR) |
|---|---|---|---|---|---|---|---|
| Vanilla MLP | 2.10E-02 | 1.68E-02 | 3.90E-03 | 6 | 8 | 921 | 1981 |
| Vanilla MLP (match) | 1.48E-02 | 1.18E-02 | 7.66E-03 | 6.67 | 8 | 2241 | 3781 |
| CMG layer-wise | 8.83E-03 | 1.36E-02 | 6.69E-03 | 6 | 7 | 927 | 1987 |
| Transponder-NS | 1.43E-03 | 2.41E-03 | 1.66E-03 | 1.67 | 2 | 2211 | 3861 |
| Transponder-S | 4.12E-03 | 8.83E-03 | 5.12E-03 | 4.67 | 6 | 1311 | 2531 |
| CMG-TNS | 1.46E-03 | 1.82E-03 | 3.90E-03 | 2.33 | 4 | 2217 | 3867 |
| CMG-TS | 1.14E-02 | 6.04E-03 | 4.78E-03 | 4.67 | 5 | 1317 | 2537 |
| PirateNet | 3.46E-02 | 5.21E-03 | 7.83E-04 | 4 | 8 | 2202 | 3912 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.