Submitted:
14 August 2024
Posted:
19 August 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Reproduction
- v0.0.1: This was the initial release of the KAN model, laying the groundwork with core functionalities.
- v0.0.5: Recognized as the most stable early version of KAN, addressing several early-stage bugs and performance issues.
- v0.1.0 and v0.2.0: These versions introduced significant improvements. One major change was in the model’s initialization process. Additionally, the method model.learn() was replaced with model.fit(), and support for multiplication was added. Originally, KAN handled multiplication by decomposition, e.g., . The introduction of a direct multiplication operator enhanced performance but introduced bugs in the addition operator.
- v0.2.2: Currently, this is the most stable version regarding the addition operator. This version is used in this paper.
- v0.2.3: The latest version, which still has unresolved bugs related to the addition operator.
3. KAN Preliminary
3.1. Steps to Generate Symbolic Output
4. Metrics
4.1. Performance Metrics
4.2. Mathematical Complexity
5. Results
5.1. Simple Functions
5.2. Analysis of Input Range
5.3. Complex Functions
6. Conclusion
References
- Liu, Z.; Wang, Y.; Vaidya, S.; Ruehle, F.; Halverson, J.; Soljačić, M.; Hou, T.Y.; Tegmark, M. Kan: Kolmogorov-arnold networks. arXiv 2024, arXiv:2404.19756. [Google Scholar] [CrossRef]
- Shukla, K.; Toscano, J.D.; Wang, Z.; Zou, Z.; Karniadakis, G.E. A comprehensive and fair comparison between mlp and kan representations for differential equations and operator networks. arXiv 2024, arXiv:2406.02917. [Google Scholar] [CrossRef]
- Yu, R.; Yu, W.; Wang, X. Kan or mlp: A fairer comparison. arXiv 2024, arXiv:2407.16674. [Google Scholar] [CrossRef]
- Kolmogorov, A. N. On the representation of continuous functions of many variables by superposition of continuous functions of one variable and addition. In Akademii Nauk; Russian Academy of Sciences, 1957; vol. 114, pp. 953–956. [Google Scholar]
- Hornik, K.; Stinchcombe, M.; White, H. Multilayer feedforward networks are universal approximators. Neural networks 1989, 2, 359–366. [Google Scholar] [CrossRef]


| Symbolic Function | Complexity | Symbolic Function | Complexity |
|---|---|---|---|
| x | 1 | 1 | |
| 2 | 2 | ||
| 3 | 3 | ||
| 4 | 4 | ||
| 5 | 5 | ||
| 3 | 3 | ||
| 2 | 2 | ||
| 3 | 3 | ||
| 2 | 2 | ||
| 3 | 3 | ||
| 4 | 4 | ||
| 4 | 4 | ||
| 0 | 0 | 3 |
| Func | Cplx | Metric: | Metric: -Complexity loss | ||||||
| -Avg | R@1 | R@3 | R@5 | -Avg | R@1 | R@3 | R@5 | ||
| exp(cocs+) | 6 | 0.9998 | 100 | 100 | 100 | -1.49 | 100 | 100 | 100 |
| exp(cos+) | 8 | 0.9999 | 100 | 100 | 100 | -1.63 | 100 | 100 | 100 |
| exp(cos+) | 10 | 0.9998 | 100 | 100 | 100 | -0.75 | 100 | 100 | 100 |
| cos(cos+) | 10 | 0.9999 | 100 | 100 | 100 | -1.43 | 100 | 100 | 100 |
| log(cos+) | 10 | 0.9999 | 100 | 100 | 100 | -1.43 | 100 | 100 | 100 |
| sqrt(cos+) | 10 | 0.9999 | 100 | 100 | 100 | 1.17 | 66.6 | 100 | 100 |
| tan(cos+) | 15 | 0.7499 | 0 | 33.3 | 33.3 | 1.44 | 0 | 33.3 | 33.3 |
| 1/(cos+) | 6 | 0.9999 | 100 | 100 | 100 | -1.69 | 100 | 100 | 100 |
| 1/(cos+)2 | 6 | 0.9995 | 100 | 100 | 100 | -0.96 | 66.6 | 100 | 100 |
| 1/(cos+) | 8 | 0.9997 | 100 | 100 | 100 | -1.13 | 100 | 100 | 100 |
| 1/(cos+)2 | 8 | 0.9988 | 33.3 | 100 | 100 | -0.46 | 33.3 | 100 | 100 |
| 1/(cos+) | 10 | 0.9999 | 100 | 100 | 100 | -1.03 | 66.6 | 100 | 100 |
| 1/(cos+)2 | 10 | 0.9982 | 33.3 | 33.3 | 66.6 | -0.14 | 33.3 | 66.6 | 66.6 |
| 1/(cos+)3 | 9 | 0.9992 | 33.3 | 66.6 | 100 | -0.49 | 33.3 | 66.6 | 100 |
| 1/(cos+)3 | 12 | 0.9987 | 0 | 66.6 | 100 | -0.11 | 0 | 66.6 | 100 |
| 1/(cos+)3 | 15 | 0.9968 | 0 | 0 | 66.6 | 0.25 | 0 | 0 | 66.6 |
| 1/(cos+) | 6 | 0.9977 | 0 | 33.3 | 100 | -0.37 | 0 | 33.3 | 100 |
| 1/(cos+)2 | 8 | 0.9896 | 0 | 66.6 | 66.6 | -0.12 | 0 | 66.6 | 66.6 |
| 1/(cos+)3 | 15 | 0.9968 | 0 | 0 | 66.6 | 0.25 | 0 | 0 | 66.6 |
| Func | Input Range | Metric: | |||
| -Avg | R@1 | R@3 | R@5 | ||
| exp(cocs+) | [0.01, 1] | 0.9999 | 100 | 100 | 100 |
| exp(cocs+) | [0.01, 2] | 0.9998 | 100 | 100 | 100 |
| exp(cocs+) | [0.01, 5] | 0.9974 | 66.6 | 100 | 100 |
| exp(cocs+) | [0.01, 10] | 0.9352 | 33.3 | 66.6 | 66.6 |
| exp(cocs+) | [0.01, 100] | nan | |||
| exp(cocs+) | [0.01, 10] + minmax | 0.9584 | 33.3 | 100 | 100 |
| exp(cocs+) | [0.5, 1] | 0.9990 | 66.6 | 66.6 | 66.6 |
| exp(cocs+) | [0.1, 1] | 0.9772 | 0 | 0 | 66.6 |
| exp(cocs+) | [0.01, 1] | nan | |||
| exp(cocs+) | [0.1, 1] + minmax | 0.6833 | 33.3 | 33.3 | 100 |
| exp(cocs+) | [0.1, 1] + minmax + shifted | 0.9469 | 33.3 | 33.3 | 66.6 |
| exp(cocs+) () | [0.1, 1] | 0.9949 | 33.3 | 33.3 | 66.6 |
| Func | Cplx | Metric: | |||
| -Avg | R@1 | R@3 | R@5 | ||
| log(exp + ) + exp( + cos) | 14 | 0.9656 | 16.7 | 50 | 66.6 |
| sqrt(exp + ) + exp( + cos) | 14 | 0.9653 | 16.7 | 33.3 | 66.6 |
| 1/(exp + ) + exp( + cos) | 14 | 0.9049 | 0 | 16.7 | 50 |
| 1/(exp + )2 + exp( + cos) | 14 | 0.8499 | 16.7 | 66.6 | 66.6 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).