Submitted:
31 July 2024
Posted:
05 August 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Differential Entropy, Mutual Information and Entropy Rate: Definitions and Notation
3. Agent Learning and Mutual Information Gain
4. Minimum Error Entropy
5. Log Ratio of Entropy Powers
6. Minimum Mean Squared Error (MMSE) Estimation
6.1. MMSE Smoothing
6.2. MMSE Prediction
6.3. MMSE Filtering
7. Entropy Power and MSE
8. Fixed Lag Smoothing Example
9. Properties and Families (Classes) of Probability Densities
9.1. Properties
9.2. Families or Classes
9.2.1. Location-Scale Family
9.2.2. Exponential Family
10. Discussion
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| i.i.d. | independent and identically distributed |
| MMSE | minimum mean squared error |
| Q | entropy power |
| MMSPE(M) | minimum mean squared prediction error of order M |
| MSE | mean squared error |
| MI Gain | mutual information gain |
References
- Wiener, N. Extrapolation, interpolation, and smoothing of stationary time series: with engineering applications; MIT Press, 1949.
- Ljung, L.; Soderstrom, T. Theory and Practice of Recursive Identification; MIT Press, 1983.
- Haykin, S. Adaptive Filter Theory; Prentice-Hall, 2002.
- Honig, M.L.; Messerschmitt, D.G. Adaptive filters: structures, algorithms, and applications; Kluwer Academic Publishers: Hingham, MA, 1984. [Google Scholar]
- Tishby, N.; Zaslavsky, N. Deep Learning and the Information Bottleneck Principle. CoRR 2015. [Google Scholar] [CrossRef]
- Crutchfield, J.P.; Feldman, D.P. Synchronizing to the environment: Information-theoretic constraints on agent learning. Advances in Complex Systems 2001, 4, 251–264. [Google Scholar] [CrossRef]
- Crutchfield, J.P.; Feldman, D.P. Regularities unseen, randomness observed: Levels of entropy convergence. Chaos: An Interdisciplinary Journal of Nonlinear Science 2003, 13, 25–54. [Google Scholar] [CrossRef] [PubMed]
- Gibson, J.D. Mutual Information Gain and Linear/Nonlinear Redundancy for Agent Learning, Sequence Analysis and Modeling. Entropy 2020, 22, 608–624. [Google Scholar] [CrossRef] [PubMed]
- Cover, T.M.; Thomas, J.A. Elements of Information Theory; Wiley-Interscience, 2006.
- Shannon, C.E. A mathematical theory of communication. Bell Sys. Tech. Journal 1948, 27, 379–423. [Google Scholar] [CrossRef]
- Kalata, P.; Priemer, R. Linear prediction, filtering, and smoothing: An information-theoretic approach. Information Sciences 1979, 17, 1–14. [Google Scholar] [CrossRef]
- Kalata, P.; Priemer, R. On minimal error entropy stochastic approximation. Int. Journal of Systems Sciences 1974, 5, 895–906. [Google Scholar] [CrossRef]
- Kalata, P.R.; Priemer, R. When should smoothing cease? Proceedings of the IEEE 1974, 62, 1289–1290. [Google Scholar] [CrossRef]
- Gibson, J.D. Log Ratio of Entropy Powers. Proc. UCSD Information Theory and Applications, 2018.
- Shynk, J.J. Probability, random variables, and random processes: theory and signal processing applications; John Wiley & Sons, 2012.
- Hudson, J.E. Signal Processing Using Mutual Information. IEEE Signal Processing Magazine 2006, 23, 50–54. [Google Scholar] [CrossRef]
- Kraskov, A.; Stogbauer, A.; Grassberger, P. Estimating Mutual Information. Physical Review: E 2004, 69, 006138–1. [Google Scholar] [CrossRef] [PubMed]
- Chirarattananon, S.; Anderson, B. The Fixed-Lag Smoother as a Stable, Finite-Dimensional Linear Filter. Automatica 1971, 7, 657–669. [Google Scholar] [CrossRef]
- Gibson, J.D.; Bhaskaranand, M. Performance improvement with decoder output smoothing in differential predictive coding. Proc. UCSD Information Theory and Applications, 2014.
- Lukacs, E. Characteristic Functions; Griffin London, 1970.
- Lehmann, E.L. Testing Statistical Hypotheses; John Wiley & Sons, Inc., 1986.
- Lehmann, E.L. Theory of Point Estimation; John Wiley & Sons, Inc., 1983.
- Hogg, R.V.; Craig, A.T. Intorduction to Mathematical Statistics; Macmillan, 1970.
| L | Incremental MI Gain | Total MI Gain | |
| 1 | 0.2485 | 0.1145 | 0.1145 |
| 2 | 0.2189 | 0.0633 | 0.1748 |
| 3 | 0.2033 | 0.0369 | 0.2117 |
| 4 | 0.1950 | 0.0209 | 0.2326 |
| 5 | 0.1906 | 0.01235 | 0.24795 |
| 15 | 0.1857 | 0.00255 | 0.2505 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).