Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Kernel Complexity for Nonparametric Distributions and Detection of Its Changes

Version 1 : Received: 28 November 2019 / Approved: 3 December 2019 / Online: 3 December 2019 (03:34:19 CET)

How to cite: Hirai, S.; Yamanishi, K. Kernel Complexity for Nonparametric Distributions and Detection of Its Changes. Preprints 2019, 2019110381. https://doi.org/10.20944/preprints201911.0381.v1 Hirai, S.; Yamanishi, K. Kernel Complexity for Nonparametric Distributions and Detection of Its Changes. Preprints 2019, 2019110381. https://doi.org/10.20944/preprints201911.0381.v1

Abstract

This paper addresses the issues of how we can quantify structural information for nonparametric distributions and how we can detect its changes. Structural information refers to an index for a global understanding of a data distribution. When we consider the problem of clustering using a parametric model such as a Gaussian mixture model, the number of mixture components (clusters) can be thought of as structural information in the model. However, there does not exist any notion of structural information for nonparametric modeling of data. In this paper we introduce a novel notion of {\em kernel complexity} (KC) as structural information in the nonparametric setting. The key idea of KC is to combine the information bias inspired by the Gini index with the information quantity measured in terms of the normalized maximum likelihood (NML) code length. We empirically show that KC has a property similar to the number of clusters in a parametric model. We further propose a framework for structural change detection with KC in nonparametric distributions. With synthetic and real data sets we empirically demonstrate that our framework enables us to detect structural changes underlying the data and their early warning signals.

Keywords

change-point detection; kernel density estimation; normalized maximum likelihood (NML); minimum description length (MDL) principle; kernel complexity (KC)

Subject

Computer Science and Mathematics, Probability and Statistics

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.