Codice23/2014
TitoloCovariance Based Unsupervised Classification in Functional Data Analysis
Data2014-06-13
Autore/iIeva, F., Paganoni, A.M., Tarabelloni, N.
LinkDownload full text
PubblicatoAccepted for publication: "Journal of Machine Learning Research" (2016)
AbstractIn this paper we propose a new algorithm to perform unsupervised classification of multivariate and functional data when the difference between the two populations lies in their covariances, rather than in their means. The algorithm relies on a proper quantification of distance between the estimated covariance operators of the populations, and identifies as clusters those groups maximising the distance between their induced covariances. The naive implementation of such an algorithm is computationally forbidding, so we propose an heuristic formulation with a much lighter complexity and we study its convergence properties, along with its computational cost. We also propose to use an enhanced estimator for the estimation of discrete covariances of functional data, namely a linear shrinkage estimator, in order to improve the precision of the classification. We establish the effectiveness of our algorithm through applications to both synthetic data and a real dataset coming from a biomedical context, showing also how the use of shrinkage estimation may lead to substantially better results.