The multivariate Watson distribution: maximum-likelihood estimation and other aspects

DOI: 10.1016/J.JMVA.2012.08.010 | MaRDI QID: Q1931867

Publication date 16 January 2013

Published in Journal of Multivariate Analysis

Full work available at URL https://arxiv.org/abs/1104.4422

directional statistics; confluent hypergeometric; special function; hypergeometric identities; Watson distribution; Kummer function; diametrical clustering

Mathematics Subject Classification ID

Directional data; spatial statistics (62H11)
Classification and discrimination; cluster analysis (statistical aspects) (62H30)
Estimation in multivariate analysis (62H12)
Confluent hypergeometric functions, Whittaker functions, \({}_1F_1\) (33C15)
Applications of hypergeometric functions (33C90)

Abstract: This paper studies fundamental aspects of modelling data using multivariate Watson distributions. Although these distributions are natural for modelling axially symmetric data (i.e., unit vectors where \(\pm x\) are equivalent), using them in high dimensions can be difficult, largely because even basic tasks such as maximum-likelihood estimation are numerically challenging for Watson distributions. To tackle the numerical difficulties, some approximations have been derived, but these are either grossly inaccurate in high dimensions (Directional Statistics, Mardia & Jupp, 2000) or, when reasonably accurate (J. Machine Learning Research, W. & C.P., v2, Bijral et al., 2007, pp. 35–42), lack theoretical justification. We derive new approximations to the maximum-likelihood estimates; our approximations are theoretically well-defined, numerically accurate, and easy to compute. We build on our parameter estimation and discuss mixture-modelling with Watson distributions; here we uncover a hitherto unknown connection to the "diametrical clustering" algorithm of Dhillon et al. (Bioinformatics, 19(13), 2003, pp. 1612–1619).
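To make the numerical difficulty concrete, here is a minimal sketch of the exact likelihood equation that such approximations stand in for. It assumes the standard Watson density on the unit sphere in \(p\) dimensions, \(W_p(x; \mu, \kappa) \propto \exp(\kappa (\mu^\top x)^2)\), whose normaliser involves the Kummer function \(M(1/2, p/2, \kappa)\). For a fixed mean axis \(\mu\) and sample scatter matrix \(S\), the maximum-likelihood concentration \(\kappa\) then solves \(M'(1/2, p/2, \kappa) / M(1/2, p/2, \kappa) = \mu^\top S \mu\). The sketch solves this by generic root finding with SciPy; it is not the closed-form approximation the paper derives. The function names `kummer_ratio` and `solve_kappa` and the bracket `[-200, 200]` are illustrative assumptions.

```python
from scipy.optimize import brentq
from scipy.special import hyp1f1


def kummer_ratio(kappa, p):
    """Return M'(a, c, kappa) / M(a, c, kappa) with a = 1/2, c = p/2.

    Uses M'(a, c, x) = (a / c) * M(a + 1, c + 1, x). For negative kappa,
    Kummer's transformation M(a, c, -x) = exp(-x) * M(c - a, c, x) is applied
    to numerator and denominator so that hyp1f1 is only evaluated at
    non-negative arguments (the exponential factors cancel in the ratio).
    """
    a, c = 0.5, p / 2.0
    if kappa >= 0:
        return (a / c) * hyp1f1(a + 1.0, c + 1.0, kappa) / hyp1f1(a, c, kappa)
    x = -kappa
    return (a / c) * hyp1f1(c - a, c + 1.0, x) / hyp1f1(c - a, c, x)


def solve_kappa(r, p, lo=-200.0, hi=200.0):
    """Solve kummer_ratio(kappa, p) = r for kappa, where r = mu^T S mu.

    The ratio increases monotonically in kappa, so a bracketing root finder
    is sufficient. The bracket [lo, hi] is an illustrative assumption;
    brentq raises ValueError if r is not attained inside it.
    """
    return brentq(lambda k: kummer_ratio(k, p) - r, lo, hi)


if __name__ == "__main__":
    p = 3
    r = kummer_ratio(5.0, p)      # ratio produced by a known concentration
    print(solve_kappa(r, p))      # should recover a value close to 5.0
```

Each evaluation of the ratio requires two hypergeometric function calls, and the function values grow roughly like \(e^{\kappa}\), which is one reason a cheap and accurate approximation is attractive for large \(p\) or large \(|\kappa|\).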

Cited in (19)
