Bayesian inference on group differences in multivariate categorical data
From MaRDI portal
Publication:1663099
DOI10.1016/J.CSDA.2018.04.010zbMATH Open1469.62135arXiv1606.09415OpenAlexW2963879461MaRDI QIDQ1663099FDOQ1663099
Massimiliano Russo, Daniele Durante, Bruno Scarpa
Publication date: 21 August 2018
Published in: Computational Statistics and Data Analysis (Search for Journal in Brave)
Abstract: Multivariate categorical data are common in many fields. We are motivated by election polls studies assessing evidence of changes in voters opinions with their candidates preferences in the 2016 United States Presidential primaries or caucuses. Similar goals arise routinely in several applications, but current literature lacks a general methodology which combines flexibility, efficiency, and tractability in testing for group differences in multivariate categorical data at different---potentially complex---scales. We address this goal by leveraging a Bayesian representation which factorizes the joint probability mass function for the group variable and the multivariate categorical data as the product of the marginal probabilities for the groups, and the conditional probability mass function of the multivariate categorical data, given the group membership. To enhance flexibility, we define the conditional probability mass function of the multivariate categorical data via a group-dependent mixture of tensor factorizations, thus facilitating dimensionality reduction and borrowing of information, while providing tractable procedures for computation, and accurate tests assessing global and local group differences. We compare our methods with popular competitors, and discuss improved performance in simulations and in American election polls studies.
Full work available at URL: https://arxiv.org/abs/1606.09415
Computational methods for problems pertaining to statistics (62-08) Bayesian inference (62F15) Contingency tables (62H17)
Cites Work
- Asymptotic Behaviour of the Posterior Distribution in Overfitted Mixture Models
- Title not available (Why is that?)
- Title not available (Why is that?)
- The log-linear group-lasso estimator and its asymptotic properties
- Simplex Factor Models for Multivariate Unordered Categorical Data
- Nonparametric Bayes Modeling of Multivariate Categorical Data
- Bayesian Modeling of Temporal Dependence in Large Sparse Contingency Tables
- Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem
- Bayesian Factorizations of Big Sparse Tensors
- Stochastic search variable selection for log-linear models
- Tensor decompositions and sparse log-linear models
- Categorical data fusion using auxiliary information
- Simultaneous factor analysis of dichotomous variables in several groups
- Modeling Clustered Ordered Categorical Data: A Survey
- Nonparametric Bayes modeling for case control studies with many predictors
- Shared kernel Bayesian screening
Cited In (4)
This page was built for publication: Bayesian inference on group differences in multivariate categorical data
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q1663099)