A latent class distance association model for cross-classified data with a categorical response variable
Source: The British Journal of Mathematical & Statistical Psychology, 67(3), 2014, pp. 514-540
Article / Letter to editor
Subject: Radboudumc 15: Urological cancers; RIHS: Radboud Institute for Health Sciences
In this paper we propose a latent class distance association model for clustering in the predictor space of large contingency tables with a categorical response variable. The rows of such a table are characterized as profiles of a set of explanatory variables, while the columns represent a single outcome variable. In many cases such tables are sparse, with many zero entries, which makes traditional models problematic. By clustering the row profiles into a few specific classes and representing these together with the categories of the response variable in a low-dimensional Euclidean space using a distance association model, a parsimonious prediction model can be obtained. A generalized EM algorithm is proposed to estimate the model parameters and the adjusted Bayesian information criterion statistic is employed to test the number of mixture components and the dimensionality of the representation. An empirical example highlighting the advantages of the new approach and comparing it with traditional approaches is presented.
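The abstract gives no formulas, so the following is only a minimal sketch of the idea under stated assumptions. It assumes that, as is common in distance association models, the probability of a response category given a latent class is proportional to exp(-squared Euclidean distance) between the class point and the category point, and that the response counts of a row profile are multinomial. All function names, coordinates, and weights are hypothetical illustrations, not the authors' implementation; only an E-step is shown, and the M-step that updates the coordinates is omitted.

```python
import math

def response_probs(class_point, category_points):
    """P(response category | latent class), assumed proportional to
    exp(-squared Euclidean distance) between the two points."""
    sq_dists = [sum((a - b) ** 2 for a, b in zip(class_point, y))
                for y in category_points]
    weights = [math.exp(-d) for d in sq_dists]
    total = sum(weights)
    return [w / total for w in weights]

def class_posteriors(row_counts, class_weights, class_points, category_points):
    """E-step for one row profile: posterior probability of each latent
    class given the profile's observed response counts."""
    log_post = []
    for w, x in zip(class_weights, class_points):
        probs = response_probs(x, category_points)
        # Multinomial log-likelihood kernel; zero counts contribute nothing.
        log_lik = sum(n * math.log(p) for n, p in zip(row_counts, probs))
        log_post.append(math.log(w) + log_lik)
    # Normalise on the log scale for numerical stability.
    m = max(log_post)
    unnorm = [math.exp(v - m) for v in log_post]
    s = sum(unnorm)
    return [u / s for u in unnorm]

# Hypothetical two-dimensional configuration: 2 classes, 3 response categories.
class_points = [(0.0, 0.0), (2.0, 0.0)]
category_points = [(0.0, 0.5), (2.0, 0.5), (1.0, -2.0)]
class_weights = [0.5, 0.5]

probs_class0 = response_probs(class_points[0], category_points)
# A sparse row profile with a zero entry in the second response category.
posterior = class_posteriors([5, 0, 1], class_weights,
                             class_points, category_points)
```

In this sketch a class point lying close to a category point gives that category a high predicted probability, which is how a small number of classes and dimensions can stand in for a large, sparse table.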