Data di Pubblicazione:
2021
Citazione:
(2021). Model-based clustering clustering for categorical data via Hamming distance . Retrieved from http://hdl.handle.net/10446/194004
Abstract:
In this work a model-based approach for clustering categorical data with no natural ordering is introduced. The proposed method exploits the Hamming distance to define a family of probability mass functions to model categorical data. The elements of this family are considered as kernels of a finite mixture model with unknown number of components. Fully Bayesian inference is provided using a sampling strategy based on a trans-dimensional blocked Gibbs-sampler, facilitating computation with respect to the customary reversible-jump algorithm. Model performances are assessed via a simulation study, showing improvements both in terms of prediction and estimation, with respect to existing approaches. Finally, our method is illustrated with application to reference datasets.
Tipologia CRIS:
1.4.02 Abstract in atti di convegno - Conference abstracts
Elenco autori:
Argiento, Raffaele; Filippi-Mazzola, Edoardo; Paci, Lucia
Link alla scheda completa:
Link al Full Text:
Titolo del libro:
CLADAG 2021: Book of abstracts and short papers, 3th Scientific Meeting of the Classification and Data Analysis Group - Firenze, September 9-11, 2021
Pubblicato in: