Publication Date:
2021
Short description:
(2021). Model-based clustering clustering for categorical data via Hamming distance . Retrieved from http://hdl.handle.net/10446/194004
abstract:
In this work a model-based approach for clustering categorical data with no natural ordering is introduced. The proposed method exploits the Hamming distance to define a family of probability mass functions to model categorical data. The elements of this family are considered as kernels of a finite mixture model with unknown number of components. Fully Bayesian inference is provided using a sampling strategy based on a trans-dimensional blocked Gibbs-sampler, facilitating computation with respect to the customary reversible-jump algorithm. Model performances are assessed via a simulation study, showing improvements both in terms of prediction and estimation, with respect to existing approaches. Finally, our method is illustrated with application to reference datasets.
Iris type:
1.4.02 Abstract in atti di convegno - Conference abstracts
List of contributors:
Argiento, Raffaele; Filippi-Mazzola, Edoardo; Paci, Lucia
Book title:
CLADAG 2021: Book of abstracts and short papers, 3th Scientific Meeting of the Classification and Data Analysis Group - Firenze, September 9-11, 2021
Published in: