Autores
Yáñez Márquez Cornelio
Título HICCS: Hybrid Instance Clustering Based on Compact Sets
Tipo Congreso
Sub-tipo Memoria
Descripción 13th International Conference on Software Process Improvement, CIMPS 2024
Resumen Data clustering is a widely used technique in Pattern Recognition, Artificial Intelligence and Data Mining. It has been shown to be useful in many practical domains such as text mining, bioinformatics, image segmentation and wireless sensor networks. Recently, there has been a growing emphasis on clustering mixed (or hybrid) as well as incomplete datasets. In this paper we introduce HICCS, a hierarchical clustering for hybrid and incomplete data. Our proposal has several novel characteristics, because it uses compact sets as initial groups instead as single objects, it uses the similarity between cluster representatives as inter-cluster dissimilarity, avoiding additional computations and incorporates a merging strategy to evade order-dependencies. By comparing our proposal with other existing algorithms, we argue that HICCS is one of the best available clustering methods for handling mixed and incomplete datasets. We note that HICCS estimates the true partitions of data, having a significant better performance with respect to other algorithms. It can improve the cluster quality and applicability. The effectiveness of HICCS is shown for seven artificial data sets of varying complexities and fifteen real-life data sets with hybrid and incomplete data. © 2024 IEEE.
Observaciones DOI 10.1109/CIMPS65195.2024.11095964
Lugar Mérida
País Mexico
No. de páginas 284-291
Vol. / Cap.
Inicio 2024-10-16
Fin 2024-10-18
ISBN/ISSN 9798331510862