Mitigating the effect of dataset shift in clustering

Maldonado, Sebastián; Saltos, Ramiro; Vairetti, Carla; Delpiano, José

dc.coverage	DOI: 10.1016/j.patcog.2022.109058
dc.creator	Maldonado, Sebastián
dc.creator	Saltos, Ramiro
dc.creator	Vairetti, Carla
dc.creator	Delpiano, José
dc.date	2023
dc.date.accessioned	2025-11-18T19:48:48Z
dc.date.available	2025-11-18T19:48:48Z
dc.description	Dataset shift is a relevant topic in unsupervised learning since many applications face evolving environments, causing an important loss of generalization and performance. Most techniques that deal with this issue are designed for data stream clustering, whose goal is to process sequences of data efficiently under Big Data. In this study, we claim dataset shift is an issue for static clustering tasks in which data is collected over a long period. To mitigate it, we propose Time-weighted kernel k-means, a k-means variant that includes a time-dependent weighting process. We do this via the induced ordered weighted average (IOWA) operator. The weighting process acts as a gradual forgetting mechanism, prioritizing recent examples over outdated ones in the clustering algorithm. The computational experiments show the potential Time-weighted kernel k-means has in evolving environments.	eng
dc.identifier	https://investigadores.uandes.cl/en/publications/97442da3-1a74-476a-8c78-b539169d1a85
dc.identifier.uri	https://repositorio.uandes.cl/handle/uandes/55745
dc.language	eng
dc.rights	info:eu-repo/semantics/restrictedAccess
dc.source	vol.134 (2023) p.109058
dc.subject	Clustering
dc.subject	Dataset shift
dc.subject	Induced ordered weighted average
dc.subject	Kernel k-means
dc.subject	OWA operators
dc.title	Mitigating the effect of dataset shift in clustering	eng
dc.type	Article	eng
dc.type	Artículo	spa
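
The abstract describes a kernel k-means variant in which a time-induced weighting favors recent examples. The following is a minimal sketch of that general idea, not the authors' implementation. It assumes a geometric weight vector assigned to examples in the order induced by their timestamps (newest first), an RBF kernel, and a standard weighted kernel k-means update. The function names (`iowa_time_weights`, `rbf_kernel`, `weighted_kernel_kmeans`) and the parameters `decay` and `gamma` are hypothetical and do not come from the record.

```python
import numpy as np


def iowa_time_weights(timestamps, decay=0.9):
    """Assign weights by position in the time-induced order (newest first).

    Assumption: geometric forgetting, w = decay**rank, normalized to sum to 1.
    The paper's actual weight vector may differ.
    """
    timestamps = np.asarray(timestamps, dtype=float)
    order = np.argsort(-timestamps, kind="stable")  # indices, newest first
    ranks = np.empty(len(timestamps), dtype=int)
    ranks[order] = np.arange(len(timestamps))       # rank 0 = most recent
    w = decay ** ranks
    return w / w.sum()


def rbf_kernel(X, gamma=1.0):
    """Gaussian (RBF) kernel matrix for the rows of X."""
    sq = np.sum(X**2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * X @ X.T
    return np.exp(-gamma * np.maximum(d2, 0.0))


def weighted_kernel_kmeans(K, w, k, n_iter=50, seed=0):
    """Weighted kernel k-means on a precomputed kernel matrix K.

    Each cluster center is the w-weighted mean of its members in feature
    space, so heavily weighted (recent) examples pull the centers more.
    """
    rng = np.random.default_rng(seed)
    n = K.shape[0]
    labels = rng.integers(0, k, size=n)  # random initial assignment
    for _ in range(n_iter):
        dist = np.full((n, k), np.inf)
        for c in range(k):
            mask = labels == c
            wc = w[mask]
            s = wc.sum()
            if s == 0:
                continue  # empty cluster: keep its distance at infinity
            cross = K[:, mask] @ wc / s
            within = wc @ K[np.ix_(mask, mask)] @ wc / s**2
            # Squared feature-space distance to the weighted cluster mean.
            dist[:, c] = np.diag(K) - 2.0 * cross + within
        new_labels = dist.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels
```

A typical call under these assumptions would be `labels = weighted_kernel_kmeans(rbf_kernel(X), iowa_time_weights(t), k=3)`, where `X` holds the examples and `t` their collection times. A `decay` closer to 1 forgets more slowly. Because the initial assignment is random, results depend on `seed`.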

Collections

PURE
