ICEB 2004 Proceedings (Beijing, China)

A New Clustering Algorithm for Categorical Attributes

Chunbin Tang, Management School, Fudan University, Shanghai 200433, ChinaFollow
Weidong Zhao, Software School, Fudan University, Shanghai 200433, ChinaFollow

Document Type

Article

Abstract

Clustering over categorical attributes is an important yet tough task. In this paper, we present a new algorithm K-meansⅡ to extend the famous K-means algorithm which is efficient only on numerical clustering, by using new cluster center definitions and new similarity measures. Thus, our algorithm can be used in categorical clustering while preserving the efficiency. Experiments on both real-life datasets and synthetic datasets show that the K-meansⅡ algorithm can produce high quality results and deserve good scalability at the same time.

Recommended Citation

Tang, Chunbin and Zhao, Weidong, "A New Clustering Algorithm for Categorical Attributes" (2004). ICEB 2004 Proceedings (Beijing, China). 219.
https://aisel.aisnet.org/iceb2004/219

Download

COinS

ICEB 2004 Proceedings (Beijing, China)

A New Clustering Algorithm for Categorical Attributes

Document Type

Abstract

Recommended Citation

Search

Links

Browse

Author Corner

Links

ICEB 2004 Proceedings (Beijing, China)

A New Clustering Algorithm for Categorical Attributes

Authors

Document Type

Abstract

Recommended Citation

Share

Search

Links

Browse

Author Corner

Links