To read this content please select one of the options below:

A new cluster validity index using maximum cluster spread based compactness measure

M. Arif Wani (Postgraduate Department of Computer Science, University of Kashmir, Srinagar, India)
Romana Riyaz (Postgraduate Department of Computer Science, University of Kashmir, Srinagar, India)

International Journal of Intelligent Computing and Cybernetics

ISSN: 1756-378X

Article publication date: 13 June 2016

612

Abstract

Purpose

The most commonly used approaches for cluster validation are based on indices but the majority of the existing cluster validity indices do not work well on data sets of different complexities. The purpose of this paper is to propose a new cluster validity index (ARSD index) that works well on all types of data sets.

Design/methodology/approach

The authors introduce a new compactness measure that depicts the typical behaviour of a cluster where more points are located around the centre and lesser points towards the outer edge of the cluster. A novel penalty function is proposed for determining the distinctness measure of clusters. Random linear search-algorithm is employed to evaluate and compare the performance of the five commonly known validity indices and the proposed validity index. The values of the six indices are computed for all nc ranging from (nc min, nc max) to obtain the optimal number of clusters present in a data set. The data sets used in the experiments include shaped, Gaussian-like and real data sets.

Findings

Through extensive experimental study, it is observed that the proposed validity index is found to be more consistent and reliable in indicating the correct number of clusters compared to other validity indices. This is experimentally demonstrated on 11 data sets where the proposed index has achieved better results.

Originality/value

The originality of the research paper includes proposing a novel cluster validity index which is used to determine the optimal number of clusters present in data sets of different complexities.

Keywords

Citation

Wani, M.A. and Riyaz, R. (2016), "A new cluster validity index using maximum cluster spread based compactness measure", International Journal of Intelligent Computing and Cybernetics, Vol. 9 No. 2, pp. 179-204. https://doi.org/10.1108/IJICC-02-2016-0006

Publisher

:

Emerald Group Publishing Limited

Copyright © 2016, Emerald Group Publishing Limited

Related articles