Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification

Ahmad, N. (2010) Cluster identification and separation in the growing self-organizing map: Application in protein sequence classification. Neural Computing and Applications, 19 (4). pp. 531-542. ISSN 0941-0643

[img] Text
Norashikin_JournalOfNeuralComputing.pdf - Published Version
Restricted to Registered users only

Download (604kB)

Abstract

Growing self-organizing map (GSOM) has been introduced as an improvement to the self-organizing map (SOM) algorithm in clustering and knowledge discovery. Unlike the traditional SOM, GSOM has a dynamic structure which allows nodes to grow reflecting the knowledge discovered from the input data as learning progresses. The spread factor parameter (SF) in GSOM can be utilized to control the spread of the map, thus giving an analyst a flexibility to examine the clusters at different granularities. Although GSOM has been applied in various areas and has been proven effective in knowledge discovery tasks, no comprehensive study has been done on the effect of the spread factor parameter value to the cluster formation and separation. Therefore, the aim of this paper is to investigate the effect of the spread factor value towards cluster separation in the GSOM. We used simple k-means algorithm as a method to identify clusters in the GSOM. By using Davies-Bouldin index, clusters formed by different values of spread factor are obtained and the resulting clusters are analyzed. In this work, we show that clusters can be more separated when the spread factor value is increased. Hierarchical clusters can then be constructed by mapping the GSOM clusters at different spread factor values. © 2009 Springer-Verlag London Limited.

Item Type: Article
Uncontrolled Keywords: Cluster identification; Cluster separation; Unsupervised neural networks; Dynamic self-organizing map; Protein sequence classification
Subjects: Q Science > Q Science (General)
Divisions: Faculty of Information and Communication Technology > Department of Software Engineeering
Depositing User: Dr. Norashikin Ahmad
Date Deposited: 08 Aug 2011 04:35
Last Modified: 19 Sep 2021 18:04
URI: http://eprints.utem.edu.my/id/eprint/87
Statistic Details: View Download Statistic

Actions (login required)

View Item View Item