Adaptive double self-organizing maps for clustering gene expression profiles

H. Ressom, D. Wang, P. Natarajan

Research output: Contribution to journalArticlepeer-review

29 Scopus citations

Abstract

This paper introduces a new model of self-organizing map (SOM) known as adaptive double self-organizing map (ADSOM). ADSOM has a flexible topology and performs data partitioning and cluster visualization simultaneously without requiring a priori knowledge about the number of clusters. It combines features of the popular SOM with two-dimensional position vectors, which serve as a visualization tool to detect the number of clusters present in the data. ADSOM updates its free parameters and allows convergence of its position vectors to a fairly consistent number of clusters provided its initial number of nodes is greater than the expected number of clusters. A novel index is introduced based on hierarchical clustering of the final locations of position vectors. The index allows automated detection of the number of clusters, thereby reducing human error that could be incurred from counting clusters visually. To test ADSOM's consistency in data partitioning, we examine the number of common profiles found in the clusters that were obtained by varying the initial number of nodes. This provides a confidence measure for the clusters formed by ADSOM and illustrates the effect of different initial number of nodes on data partitioning. The reliance of ADSOM in identifying number of clusters is demonstrated by applying it to publicly available yeast gene expression data.

Original languageEnglish (US)
Pages (from-to)633-640
Number of pages8
JournalNeural Networks
Volume16
Issue number5-6
DOIs
StatePublished - Jan 1 2003
Externally publishedYes

All Science Journal Classification (ASJC) codes

  • Cognitive Neuroscience
  • Artificial Intelligence

Keywords

  • Cluster validation
  • Clustering
  • Self-organizing maps
  • Tree-based index

Fingerprint Dive into the research topics of 'Adaptive double self-organizing maps for clustering gene expression profiles'. Together they form a unique fingerprint.

Cite this