Scientific data classification: A case study

Gung Wei Chirn, Jason T.L. Wang, Zhiyuan Wang

Research output: Contribution to journal › Conference article › peer-review

4 Scopus citations


Scientific data classification is the activity of determining whether or not an unlabeled scientific object belongs to an existing class. It is an important operation in the management of scientific databases. In this paper we present a case study for scientific data classification. Specifically, we develop a tool for DNA sequence classification. The tool works by generating and matching gapped fingerprints of DNA sequences. Experimental results obtained by applying our tool to classifying a set of Alu sequences demonstrate the good performance of the tool. While the reported research focuses on DNA classification, our techniques should generalize to any domain (e.g. multimedia) where data are naturally represented as sequences.
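The abstract does not specify how gapped fingerprints are built or matched, so the following is only a minimal sketch under stated assumptions, not the authors' tool. It assumes that a gapped fingerprint is an ordered list of short segments separated by gaps of arbitrary length, and that a sequence is assigned to a class when it matches at least a chosen fraction of that class's fingerprints. The names `matches`, `match_fraction`, and `classify`, and the `threshold` parameter, are hypothetical.

```python
from typing import Sequence

# Assumption: a fingerprint is an ordered tuple of segments; any number of
# bases (a "gap") may separate consecutive segments in a matching sequence.
Fingerprint = Sequence[str]


def matches(fingerprint: Fingerprint, sequence: str) -> bool:
    """Return True if every segment occurs in order, without overlap."""
    pos = 0
    for segment in fingerprint:
        idx = sequence.find(segment, pos)  # leftmost occurrence at or after pos
        if idx == -1:
            return False
        pos = idx + len(segment)  # the next segment must start after this one
    return True


def match_fraction(fingerprints: Sequence[Fingerprint], sequence: str) -> float:
    """Fraction of a class's fingerprints that the sequence matches."""
    if not fingerprints:
        return 0.0
    hits = sum(matches(fp, sequence) for fp in fingerprints)
    return hits / len(fingerprints)


def classify(fingerprints: Sequence[Fingerprint], sequence: str,
             threshold: float = 0.5) -> bool:
    """Hypothetical decision rule: accept if enough fingerprints match."""
    return match_fraction(fingerprints, sequence) >= threshold


# Illustrative (made-up) fingerprints and query, not data from the paper.
class_fingerprints = [("ACG", "TTA"), ("GG", "CC"), ("TTT",)]
query = "AACGTGGTTACC"
in_class = classify(class_fingerprints, query, threshold=0.5)
```

Taking the leftmost occurrence of each segment is sufficient here: if any in-order placement of the segments exists, the leftmost one leaves the most room for the segments that follow.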

Original language: English (US)
Pages (from-to): 216-222
Number of pages: 7
Journal: Proceedings of the International Conference on Tools with Artificial Intelligence
State: Published - 1997
Event: Proceedings of the 1997 IEEE 9th IEEE International Conference on Tools with Artificial Intelligence - Newport Beach, CA, USA
Duration: Nov 3 1997 → Nov 8 1997

All Science Journal Classification (ASJC) codes

  • Software

