New techniques for extracting features from protein sequences

J. T.L. Wang, Q. Ma, D. Shasha, C. H. Wu

Research output: Contribution to journal › Article › peer-review

89 Scopus citations


In this paper we propose new techniques to extract features from protein sequences. We then use the features as inputs for a Bayesian neural network (BNN) and apply the BNN to classify protein sequences obtained from the PIR (Protein Information Resource) database maintained at the National Biomedical Research Foundation. To evaluate the performance of the proposed approach, we compare it with other protein classifiers based on sequence alignment and machine learning methods. Experimental results show the high precision of the proposed classifier and the complementarity of the bioinformatics tools studied in the paper.

Original language: English (US)
Pages (from-to): 426-441
Number of pages: 16
Journal: IBM Systems Journal
Issue number: 2
State: Published - 2001

All Science Journal Classification (ASJC) codes

  • Software
  • Theoretical Computer Science
  • General Computer Science
  • Information Systems
  • Computer Graphics and Computer-Aided Design
  • Computational Theory and Mathematics

