Evaluating the Significance of Sequence Motifs by the Minimum Description Length Principle

Qicheng Ma, Jason Wang

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Sdiscover is a tool capable of finding subsequences, possibly separated by arbitrarily long gaps, in a set of sequences. These subsequences are referred to as motifs. This paper proposes a method to evaluate the significance of the sequence motifs found by Sdiscover. The method is based on the minimum description length principle and Shannon's coding theory. The equivalence of the proposed method to Bayesian inference is also discussed.
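
The link between the minimum description length principle, Shannon's coding theory, and Bayesian inference mentioned above can be illustrated with the standard textbook identity below. It is a generic sketch and not necessarily the paper's own formulation: the symbols H (a candidate hypothesis, such as a motif) and D (the observed set of sequences) are assumptions introduced here for illustration, and code lengths are taken to be Shannon-optimal, so that L(x) = -log2 P(x).

```latex
% Assumed notation (not from the paper): H = hypothesis/motif, D = sequence data.
\begin{align*}
L(H) &= -\log_2 P(H), \qquad L(D \mid H) = -\log_2 P(D \mid H), \\
\hat{H}_{\mathrm{MDL}} &= \arg\min_{H} \bigl[ L(H) + L(D \mid H) \bigr] \\
&= \arg\max_{H} P(H)\, P(D \mid H) \\
&= \arg\max_{H} P(H \mid D).
\end{align*}
```

The last step uses Bayes' rule, P(H | D) = P(H) P(D | H) / P(D), together with the fact that P(D) does not depend on H. Under these assumptions, choosing the hypothesis with the shortest total description length coincides with choosing the maximum a posteriori hypothesis.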

Original language: English (US)
Title of host publication: Proceedings of the Fifth Joint Conference on Information Sciences, JCIS 2000, Volume 2
Editors: P.P. Wang, P.P. Wang
Pages: 798-801
Number of pages: 4
Volume: 5
Edition: 2
State: Published - Dec 1 2000
Event: Proceedings of the Fifth Joint Conference on Information Sciences, JCIS 2000 - Atlantic City, NJ, United States
Duration: Feb 27 2000 to Mar 3 2000

Other

Other: Proceedings of the Fifth Joint Conference on Information Sciences, JCIS 2000
Country/Territory: United States
City: Atlantic City, NJ
Period: 2/27/00 to 3/3/00

All Science Journal Classification (ASJC) codes

  • Computer Science(all)
