Automated Discovery of Active Motifs in Multiple RNA Secondary Structures

Jason T.L. Wang, Bruce A. Shapiro, Dennis Shasha, Kaizhong Zhang, Chia Yo Chang

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

20 Scopus citations

Abstract

In this paper we present a method for discovering approximately common motifs (also known as active motifs) in multiple RNA secondary structures. The secondary structures can be represented as ordered trees (i.e., the order among siblings matters). Motifs in these trees are connected subgraphs that can differ in both substitutions and deletions/insertions. The proposed method consists of two steps: (1) find candidate motifs in a small sample of the secondary structures; (2) search all of the secondary structures to determine how frequently these motifs occur (within the allowed approximation) in the secondary structures. To reduce the running time, we develop two optimization heuristics based on sampling and pattern matching techniques. Experimental results obtained by running these algorithms on both generated data and RNA secondary structures show the good performance of the algorithms. To demonstrate the utility of our algorithms, we discuss their applications to conducting the phylogenetic study of RNA sequences obtained from GenBank.
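
The two-step procedure described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes candidate motifs are complete subtrees (the abstract allows more general connected subgraphs), it takes the first few trees as the sample instead of sampling, it omits the two optimization heuristics, and it uses a Selkow-style top-down tree edit distance as a simple stand-in for the paper's approximate matching. The names `dist`, `occurs`, `discover`, their parameters, and the node labels (`M`, `H`, `I`, `B`, `R`) are all hypothetical.

```python
from functools import lru_cache

# An ordered tree is (label, (child, child, ...)); sibling order matters.
# Nested tuples are hashable, so trees can be cached and used as dict keys.

def size(t):
    """Number of nodes in tree t."""
    return 1 + sum(size(c) for c in t[1])

def subtrees(t):
    """Yield every complete subtree of t, including t itself."""
    yield t
    for c in t[1]:
        yield from subtrees(c)

@lru_cache(maxsize=None)
def dist(a, b):
    """Selkow-style top-down edit distance (stand-in, not the paper's metric).

    Relabeling a node costs 1; inserting or deleting a child costs the
    size of the whole subtree rooted at that child.
    """
    root_cost = 0 if a[0] == b[0] else 1
    ca, cb = a[1], b[1]
    # Sequence edit distance over the two ordered lists of children.
    d = [[0] * (len(cb) + 1) for _ in range(len(ca) + 1)]
    for i in range(1, len(ca) + 1):
        d[i][0] = d[i - 1][0] + size(ca[i - 1])
    for j in range(1, len(cb) + 1):
        d[0][j] = d[0][j - 1] + size(cb[j - 1])
    for i in range(1, len(ca) + 1):
        for j in range(1, len(cb) + 1):
            d[i][j] = min(
                d[i - 1][j] + size(ca[i - 1]),                 # delete subtree
                d[i][j - 1] + size(cb[j - 1]),                 # insert subtree
                d[i - 1][j - 1] + dist(ca[i - 1], cb[j - 1]),  # match children
            )
    return root_cost + d[len(ca)][len(cb)]

def occurs(motif, tree, max_dist):
    """True if some subtree of `tree` is within max_dist of `motif`."""
    return any(dist(motif, s) <= max_dist for s in subtrees(tree))

def discover(trees, sample_size, min_size, max_dist, min_occur):
    """Step 1: collect candidates from a sample. Step 2: count over all trees."""
    sample = trees[:sample_size]  # assumption: a prefix stands in for sampling
    candidates = {s for t in sample for s in subtrees(t) if size(s) >= min_size}
    found = {}
    for motif in candidates:
        count = sum(occurs(motif, t, max_dist) for t in trees)
        if count >= min_occur:
            found[motif] = count
    return found

# Hypothetical toy structures; the labels carry no meaning from the paper.
H = ("H", ())
t1 = ("M", (H, ("I", (H,))))
t2 = ("M", (H, ("B", (H,))))
t3 = ("R", (("I", (H,)),))

motifs = discover([t1, t2, t3], sample_size=1, min_size=2,
                  max_dist=1, min_occur=3)
```

With these toy inputs, `motifs` maps the subtree `("I", (("H", ()),))` to 3: it matches exactly in `t1` and `t3`, and in `t2` after one relabeling. The full tree `t1` is also a candidate but reaches only two of the three structures within distance 1, so it is filtered out by `min_occur`.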

Original language: English (US)
Title of host publication: Proceedings - 2nd International Conference on Knowledge Discovery and Data Mining, KDD 1996
Editors: Evangelos Simoudis, Jiawei Han, Usama M. Fayyad
Publisher: AAAI Press
Pages: 70-75
Number of pages: 6
ISBN (Electronic): 1577350049, 9781577350040
State: Published - 1996
Event: 2nd International Conference on Knowledge Discovery and Data Mining, KDD 1996 - Portland, United States
Duration: Aug 2 1996 to Aug 4 1996

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Software