TY - JOUR
T1 - Detecting conserved secondary structures in RNA molecules using constrained structural alignment
AU - Khaladkar, Mugdha
AU - Patel, Vandanaben
AU - Bellofatto, Vivian
AU - Wilusz, Jeffrey
AU - Wang, Jason T.L.
N1 - Funding Information:
We thank Marvin Nakayama, Bruce Shapiro and Bin Tian for helpful conversations during the preparation of this paper. This work was partially supported by National Science Foundation grant IIS-0707571 and National Institutes of Health grant R01 AI053835.
PY - 2008/8
Y1 - 2008/8
N2 - Constrained sequence alignment has been studied extensively in the past. Different forms of constraints have been investigated, where a constraint can be a subsequence, a regular expression, or a probability matrix of symbols and positions. However, constrained structural alignment has been investigated to a much lesser extent. In this paper, we present an efficient method for constrained structural alignment and apply the method to detecting conserved secondary structures, or structural motifs, in a set of RNA molecules. The proposed method combines both sequence and structural information of RNAs to find an optimal local alignment between two RNA secondary structures, one of which is a query and the other is a subject structure in the given set. The method allows a biologist to annotate conserved regions, or constraints, in the query RNA structure and incorporates these regions into the alignment process to obtain biologically more meaningful alignment scores. A statistical measure is developed to assess the significance of the scores. Experimental results based on detecting internal ribosome entry sites in the RNA molecules of hepatitis C virus and Trypanosoma brucei demonstrate the effectiveness of the proposed method and its superiority over existing techniques.
AB - Constrained sequence alignment has been studied extensively in the past. Different forms of constraints have been investigated, where a constraint can be a subsequence, a regular expression, or a probability matrix of symbols and positions. However, constrained structural alignment has been investigated to a much lesser extent. In this paper, we present an efficient method for constrained structural alignment and apply the method to detecting conserved secondary structures, or structural motifs, in a set of RNA molecules. The proposed method combines both sequence and structural information of RNAs to find an optimal local alignment between two RNA secondary structures, one of which is a query and the other is a subject structure in the given set. The method allows a biologist to annotate conserved regions, or constraints, in the query RNA structure and incorporates these regions into the alignment process to obtain biologically more meaningful alignment scores. A statistical measure is developed to assess the significance of the scores. Experimental results based on detecting internal ribosome entry sites in the RNA molecules of hepatitis C virus and Trypanosoma brucei demonstrate the effectiveness of the proposed method and its superiority over existing techniques.
KW - Constrained structural alignment
KW - RNA motif
KW - Viruses and protozoa
UR - http://www.scopus.com/inward/record.url?scp=45549092593&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=45549092593&partnerID=8YFLogxK
U2 - 10.1016/j.compbiolchem.2008.03.013
DO - 10.1016/j.compbiolchem.2008.03.013
M3 - Article
C2 - 18472302
AN - SCOPUS:45549092593
SN - 1476-9271
VL - 32
SP - 264
EP - 272
JO - Computational Biology and Chemistry
JF - Computational Biology and Chemistry
IS - 4
ER -