TY - JOUR
T1 - Searching for Non-coding RNAs in Genomic Sequences Using ncRNAscout
AU - Bao, Michael
AU - Cervantes Cervantes, Miguel
AU - Zhong, Ling
AU - Wang, Jason T.L.
N1 - Funding Information:
We thank the anonymous reviewers for many suggestions. This research was supported in part by US National Science Foundation (Grant No. IIS-0707571 ).
PY - 2012/4
Y1 - 2012/4
N2 - Recently non-coding RNA (ncRNA) genes have been found to serve many important functions in the cell such as regulation of gene expression at the transcriptional level. Potentially there are more ncRNA molecules yet to be found and their possible functions are to be revealed. The discovery of ncRNAs is a difficult task because they lack sequence indicators such as the start and stop codons displayed by protein-coding RNAs. Current methods utilize either sequence motifs or structural parameters to detect novel ncRNAs within genomes. Here, we present an ab initio ncRNA finder, named ncRNAscout, by utilizing both sequence motifs and structural parameters. Specifically, our method has three components: (i) a measure of the frequency of a sequence, (ii) a measure of the structural stability of a sequence contained in a t-score, and (iii) a measure of the frequency of certain patterns within a sequence that may indicate the presence of ncRNA. Experimental results show that, given a genome and a set of known ncRNAs, our method is able to accurately identify and locate a significant number of ncRNA sequences in the genome. The ncRNAscout tool is available for downloading at http://bioinformatics.njit.edu/ncRNAscout.
AB - Recently non-coding RNA (ncRNA) genes have been found to serve many important functions in the cell such as regulation of gene expression at the transcriptional level. Potentially there are more ncRNA molecules yet to be found and their possible functions are to be revealed. The discovery of ncRNAs is a difficult task because they lack sequence indicators such as the start and stop codons displayed by protein-coding RNAs. Current methods utilize either sequence motifs or structural parameters to detect novel ncRNAs within genomes. Here, we present an ab initio ncRNA finder, named ncRNAscout, by utilizing both sequence motifs and structural parameters. Specifically, our method has three components: (i) a measure of the frequency of a sequence, (ii) a measure of the structural stability of a sequence contained in a t-score, and (iii) a measure of the frequency of certain patterns within a sequence that may indicate the presence of ncRNA. Experimental results show that, given a genome and a set of known ncRNAs, our method is able to accurately identify and locate a significant number of ncRNA sequences in the genome. The ncRNAscout tool is available for downloading at http://bioinformatics.njit.edu/ncRNAscout.
KW - Genome-wide ncRNA discovery
KW - Sequence motifs
KW - Structural parameters
UR - http://www.scopus.com/inward/record.url?scp=84863432518&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863432518&partnerID=8YFLogxK
U2 - 10.1016/j.gpb.2012.05.004
DO - 10.1016/j.gpb.2012.05.004
M3 - Article
C2 - 22768985
AN - SCOPUS:84863432518
SN - 1672-0229
VL - 10
SP - 114
EP - 121
JO - Genomics, Proteomics and Bioinformatics
JF - Genomics, Proteomics and Bioinformatics
IS - 2
ER -