Effective classification of microRNA precursors using feature mining and AdaBoost algorithms

Ling Zhong, Jason T.L. Wang, Dongrong Wen, Virginie Aris, Patricia Soteropoulos, Bruce A. Shapiro

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

MicroRNAs play important roles in most biological processes, including cell proliferation, tissue differentiation, and embryonic development, among others. They originate from precursor transcripts (pre-miRNAs), which contain phylogenetically conserved stem-loop structures. An important bioinformatics problem is to distinguish the pre-miRNAs from pseudo pre-miRNAs that have similar stem-loop structures. We present here a novel method for tackling this bioinformatics problem. Our method, named MirID, accepts an RNA sequence as input, and classifies the RNA sequence either as positive (i.e., a real pre-miRNA) or as negative (i.e., a pseudo pre-miRNA). MirID employs a feature mining algorithm for finding combinations of features suitable for building pre-miRNA classification models. These models are implemented using support vector machines, which are combined to construct a classifier ensemble. The accuracy of the classifier ensemble is further enhanced by the utilization of an AdaBoost algorithm. When compared with two closely related tools on twelve species analyzed with these tools, MirID outperforms the existing tools on the majority of the twelve species. MirID was also tested on nine additional species, and the results showed high accuracies on the nine species. The MirID web server is fully operational and freely accessible at http://bioinformatics. njit.edu/MirID/. Potential applications of this software in genomics and medicine are also discussed.

Original languageEnglish (US)
Pages (from-to)486-493
Number of pages8
JournalOMICS A Journal of Integrative Biology
Volume17
Issue number9
DOIs
StatePublished - 2013

All Science Journal Classification (ASJC) codes

  • Biotechnology
  • Biochemistry
  • Molecular Medicine
  • Molecular Biology
  • Genetics

Fingerprint

Dive into the research topics of 'Effective classification of microRNA precursors using feature mining and AdaBoost algorithms'. Together they form a unique fingerprint.

Cite this