Journal of Biomedical Science and Engineering

Volume 8, Issue 10 (October 2015)

ISSN Print: 1937-6871   ISSN Online: 1937-688X

Google-based Impact Factor: 0.66

Sequence Motif-Based One-Class Classifiers Can Achieve Comparable Accuracy to Two-Class Learners for Plant microRNA Detection

PP. 684-694
DOI: 10.4236/jbise.2015.810065
Author(s): Yousef, M., Allmer, J. and Khalifa, W.

ABSTRACT

MicroRNAs (miRNAs) are short nucleotide sequences expressed by a genome that are involved in post-transcriptional modulation of gene expression. Since miRNAs must be co-expressed with their target mRNAs for an effect to be observed, and since miRNA-target interactions can be cooperative, it is currently not possible to develop a comprehensive experimental atlas of miRNAs and their targets. To overcome this limitation, machine learning has been applied to miRNA detection. In general, binary (two-class) learning approaches are applied to miRNA discovery. These learners consider both positive (miRNA) and negative (non-miRNA) examples during the training process. One-class classifiers, on the other hand, use only the information for the target class (miRNA). The one-class approach in machine learning is gradually receiving more attention, particularly for solving problems where the negative class is not well defined. This is especially true for miRNAs, where the positive class can be experimentally confirmed relatively easily, but where it is not currently possible to call any part of a genome a non-miRNA. To do so, a candidate region would have to be co-expressed with all other possible transcripts of the genome, which is currently a futile endeavor. For machine learning, miRNAs need to be transformed into feature vectors, and some currently used features, such as minimum free energy, vary widely in the case of plant miRNAs. In this study, we aimed to analyze different one-class methods and the effectiveness of motif-based features for the prediction of plant miRNA genes. We show that applying these one-class classifiers, which rely only on sequence-based features such as k-mers and motifs, is promising and useful for this kind of problem when compared to the results from two-class classification. In some cases, the one-class results are, to our surprise, more accurate than those from two-class classifiers.
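
As a rough illustration of the ideas in the abstract, the sketch below turns sequences into k-mer frequency vectors and trains a toy one-class model on positive examples only. It is not the authors' method: the centroid-distance classifier, the names `kmer_features` and `CentroidOneClass`, the `quantile` parameter, and the example sequences are all assumptions made for illustration.

```python
from itertools import product
from math import sqrt

ALPHABET = "ACGU"


def kmer_features(seq, k=2):
    """Return normalized k-mer frequencies of an RNA sequence as a fixed-length list."""
    seq = seq.upper().replace("T", "U")  # accept DNA-style input as well
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip windows with ambiguous bases such as N
            counts[window] += 1
            total += 1
    return [counts[m] / total if total else 0.0 for m in kmers]


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class CentroidOneClass:
    """Toy one-class model: accept a sample if it lies close to the positive centroid."""

    def fit(self, vectors, quantile=0.9):
        n = len(vectors)
        dim = len(vectors[0])
        # The centroid is learned from positive examples only; no negatives are needed.
        self.centroid = [sum(v[j] for v in vectors) / n for j in range(dim)]
        dists = sorted(euclidean(v, self.centroid) for v in vectors)
        # The threshold is the distance at the chosen quantile of the training set.
        self.threshold = dists[min(n - 1, int(quantile * n))]
        return self

    def predict(self, vector):
        return euclidean(vector, self.centroid) <= self.threshold


# Hypothetical toy sequences, not real miRNA data.
positives = ["AUAUAUAUAU", "AUAUAUAUUA", "UAUAUAUAUA"]
model = CentroidOneClass().fit([kmer_features(s) for s in positives])
print(model.predict(kmer_features("AUAUAUAUAU")))
print(model.predict(kmer_features("GGGGGGGGGG")))
```

A two-class learner would additionally require non-miRNA training vectors, which is exactly the class the abstract describes as not well defined.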

Share and Cite:

Yousef, M. , Allmer, J. and Khalifa, W. (2015) Sequence Motif-Based One-Class Classifiers Can Achieve Comparable Accuracy to Two-Class Learners for Plant microRNA Detection. Journal of Biomedical Science and Engineering, 8, 684-694. doi: 10.4236/jbise.2015.810065.

Cited by

[1] Computational detection of pre-microRNAs
miRNomics, 2022
[2] miRNAFinder: A Comprehensive Web Resource for Plant Pre-microRNA Classification
2021
[3] MultiKOC: Multi-One-Class Classifier Based K-Means Clustering
2021
[4] Classification of Precursor MicroRNAs from Different Species Based on K-mer Distance Features
2021
[5] Machine learning for plant microRNA prediction: A systematic review
2021
[6] miRNAFinder: A pre-microRNA classifier for plants and analysis of feature impact
2020
[7] Hamming Distance and K-mer Features for Classification of Pre-cursor microRNAs from Different Species
2019
[8] Computational methods for the ab initio identification of novel microRNA in plants: a systematic review
2019
[9] An improved ensemble method for microRNA prediction models (一种改进的 microRNA 预测模型集成方法)
2018
[10] Categorization of species based on their microRNAs employing sequence motifs, information-theoretic sequence feature extraction, and k-mers
2017
[11] Differential Expression of Toxoplasma gondii MicroRNAs in Murine and Human Hosts
Non-coding RNAs and Inter-kingdom Communication, 2016
[12] The impact of feature selection on one and two-class classification performance for plant microRNAs
2016
[13] Feature Selection Has a Large Impact on One-Class Classification Accuracy for MicroRNAs in Plants
Advances in Bioinformatics, 2016

Copyright © 2024 by authors and Scientific Research Publishing Inc.

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.