The Research on Identification of Gene Splice Sites by Support Vector Machine

PP. 53-57
DOI: 10.4236/jbise.2016.910B007

ABSTRACT

The recognition of splice sites is an important step in eukaryotic DNA sequence analysis, and many researchers are working to improve identification accuracy. We studied this problem using the support vector machine (SVM), a well-known data mining algorithm. The training and test data were taken from the HS3D dataset. Orthogonal encoding of the nucleic acid sequences combined with an RBF kernel function achieved an excellent accuracy rate, and the cross-validation experiments suggest that base pattern information is mainly located within 20 nucleotides upstream and downstream of the splice sites.
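As a rough illustration of the two ingredients the abstract names, the sketch below shows orthogonal (one-hot) encoding of a nucleotide window and an RBF kernel evaluated on the encoded vectors. It is a minimal sketch under stated assumptions and not the authors' implementation: the function names `encode`, `window`, and `rbf_kernel`, the base-to-vector assignment, the indexing convention for `site`, and the `gamma` value are all hypothetical choices made for this example.

```python
import math

# Orthogonal (one-hot) code: each base maps to its own unit vector.
# The particular assignment below is an assumption for illustration.
ORTHOGONAL_CODE = {
    "A": (1, 0, 0, 0),
    "C": (0, 1, 0, 0),
    "G": (0, 0, 1, 0),
    "T": (0, 0, 0, 1),
}


def encode(seq):
    """Flatten a DNA string into a feature vector of length 4 * len(seq)."""
    vec = []
    for base in seq.upper():
        # Raises KeyError for ambiguous symbols such as 'N'.
        vec.extend(ORTHOGONAL_CODE[base])
    return vec


def window(seq, site, flank=20):
    """Return `flank` bases on each side of the boundary before index `site`.

    `site` is assumed to be the 0-based index of the first base
    downstream of the candidate splice boundary.
    """
    if site - flank < 0 or site + flank > len(seq):
        raise ValueError("window extends past the sequence ends")
    return seq[site - flank:site + flank]


def rbf_kernel(x, y, gamma=0.1):
    """RBF kernel: exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)
```

With this encoding, the squared distance between two windows equals twice the number of positions at which they differ, so the RBF kernel depends only on the mismatch count and treats all base substitutions alike. A 20-nucleotide flank on each side gives a 40-base window and a 160-dimensional feature vector.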

Share and Cite:

Li, H. and He, G. (2016) The Research on Identification of Gene Splice Sites by Support Vector Machine. Journal of Biomedical Science and Engineering, 9, 53-57. doi: 10.4236/jbise.2016.910B007.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.