A Novel Treatment Optimization System and Top Gene Identification via Machine Learning with Application on Breast Cancer

HTML  XML Download Download as PDF (Size: 5922KB)  PP. 79-99  
DOI: 10.4236/jbise.2018.115008    951 Downloads   2,936 Views  
Author(s)

ABSTRACT

Traditional treatment selection of cancers mainly relies on clinical observations and doctor’s judgment, but most outcomes can hardly be predicted. Through Genomics Topology, we use 272 breast cancer patients’ clinical and gene information as an example to propose a treatment optimization and top gene identification system. This study faces certain challenges such as collinearity and the Curse of Dimensionality within data, so by the idea of Analysis of Variance (ANOVA), Principal Component Analysis (PCA) is implemented to resolve this issue. Several genes, for example, SLC40A1 and ACADSB, are found to be both statistically significant and biological-studies supported; the model developed can precisely predict breast cancer mortality, recurrence time, and survival time, with an average MSE of 3.697, accuracy rate of 88.97%, and F1 score of 0.911. The result and methodology used in this study provide a channel for people to further look into the more precise prediction of other cancer outcomes through machine learning and assist in the discovery of targetable pathways for next-generation cancer treatment methods.

Share and Cite:

Wu, Y. and Chen, Y. (2018) A Novel Treatment Optimization System and Top Gene Identification via Machine Learning with Application on Breast Cancer. Journal of Biomedical Science and Engineering, 11, 79-99. doi: 10.4236/jbise.2018.115008.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.