Isfahan University of Medical Sciences

Science Communicator Platform

Stay connected! Follow us on X network (Twitter):
Share this content! On (X network) By
Identification of Gene Signatures for Classifying of Breast Cancer Subtypes Using Protein Interaction Database and Support Vector Machines Publisher



Gharibi A1 ; Sehhati MR2 ; Vard A2 ; Mohebian MR3
Authors
Show Affiliations
Authors Affiliations
  1. 1. Department of Medical Engineering, Faculty of Advanced Medical Technology, Isfahan University of Medical Sciences, Isfahan, Iran
  2. 2. Faculty of Advanced Medical Technology, Isfahan University of Medical Sciences, Isfahan, Iran
  3. 3. Department of Electrical Engineering, University of Isfahan, Isfahan, Iran

Source: 2015 5th International Conference on Computer and Knowledge Engineering, ICCKE 2015 Published:2015


Abstract

Many studies have used the microarray gene expression data in order to classifying breast cancer subtypes. However, the classification accuracy was not acceptable in many cases even by applying the algorithms to only a single set of data. In this regard, using appropriate algorithm in every step of whole procedure, applying useful bioinformatics databases, considering the interaction among genes, and properly combining analytical steps are the main challenging problems. In this study a solution was proposed which followed a three step process. In the first step a filter feature selection method was used to produce a small set of informative genes. In the second step, the primary selected genes were mapped on the protein-protein interaction network to extend the gene set according to the linking among corresponding proteins. Thus, a portion of genes that was pruned in the first stage is added again to the primary set of selected genes. In the final stage, by using support vector machine-based recursive feature elimination (SVMRFE) method, the final set of informative genes was identified. After that, we compared our proposed algorithm with decision tree methods in the same datasets. The proposed procedure was evaluated on two publicly available DNA microarray dataset, including 456 samples on breast cancer. The proposed algorithm reached to 100% accuracy for predicting Luminal B by using the JMI method in the first step. In conclusion the proposed method showed an appealing improvement in classification accuracy for a multiclass prediction problem. We can predict subtypes with greater than 91.2% overall accuracy by proposed algorithm. However, the accuracy of prediction subtypes in tree decision method is 78.6%. © 2015 IEEE.
Other Related Docs
18. Evaluating Radiomics Feature Reduction for Thyroid Nodule Segmentation in Thermal Imaging, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2025)