IJCTE 2013 Vol.5(4): 585-592 ISSN: 1793-8201
DOI: 10.7763/IJCTE.2013.V5.755

A Voting-Based Combination System for Protein Cellular Localization Sites Prediction

Hafida Bouziane, Belhadri Messabih, and Abdallah Chouarfa
Abstract—In recent years, machine learning techniques have attracted significant attention in molecular biology in the genomic era. They have become increasingly important for solving real-world problems such as elucidating protein function. An important step toward understanding a protein's function is predicting its cellular localization site. Many computational methods have been developed for this problem over the years, but the imbalanced distribution of proteins across cellular locations strongly influences their behavior. Hence, the performance and efficiency of existing prediction methods still need to be improved, and a computational method that efficiently predicts protein cellular localization is highly desirable. In this paper, we explore the use of four supervised machine learning algorithms to predict the cellular localization sites of proteins from primary sequence information. Our experiments were performed using Naïve Bayesian, k-Nearest Neighbor and feed-forward Neural Network classifiers. The individual classifiers (experts) were evaluated with and without cross-validation on the E. coli and Yeast benchmarks and combined using the majority voting rule to improve classification accuracy on each dataset. The experimental results show that the proposed combination system significantly outperforms the best individual classifier.
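
The majority voting rule mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the example labels, and the tie-breaking rule (favoring the classifier listed first) are assumptions made for illustration, since the abstract does not specify them.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label chosen by the most classifiers for one sample.

    Tie-breaking is an assumption (the abstract does not state a rule):
    among equally voted labels, the one proposed by the earliest
    classifier in the list wins.
    """
    counts = Counter(predictions)
    top = max(counts.values())
    for label in predictions:
        if counts[label] == top:
            return label


def combine(per_classifier_predictions):
    """Combine several classifiers' label lists sample by sample."""
    return [majority_vote(list(votes))
            for votes in zip(*per_classifier_predictions)]


# Hypothetical outputs of three classifiers on four proteins
# (labels are illustrative localization-site codes, not real results).
nb_pred = ["cp", "im", "pp", "cp"]   # naive Bayesian
knn_pred = ["cp", "pp", "pp", "im"]  # k-nearest neighbor
nn_pred = ["im", "im", "pp", "om"]   # feed-forward neural network

combined = combine([nb_pred, knn_pred, nn_pred])
print(combined)  # ['cp', 'im', 'pp', 'cp']
```

In the last sample all three classifiers disagree, so the assumed tie-breaking rule falls back to the first classifier's label; a real system might instead weight votes by each classifier's validation accuracy.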

Index Terms—Protein localization, naïve Bayesian classifier, k-nearest neighbor classifier, neural network classifier, combination of classifiers, E. coli, yeast.

The authors are with USTO-MB University, BP 1505 El Mnaouer, 3100 Oran, Algeria (e-mail: h_bouziane@univ-usto.dz, messabih@univ-usto.dz, chouarfia@univ-usto.dz).


Cite: Hafida Bouziane, Belhadri Messabih, and Abdallah Chouarfa, "A Voting-Based Combination System for Protein Cellular Localization Sites Prediction," International Journal of Computer Theory and Engineering, vol. 5, no. 4, pp. 585-592, 2013.
