• May 27, 2016 News!The submission for Special Issue is officially open now!   [Click]
  • May 03, 2016 News!Vol.6, No.6 has been indexed by EI (Inspec).   [Click]
  • Mar 17, 2017 News!Vol.9, No.2 has been published with online version. 13 peer reviewed articles from 4 specific areas are published in this issue.   [Click]
General Information
Editor-in-chief
Prof. Wael Badawy
Department of Computing and Information Systems Umm Al Qura University, Canada
I'm happy to take on the position of editor in chief of IJCTE. We encourage authors to submit papers concerning any branch of computer theory and engineering.
IJCTE 2012 Vol.5(2): 214-222 ISSN: 1793-8201
DOI: 10.7763/IJCTE.2013.V5.681

A Distributed N-Gram Indexing System to Optimizing Persian Information Retrieval

Mohadese Danesh, Behrouz Minaei, and Omid Kashefi
Abstract—As the amount of information and the number of queries has been increasing today, indexing is a good solution to fight with the inherent complexity of text retrieval and accelerating information retrieval in different languages. Also N-Gram Indexing is a solution of the issues such as stemming, misspellings, multilingual and partial matching and has the advantages of language independent and error endurance. Persian is a name of a language which is common in the Middle East. It is spoken in some countries like Iran, Afghanistan and Tajikistan. Therefore, Persian is the language of many documents is published on the net. But, not more researches have been done about the Persian documents retrieval. In this paper, we present a method for Persian documents retrieving using N-gram indexing and distribution technique. The proposed index is a method of more effective answering queries that increases the quality of information retrieval substantially and we gain more optimizing retrieval in Persian documents. But the speed of N-gram indexing is low; to solve this problem we design a distributed N-gram indexing mechanism for large systems of Persian language. Compare with the other methods in this field, we improve the quality of retrieved documents and also the speed of information retrieval.

Index Terms—Information retrieval, indexing, n-gram, distributed, Persian.

The authors are with the School of Computer Engineering, Iran University of Science and Technology, Tehran, Iran (e-mail: mddanesh@comp.iust.ac.ir, b_minaei@iust.ac.ir, kashefi@{iust.ac.ir, ieee.org}).

[PDF]

Cite: Mohadese Danesh, Behrouz Minaei, and Omid Kashefi, "A Distributed N-Gram Indexing System to Optimizing Persian Information Retrieval," International Journal of Computer Theory and Engineering vol. 5, no. 2, pp. 214-222, 2013.
Copyright © 2008-2015. International Journal of Computer Theory and Engineering. All rights reserved.
E-mail: ijcte@vip.163.com