IJCTE 2009 Vol.1(1): 27-34 ISSN: 1793-8201
DOI: 10.7763/IJCTE.2009.V1.5

Compounded Uniqueness Level: Geo-Location Indexing Using Address Parser

M. Shoaib Jameel and Tejbanta Singh Chingtham

Abstract—Geo-location searching is an important feature for any search engine, and research in this field is not new. The issue that remains is how a search engine can know whether a web page belongs to India or the USA. URLs ending with [.in] are the obvious indicator for India, but not all web sites from India end with [.in]. This paper describes a technology known as the address parser. The address parser searches for patterns in a web page that communicate address information. It does not parse every web page of a website to extract the address; it works only on those URLs where the probability of finding the address of the website owner is highest, thereby eliminating false positives. A central knowledge base is built manually, which contains information such as the states of a country followed by their city names and other relevant information that may help the address parser do precise local indexing. It was observed that the address parser was not only able to recognize the address patterns in the web pages but also indexed them to city-specific information. As a result, when a person located in Gangtok, Sikkim, India searched for [universities], the searching module showed the link of [Sikkim Manipal University] first, followed by other links from India. This work also focuses on the importance of the terms contained in the URLs for geography-based indexing and searching.
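The pipeline summarized in the abstract (filter URLs likely to carry an address, then match page text against a manually built knowledge base) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the `KNOWLEDGE_BASE` contents, the `ADDRESS_URL_TERMS` list, and the functions `is_candidate_url`, `parse_address`, and `index_page` are hypothetical and are not taken from the paper, whose actual patterns and data are not given in the abstract.

```python
import re
from urllib.parse import urlparse

# Hypothetical knowledge base: country -> state -> cities.
# The paper builds its own knowledge base manually; these entries are placeholders.
KNOWLEDGE_BASE = {
    "India": {
        "Sikkim": ["Gangtok", "Rangpo"],
        "Karnataka": ["Bengaluru", "Mysuru"],
    },
}

# Assumed URL terms that hint at a page carrying the site owner's address.
ADDRESS_URL_TERMS = ("contact", "about", "address", "location")


def is_candidate_url(url):
    """Return True if the URL path contains a term suggesting an address page."""
    path = urlparse(url).path.lower()
    return any(term in path for term in ADDRESS_URL_TERMS)


def parse_address(text):
    """Return (country, state, city) for the first known city found, else None."""
    for country, states in KNOWLEDGE_BASE.items():
        for state, cities in states.items():
            for city in cities:
                # Whole-word, case-insensitive match on the city name.
                pattern = r"\b" + re.escape(city) + r"\b"
                if re.search(pattern, text, re.IGNORECASE):
                    return (country, state, city)
    return None


def index_page(url, text):
    """Parse only pages whose URL looks address-bearing; skip all others."""
    if not is_candidate_url(url):
        return None
    return parse_address(text)
```

For example, `index_page("http://example.org/contact-us", "Example University, Gangtok, Sikkim, India")` yields a city-level location, while a page at `http://example.org/news/2009` is skipped without being parsed. Skipping non-candidate URLs is what limits false positives in this sketch, since a city name mentioned in passing on an arbitrary page is never indexed.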

Index Terms—Address Parser, Geo-location Indexing, Information Retrieval, Localized Searching

M. Shoaib Jameel was with the Department of Computer Science and Engineering, Sikkim Manipal Institute of Technology, Majitar, Rangpo, East Sikkim - 737132, India. He is now with the Department of Research and Development/Scientific Services, Tata Steel Limited, India (tel.: +919234502858).
Tejbanta Singh Chingtham is with the Department of Computer Science and Engineering, Sikkim Manipal Institute of Technology, Majitar, Rangpo, East Sikkim - 737132, India.


Cite: M. Shoaib Jameel and Tejbanta Singh Chingtham, "Compounded Uniqueness Level: Geo-Location Indexing Using Address Parser," International Journal of Computer Theory and Engineering, vol. 1, no. 1, pp. 27-34, 2009.
