Exploring the Terrier Information Retrieval Platform for Web Search of Documents Written in Macedonian

Vangelovski, Vasil and Gievska, Sonja (2013) Exploring the Terrier Information Retrieval Platform for Web Search of Documents Written in Macedonian. In: Proceedings of the Tenth Conference on Informatics and Information Technology. Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University in Skopje, Macedonia, Skopje, Macedonia, pp. 109-112. ISBN 978-608-4699-01-9

Official URL: http://ciit.finki.ukim.mk

Abstract

Terrier is a modular and scalable platform for rapid development of Information Retrieval (IR) systems. This paper presents a short overview of the Terrier architecture and describes ways in which it can be extended for more effective indexing and searching of documents written in the Macedonian language. Although Terrier supports out-of-the-box search in a few non-English languages, the Macedonian language poses some specific challenges, especially when searching Web content. For the purpose of this research, an integrated search platform was developed that extends the text retrieval engine with more advanced content-filtering capabilities. Some of the proposed methods can be easily applied to other non-English languages.

Item Type: Book Section
Subjects: International Conference on Informatics and Information Technologies > Intelligent Systems
International Conference on Informatics and Information Technologies > Robotics
International Conference on Informatics and Information Technologies > Bioinformatics
Depositing User: Vangel Ajanovski
Date Deposited: 28 Oct 2016 00:15
Last Modified: 28 Oct 2016 00:15
URI: http://eprints.finki.ukim.mk/id/eprint/11309