Application of Statistical N-Gram Clustering Algorithm of Words in Macedonian Language

Ilioski, Bojan and Popeska, Zhaneta (2013) Application of Statistical N-Gram Clustering Algorithm of Words in Macedonian Language. In: Proceedings of the Tenth Conference on Informatics and Information Technology. Faculty of Computer Science and Engineering, Ss. Cyril and Methodius University in Skopje, Macedonia, Skopje, Macedonia, pp. 106-108. ISBN 978-608-4699-01-9

[img]
Preview
Text
978-608-4699-01-9_pp106-108.pdf

Download (124kB) | Preview
Official URL: http://ciit.finki.ukim.mk

Abstract

The N-gram algorithm, one of the most famous algorithms used for statistical clustering of words, determines the similarity between two words, based upon a statistical analysis. In this paper we present the work of this algorithm and the results obtained on clustering 10 000 words in the Macedonian language. The realization of this algorithm is made in the programming language Java.

Item Type: Book Section
Subjects: International Conference on Informatics and Information Technologies > Intelligent Systems
International Conference on Informatics and Information Technologies > Robotics
International Conference on Informatics and Information Technologies > Bioinformatics
Depositing User: Vangel Ajanovski
Date Deposited: 28 Oct 2016 00:15
Last Modified: 28 Oct 2016 00:15
URI: http://eprints.finki.ukim.mk/id/eprint/11056

Actions (login required)

View Item View Item