AMBIENTUM BIOETHICA BIOLOGIA CHEMIA DIGITALIA DRAMATICA EDUCATIO ARTIS GYMNAST. ENGINEERING EPHEMERIDES EUROPAEA GEOGRAPHIA GEOLOGIA HISTORIA HISTORIA ARTIUM INFORMATICA IURISPRUDENTIA MATHEMATICA MUSICA NEGOTIA OECONOMICA PHILOLOGIA PHILOSOPHIA PHYSICA POLITICA PSYCHOLOGIA-PAEDAGOGIA SOCIOLOGIA THEOLOGIA CATHOLICA THEOLOGIA CATHOLICA LATIN THEOLOGIA GR.-CATH. VARAD THEOLOGIA ORTHODOXA THEOLOGIA REF. TRANSYLVAN
|
|||||||
The STUDIA UNIVERSITATIS BABEŞ-BOLYAI issue article summary The summary of the selected article appears at the bottom of the page. In order to get back to the contents of the issue this article belongs to you have to access the link from the title. In order to see all the articles of the archive which have as author/co-author one of the authors mentioned below, you have to access the link from the author's name. |
|||||||
STUDIA INFORMATICA - Issue no. Sp.Issue%201 / 2009 | |||||||
Article: |
A ROMANIAN STEMMER. Authors: CLAUDIU SORIN IRIMIAŞ. |
||||||
Abstract: This paper presents an improvement of the Romanian stemmer algorithm described on Martin Porters Snowball web-site. The changes made to the original algorithm are minimal but our experimental results indicate an increase of the accuracy with almost 10%, no loss being identified in the computationaltime. Two different experiments were made, the first was made on a 22,570 Romanian words vocabulary, and the second was accomplished using an article from a Romanian newspaper as input. The Romanian stemmer is based on a suffix stripping algorithm which consists of a set of rules to be applied to theinput word to find its root form. Because of its efficiency, especially in regards totime and accuracy the Romanian suffix stripping algorithm is suited to be usedin the information retrieval field for problems that require a smaller amount of computational time and do not necessitate that the accuracy of the result is over 80%. Key words and phrases. suffix striping, stemming algorithm, Romanian stemmer. |
|||||||