The STUDIA UNIVERSITATIS BABEŞ-BOLYAI issue article summary

STUDIA INFORMATICA - Issue no. Sp. Issue 1 / 2009

Article: A ROMANIAN STEMMER.

Authors: CLAUDIU SORIN IRIMIAŞ.

Abstract: This paper presents an improvement of the Romanian stemmer algorithm described on Martin Porter's Snowball website. The changes made to the original algorithm are minimal, but our experimental results indicate an increase in accuracy of almost 10%, with no loss identified in computational time. Two experiments were carried out: the first on a vocabulary of 22,570 Romanian words, and the second using an article from a Romanian newspaper as input. The Romanian stemmer is based on a suffix-stripping algorithm, which consists of a set of rules applied to the input word to find its root form. Because of its efficiency, especially with regard to time and accuracy, the Romanian suffix-stripping algorithm is suited to information retrieval problems that require little computational time and do not require the accuracy of the result to exceed 80%.

Key words and phrases. suffix stripping, stemming algorithm, Romanian stemmer.
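
To make the idea of rule-based suffix stripping concrete, here is a minimal Python sketch. It is not the Snowball Romanian algorithm and not the improved version the abstract describes: the suffix list, the `MIN_STEM_LEN` guard, and the function names `strip_suffix` and `accuracy` are all assumptions chosen for illustration. It only shows the general shape of the technique: try the longest matching suffix first, remove it if enough of the word remains, and score the output against known root forms.

```python
# Illustrative suffix list only (an assumption); the real Snowball Romanian
# stemmer uses a much larger, multi-step rule set.
SUFFIXES = ["ilor", "elor", "ului", "ile", "ele", "ul", "le", "a", "e", "i"]

# Assumed guard: never strip a suffix if fewer than this many characters remain.
MIN_STEM_LEN = 3


def strip_suffix(word: str) -> str:
    """Remove the longest matching suffix from a word, if the stem stays long enough."""
    word = word.lower()
    # Try longer suffixes first so that "ilor" wins over "i", for example.
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM_LEN:
            return word[: -len(suffix)]
    return word


def accuracy(pairs: list[tuple[str, str]]) -> float:
    """Fraction of (word, expected_stem) pairs that the stripper gets right."""
    if not pairs:
        return 0.0
    correct = sum(1 for word, expected in pairs if strip_suffix(word) == expected)
    return correct / len(pairs)
```

For instance, `strip_suffix("băiatul")` removes the suffix "ul", while a short word such as "om" is returned unchanged because no listed suffix matches. An accuracy figure such as the one quoted in the abstract could be measured with a helper like `accuracy`, given a hand-checked list of word and root pairs.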
 
         
     
         
         