public class LjubesicPandzicStemmer extends Object
This class implements the "Simple stemmer for Croatian v0.1" by
Nikola Ljubešić and Ivan Pandžić. The original implementation in
Python is available at:
http://nlp.ffzg.hr/resources/tools/stemmer-for-croatian/
The stemmer represents an improvement of an earlier algorithm
described in the paper:
Retrieving Information in Croatian: Building a Simple and Efficient
Rule-Based Stemmer, Nikola Ljubešić, Damir Boras, Ozren Kubelka,
Digital Information and Heritage, 313–320 (2007).
| Constructor | Description |
|---|---|
| LjubesicPandzicStemmer() | |
| Modifier and Type | Method | Description |
|---|---|---|
| String | stemWord(String word) | If a stop-word is encountered, it is skipped. |
public String stemWord(String word)
If a stop-word is encountered, it is skipped. Otherwise, the suffix of the word is first transformed and then removed.
word - The word to process
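The flow described for stemWord (skip stop-words, otherwise transform the suffix and then remove it) can be illustrated with a minimal, self-contained sketch. This is not the library's implementation: the class name StemFlowSketch, the stop-word list, the single transformation rule, the suffix list, and the minimum stem length are all hypothetical placeholders chosen for illustration, and the sketch assumes that a "skipped" stop-word is returned unchanged.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Set;

// Illustrative only: mirrors the documented order of steps, not the real rule set.
class StemFlowSketch {

    // Hypothetical stop-word list (assumption, not the stemmer's actual list).
    private static final Set<String> STOP_WORDS =
            new HashSet<>(Arrays.asList("i", "ili", "je"));

    // Hypothetical suffix transformations: old suffix -> replacement suffix.
    private static final Map<String, String> TRANSFORMATIONS = new LinkedHashMap<>();
    static {
        TRANSFORMATIONS.put("lozi", "loga");
    }

    // Hypothetical removable suffixes, checked in this order (longest first).
    private static final String[] SUFFIXES = {"ima", "om", "a", "e", "i", "u"};

    static String stemWord(String word) {
        String current = word.toLowerCase();

        // Step 1: stop-words are skipped (assumed here to mean "returned as-is").
        if (STOP_WORDS.contains(current)) {
            return current;
        }

        // Step 2: transform the suffix using the first matching rule.
        for (Map.Entry<String, String> rule : TRANSFORMATIONS.entrySet()) {
            if (current.endsWith(rule.getKey())) {
                int cut = current.length() - rule.getKey().length();
                current = current.substring(0, cut) + rule.getValue();
                break;
            }
        }

        // Step 3: remove the first matching suffix, keeping at least two characters.
        for (String suffix : SUFFIXES) {
            int cut = current.length() - suffix.length();
            if (current.endsWith(suffix) && cut >= 2) {
                return current.substring(0, cut);
            }
        }
        return current;
    }

    public static void main(String[] args) {
        System.out.println(stemWord("i"));
        System.out.println(stemWord("filolozi"));
        System.out.println(stemWord("filologa"));
        System.out.println(stemWord("knjigom"));
    }
}
```

With the actual class on the classpath, the equivalent call would be new LjubesicPandzicStemmer().stemWord(word), using the no-argument constructor and method listed above. The stems it returns are determined by the real rule set and may differ from this sketch.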
Copyright © 2018. All rights reserved.