Download Snowball for Java here.
It includes a class named org.tartarus.snowball.ext.italianStemmer
which extends SnowballStemmer
.
To use a SnowballStemmer
please take a look at the following test code for verb attaccare
present tense:
import org.junit.Test;
import org.tartarus.snowball.SnowballStemmer;
import org.tartarus.snowball.ext.italianStemmer;
public class SnowballItalianStemmerTest {
@Test
public void testSnowballItalianStemmerAttaccare() {
SnowballStemmer stemmer = (SnowballStemmer) new italianStemmer();
String[] tokens = "attacco attacchi attacca attacchiamo attaccate attaccano".split(" ");
for (String string : tokens) {
stemmer.setCurrent(string);
stemmer.stem();
String stemmed = stemmer.getCurrent();
Assert.assertEquals("attacc", stemmed);
System.out.println(stemmed);
}
}
}
Output:
attacc
attacc
attacc
attacc
attacc
attacc
For another example of use see TestApp.java
included in the same tgz file.
Lucene, which is written in Java, uses Snowball for stemming, for example as a filter in SnowballFilter.