snowball Questions
1
After parsing many documents, I have a lot of rows/columns with Ukrainian text that should be indexed for full-text search in Postgres.
I've found that Postgres 14 supports 29 languages by default,...
Poteet asked 30/9, 2022 at 10:22
5
Solved
I'm building a search engine for a website where users can be from many different countries and post text content.
I'll consider that:
- A French user generates content in French and English
- A German g...
Germanism asked 14/6, 2012 at 22:16
1
Solved
1
I'd like to use the stemmer here for merging word counts.
http://snowball.tartarus.org/download.html
The page has a download link, but I'm not sure how to integrate the files into my Eclipse projec...
1
Solved
I need to find out which analyzer (type, language..) is configured in a specific index.
I tried http://localhost:9200/wazzup/_mapping but it only gives information about field names/types.
Thanks
...
Woald asked 25/3, 2014 at 14:31
7
Solved
Do you know of any Java implementation of the Porter2 stemmer (or any better stemmer written in Java)? I know that there is a Java version of Porter (not Porter2) here:
http://tartarus.org/~martin/Por...
Wallinga asked 9/12, 2010 at 10:21
1
Solved
I'm searching for a Java library or something similar to do stemming of Italian strings of words.
The goal is to compare Italian words. At the moment, words like "attacco", "attacchi", "attaccare" etc., are co...
3
Solved
I need to take a paragraph of text and extract from it a list of "tags". Most of this is quite straightforward. However, I now need some help stemming the resulting word list to avoid duplicates. E...
Performative asked 10/10, 2008 at 10:43
3
Solved
Just getting started with Lucene.Net. I indexed 100,000 rows using the standard analyzer, ran some test queries, and noticed plural queries don't return results if the original term was singular. I und...
Anglesey asked 6/10, 2010 at 17:45
1