Hebrew dictionary for PostgreSQL on Heroku?
Asked Answered
V

1

7

Reading Heroku help on enabling full text search in PostgreSQL I see that it doesn't support Hebrew by default. Does anyone know how to add support for Hebrew dictionary in PostgreSQL on Heroku?

Vasoinhibitor answered 4/4, 2013 at 7:26 Comment(4)
did you manage to get full text search running on pg?Stratovision
I did, using pg_search gem github.com/Casecommons/pg_search The caveat is that I couldn't find any hebrew dictionaries, so I ended up using the default non-Hebrew dictionary which doesn't support all kinds of nice things you could expect Hebrew dictionary to support, like matching results that start with different letter.Vasoinhibitor
so did you get hebrew searches working without the dictionary?Stratovision
yep, hebrew searches work, but they are kinda basic. For example, if you look for "נשי" you'll find "נשים", but you won't find "לנשים".Vasoinhibitor
E
5

I work on Heroku Postgres, and would like input on this matter from those able to vend it.

I'm looking into this but so far the waters have been murky as to how Hebrew is supported in many/any open source projects, including dedicated full text searching projects like Lucene or Xapian. There are full blown toolchains for dealing with this, but their integration with PostgreSQL is not yet existent as far as I know, e.g. hebstem, hspell and libhspell, HebMorph.

If someone knows of what the current state of the art is for this in Postgres, I can try to make it work on Heroku at a time of my discretion, depending on the precise details of that implementation that I have to review somewhat carefully.

As-is my attempts to locate an ispell dictionary have been questionable, as is the efficacy of ispell style dictionaries for Hebrew given the reportedly very different stemming rules.

Related work:

Thoughts?

Ecphonesis answered 9/4, 2013 at 0:30 Comment(2)
does basic search in hebrew work (without a dictionary)?Stratovision
You can find the Hunspell hebrew dictionary here: src.chromium.org/svn/trunk/deps/third_party/…Tuppeny

© 2022 - 2024 — McMap. All rights reserved.