Dictionary File Structure of Open Spell-Checkers
Asked Answered
M

3

8

Is there any explanation docs or tutorials of the file structure of FreeDict, Aspell, Hunspell/OpenOffice Dictionaries especially concerning the switches at the end of each row in each .dic file? My guess is that the switches describe the semantic interpretation of the word whether it's a

  • noun
  • adjective
  • adverb
  • adverbial
  • etc.

or any combination of the above. But I don't know how to match these to the switch characters.

I'm also curios about what the .aff file describes.

Maddening answered 24/10, 2011 at 20:59 Comment(0)
E
1

This looks like a good starting point, and the downloads at this page may have the format documentation you're looking for.

Ensign answered 25/5, 2012 at 4:30 Comment(0)
P
1

Just a couple of links that might help you:

this is on sthackoverflow : What's the format of the OpenOffice dictionaries?

this second one is a good start

http://sourceforge.net/apps/mediawiki/freedict/index.php?title=Main_Page

hope this helps

Pouf answered 26/5, 2012 at 10:39 Comment(0)
A
1

In Hunspell the tags you choose are arbitrary, they have no meaning other than that which you assign to them. You can choose from using letters, numbers (1-65535) and more.

The affix file describes many things, but is mainly concerned with how words are inflected.

For example:

$ test.dic
4
apple/a
banana/a
green/b
small/b

$ test.aff
SFX a Y 2 # Allow the following 2 suffixes to words with the "a" flag.
SFX a 0 s . # An "s" at the end for words ending in any letter (signified by the dot). "Apples" and "bananas". 
SFX a 0 s' . # "Apples'" and "bananas'".

SFX b Y 2
SFX b 0 er . # "Greener" and "smaller".
SFX b 0 est . # "Greenest" and "smallest".

The manual explains most of the things in detail. There are also test files one can look at.

Aurify answered 1/12, 2013 at 0:19 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.