I would like to parse open office supporting hunspell formatted aff
and dic
files.
English aff
and dic
files can be downloaded from here for example : http://extensions.openoffice.org/en/project/english-dictionaries-apache-openoffice
I want to scan each line of the given .dic
file and generate every possible word of the each line with the provided .aff
file
How can i do that?
I have installed NHunspell framework but it does not have that feature : https://www.nuget.org/packages/NHunspell/
For example for the english language lets consider
make/UAGS
make can be make, made, makes, making
etc
Now i need parser to give me all these combinations. How can i obtain them? Ty very much
So basically i want to scan each line of the dictionary and generate all possible words from the word of that line and i dont know how can i do that
I can also write my own parsers, but it seems to me rules are pretty complex and there are no detailed and easy documentation about this
Here what i want basically. The image explains very clearly
Giving analyze/ADSG
, en.dic
and en.aff
file and obtaining all the following words
analyze, analyzes, analyzing, analyzed, reanalyze, reanalyzes, reanalyzing, reanalyzed