Hi I've just hacked the hunspell sources I got from CVS so that I could get rid of this hunspell bug: https://sourceforge.net/tracker/?func=detail&aid=3288562&group_id=143754&atid=756395 The issue was that hunspell would accept words with one suffix+compound, but not two suffixes+compound. Patch attached in the above link. ------- Since most words in Quichua have stem+suffix_level1+suffix_level2 +lots_of_compounds, this was a showstopper for my spellchecker. At first I wrote a generator script that would create all the different inflexes and stick it into a dictionary, but it was a bad hack and very slow with a dictionary file of 170000+entries for only some 3000 words... But now also analysis benefits :D hunspell -d qu_EC -m mikunakuykunatakapash pa:mikunakuy st:mikuna # base plur # Nounification "amongst us" pa:kuna st:kuna pa:ta st:ta pa:ka st:ka pa:pash [For the curious the word means something like "for to the big feasts, also"] -------- The patch is just two lines, one for each of two functions in affixmgr.cxx, compound_check() and compound_check_morph(). For LO only the one in AffMgr::compound_check() is needed, I think, haven't seen the morphology analysis being used anywhere in LO... best Arno Teigseth
Attachment:
signature.asc
Description: This is a digitally signed message part