2011 Archives by date, by thread · List index


Hi

I've just hacked the hunspell sources I got from CVS so that I could get
rid of this hunspell bug:
https://sourceforge.net/tracker/?func=detail&aid=3288562&group_id=143754&atid=756395

The issue was that hunspell would accept words with one suffix + compound,
but not two suffixes + compound. The patch is attached at the link above.
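To make the "one suffix vs. two suffixes" difference concrete, here is a toy recognizer. It is only a sketch: it is not hunspell's algorithm and not the patch, and the function names, the `max_suffixes` parameter and the split of morphemes into "suffixes" and "compound parts" are all invented for illustration.

```python
def only_parts(rest, parts):
    """True if `rest` is empty or a concatenation of compound parts."""
    if rest == "":
        return True
    return any(rest.startswith(p) and only_parts(rest[len(p):], parts)
               for p in parts)


def rest_ok(rest, suffixes, parts, budget):
    """Strip at most `budget` suffixes, then require compound parts only."""
    if only_parts(rest, parts):
        return True
    if budget == 0:
        return False
    return any(rest.startswith(s)
               and rest_ok(rest[len(s):], suffixes, parts, budget - 1)
               for s in suffixes)


def accepts(word, stems, suffixes, parts, max_suffixes):
    """Toy check: stem + up to `max_suffixes` suffixes + compound parts."""
    return any(word.startswith(stem)
               and rest_ok(word[len(stem):], suffixes, parts, max_suffixes)
               for stem in stems)


# Hypothetical segmentation, chosen only to exercise the code.
STEMS = {"mikuna"}
SUFFIXES = {"kuy", "kuna"}
PARTS = {"ta", "ka", "pash"}
WORD = "mikunakuykunatakapash"

one_level = accepts(WORD, STEMS, SUFFIXES, PARTS, max_suffixes=1)
two_levels = accepts(WORD, STEMS, SUFFIXES, PARTS, max_suffixes=2)
```

With this made-up segmentation, the word is rejected when only one suffix may be stripped before the compound parts and accepted when two are allowed, which is the shape of the behaviour described above.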

-------

Since most words in Quichua have the form
stem+suffix_level1+suffix_level2+lots_of_compounds, this was a
showstopper for my spellchecker.

At first I wrote a generator script that would create all the different
inflected forms and stick them into a dictionary, but it was a bad hack
and very slow, with a dictionary file of 170,000+ entries for only some
3,000 words...
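For illustration, a generator of that kind could look roughly like the sketch below. This is not the original script: the stems and suffix levels are placeholders (not a real Quichua grammar), and `expand` is a hypothetical helper name. It shows why the word list grows so quickly: every optional suffix level multiplies the number of forms.

```python
from itertools import product


def expand(stems, suffix_levels):
    """Yield each stem followed by one choice from every suffix level.

    An empty string in a level makes that level optional.
    """
    for stem in stems:
        for combo in product(*suffix_levels):
            yield stem + "".join(combo)


# Placeholder data, for illustration only.
stems = ["mikuna", "wasi"]
suffix_levels = [
    ["", "kuna"],
    ["", "ta", "pa"],
    ["", "ka"],
    ["", "pash"],
]

words = sorted(set(expand(stems, suffix_levels)))
forms_per_stem = len(words) // len(stems)
```

Here four small levels already give 2 * 3 * 2 * 2 = 24 forms per stem, so a few more levels are enough to turn some 3,000 stems into a very large flat dictionary.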

But now analysis benefits too :D

hunspell -d qu_EC -m   
mikunakuykunatakapash  pa:mikunakuy st:mikuna # base plur #
Nounification "amongst us" pa:kuna st:kuna pa:ta st:ta pa:ka st:ka
pa:pash

[For the curious, the word means something like "for to the big feasts,
also"]
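If you want to post-process analysis output like the above, the `pa:` and `st:` fields can be pulled out with a small script. A minimal sketch, assuming the analysis arrives as a single unwrapped line (the mail wrapped it); `parse_analysis` is a hypothetical helper, not part of hunspell.

```python
import re

# Matches morphological fields such as "st:mikuna" or "pa:kuna".
FIELD = re.compile(r"\b(pa|st):(\S+)")


def parse_analysis(line):
    """Return (word, fields) where fields is a list of (tag, value) pairs."""
    word = line.split(None, 1)[0]
    return word, FIELD.findall(line)


# The analysis from above, rejoined into one line.
line = ('mikunakuykunatakapash  pa:mikunakuy st:mikuna # base plur # '
        'Nounification "amongst us" pa:kuna st:kuna pa:ta st:ta '
        'pa:ka st:ka pa:pash')

word, fields = parse_analysis(line)
parts = [value for tag, value in fields if tag == "pa"]
stems = [value for tag, value in fields if tag == "st"]
```

The free-text comments between the fields are simply skipped, since only tokens of the form `pa:...` or `st:...` are matched.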
--------

The patch is just two lines, one for each of two functions in
affixmgr.cxx, compound_check() and compound_check_morph().

For LO only the one in AffixMgr::compound_check() is needed, I think; I
haven't seen the morphology analysis being used anywhere in LO...

best
Arno Teigseth




