Date: prev next · Thread: first prev next last
2014 Archives by date, by thread · List index


Dear László and others,

We think we may have found a bug in libhyphen. It could be that it is just
a limitation of the algorithm, but anyway it's an issue for us.

The problem is that some patterns in a dictionary are ignored in some
cases, namely when the match string of that pattern is a part of the match
string of another pattern, and more specifically when it's not just a
prefix.

Let me clarify that with an example. When a dictionary consists of these
two patterns, the word `foobar' is not hyphenated because the first pattern
is ignored:

oo1b
foob

The second, longer pattern doesn't even have to match, as the second
example shows (the first pattern is still ignored):

oo1b
foobz

I have a patch that solves part of the problem:

https://github.com/bertfrees/libhyphen-nar/blob/adc2b74a19469e4dc93777fcdb82e36e566a0472/src/patches/bug.patch

With this patch the given examples will be handled correctly, but in other
situations it will still fail, such as here:

oo1b
foobaz

Have I indeed found a bug here, and does my patch make sense, or am I just
expecting too much and are we hitting the limits of the algorithm?

Thanks for considering,
Bert Frees

Context


Privacy Policy | Impressum (Legal Info) | Copyright information: Unless otherwise specified, all text and images on this website are licensed under the Creative Commons Attribution-Share Alike 3.0 License. This does not include the source code of LibreOffice, which is licensed under the Mozilla Public License (MPLv2). "LibreOffice" and "The Document Foundation" are registered trademarks of their corresponding registered owners or are in actual use as trademarks in one or more countries. Their respective logos and icons are also subject to international copyright laws. Use thereof is explained in our trademark policy.