Dear László and others,
We think we may have found a bug in libhyphen. It may turn out to be a
limitation of the algorithm rather than a bug, but either way it is a
problem for us.
The problem is that some patterns in a dictionary are ignored in some
cases, namely when the match string of a pattern (the pattern with its
digits removed) is a substring of the match string of another pattern,
and more specifically when it is not just a prefix of it.
Let me clarify that with an example. When a dictionary consists of these
two patterns, the word `foobar' is not hyphenated, because the first
pattern is ignored (I would expect `foo-bar'):
oo1b
foob
The second, longer pattern doesn't even have to match the word, as this
second example shows (the first pattern is still ignored):
oo1b
foobz
I have a patch that solves part of the problem:
https://github.com/bertfrees/libhyphen-nar/blob/adc2b74a19469e4dc93777fcdb82e36e566a0472/src/patches/bug.patch
With this patch the examples above are handled correctly, but other
cases still fail, such as this one:
oo1b
foobaz
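To make the expected behaviour concrete, here is a small Python sketch of
how I understand the patterns should be applied: every pattern is matched
independently at every position in the word, and the highest value wins at
each position. This is only a naive reference, not libhyphen's code; the
function names are my own, it assumes single-digit values, and it ignores
the left/right hyphenmin settings.

```python
def parse_pattern(pattern):
    """Split a pattern such as 'oo1b' into its letters and inter-letter values.

    values[i] is the value in front of letters[i]; the last entry is the
    value after the final letter. Assumes single-digit values.
    """
    letters = []
    values = [0]
    for ch in pattern:
        if ch.isdigit():
            values[-1] = int(ch)
        else:
            letters.append(ch)
            values.append(0)
    return "".join(letters), values


def hyphenate(word, patterns):
    """Apply every pattern at every position; odd values mark hyphen points."""
    text = "." + word + "."          # dots mark the word boundaries
    levels = [0] * (len(text) + 1)   # levels[i] is the value in front of text[i]
    for pattern in patterns:
        letters, values = parse_pattern(pattern)
        start = text.find(letters)
        while start != -1:
            for offset, value in enumerate(values):
                pos = start + offset
                levels[pos] = max(levels[pos], value)
            start = text.find(letters, start + 1)
    pieces = []
    for k, ch in enumerate(word):
        # word[k] is text[k + 1], so the value in front of it is levels[k + 1]
        if k > 0 and levels[k + 1] % 2 == 1:
            pieces.append("-")
        pieces.append(ch)
    return "".join(pieces)


# The three dictionaries from this message; each one prints foo-bar.
for dictionary in (["oo1b", "foob"], ["oo1b", "foobz"], ["oo1b", "foobaz"]):
    print(hyphenate("foobar", dictionary))
```

With this naive matcher the presence of the second pattern makes no
difference, which is the behaviour I was expecting from libhyphen as well.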
Have I indeed found a bug here, and does my patch make sense? Or am I
expecting too much, and are we simply hitting the limits of the algorithm?
Thanks for considering,
Bert Frees