Date: prev next · Thread: first prev next last
2016 Archives by date, by thread · List index

Dear Rimas,

On Sun, Apr 3, 2016 at 11:55 PM, Rimas Kudelis <> wrote:

With all the data you shared, I'm even more certain that this belongs to
the locale data, much like quotation characters and number formatting
characters. I'm not sure if this locale property is readily available
for inclusion in locale data though.

A while ago, I created the LO locale XML files for Church Slavonic
("Church Slavic"). Please see here:

There was nothing there about hyphenation characters, AFAICT.

It might be that Slavonic is a very
rare exception to the common rule of using hyphens for that, and that
this hasn't been accounted for anywhere.

I don't think it's that uncommon. Unicode includes a number of
script-specific hyphenation characters, for example U+058A Armenian
Hyphen, U+1400 Canadian Syllabics Hyphen, etc. How are users supposed
to use those? Also, some Indic scripts, IIUC, do not use a hyphen
character at all; they just split a word across line.  What if users
are using a legacy codepage where the Hyphen is encoded somewhere
other that U+002D? (BTW, strictly speaking U+002D is *not* a hyphen,
and LO should really be using U+2010 for hyphenation). Or the user
wants to set some decorative character to be a hyphen.

IMHO, a hyphenation character should be settable from the user
interface, for example, together with the "Characters at line end" and
"Characters at line begin" in Format->Paragraph -> Text Flow. It
should not involve having to hack an XML file and rebuild LO from

BTW, despite setting LEFTHYPHMIN and RIGHTHYPHMIN in the hyphenation
dictionary, "Characters at line end" and "Characters at line begin"
cannot be set lower than 2. But Church Slavic uses LEFTHYPHEMIN = 1
(Ancient Greek uses both LEFTHYPHMIN and RIGHTHYPHMIN = 1). Is this a
bug? Or a feature?

anything about this neither in the LDML standard, nor in our DTD for
locale definition files

So, I guess as a first step, LO should support changing the hyphen
character in the XML locale files. Or, is this really a Hunspell
issue, and it should be specified from the hyphenation dictionary
extension? Could someone confirm this?



To unsubscribe e-mail to:
Posting guidelines + more:
List archive:
All messages sent to this list will be publicly archived and cannot be deleted


Privacy Policy | Impressum (Legal Info) | Copyright information: Unless otherwise specified, all text and images on this website are licensed under the Creative Commons Attribution-Share Alike 3.0 License. This does not include the source code of LibreOffice, which is licensed under the Mozilla Public License (MPLv2). "LibreOffice" and "The Document Foundation" are registered trademarks of their corresponding registered owners or are in actual use as trademarks in one or more countries. Their respective logos and icons are also subject to international copyright laws. Use thereof is explained in our trademark policy.