[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[libreoffice-l10n] hyphen breaking behaviour
- Subject: [libreoffice-l10n] hyphen breaking behaviour
- From: Michael Bauer <email@example.com>
- Date: Wed, 18 May 2016 15:09:41 +0100
- To: LibreOffice l10n <firstname.lastname@example.org>
How we've managed to fix the smart quote issue for our locale (many thanks to Mikhel and Eike!) I've got another headache I would like to tackle. I've asked Eike but he's not entirely sure and suggested I ask the list.
The problem is this: Gaelic restricts the breaking of hyphens across lines if there are less than two letters before the hyphen. In effect, this affects a finite number of prefixes, mostly:
n-WORD (e.g. ar n-aran)
t-WORD (e.g. an t-aran)
h-WORD (e.g. na h-aran)
an-WORD (e.g. seall an-seo)
(and a few more, I can, if necessary supply a full list).
At the moment, at the end of a line a word like an-seo just gets broken into an-
seo. The only known fix is that once you're done writing, you do a global replace of <an-> to <tU+2011> (i.e. a NON-BREAKING HYPHEN) etc.
Is there something one could exploit to force specific instance of HYPHEN to the NON-BREAKING HYPHEN? The main problem I can see would be that if one cannot distinguish between single t and t word finally, something like cat-ghrìosaich might also end up with a non-breaking hyphen.
Eike suggested the following:
Maybe some rule for AutoCorrection could match this, but I'm not very
familiar with. However, the en-GB lists are under
extras/source/autocorr/lang/en-GB/ in the source tree, likely such rules
similar to the ones in
extras/source/autocorr/lang/en-GB/DocumentList.xml would go into
extras/source/autocorr/lang/gd-GB/DocumentList.xml and using something
<block-list:block block-list:abbreviated-name="n-.*" block-list:name="‑"/>
where the second "‑" is a U+2011 NON-BREAKING HYPHEN.
The ".*" seems to indicate "any sequence of non-word-boundary characters
including none", so also "n-" without a following WORD would be matched.
Apparently there is no support for ".+" or other regular expressions.
To unsubscribe e-mail to: email@example.com
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/l10n/
All messages sent to this list will be publicly archived and cannot be deleted
- Prev by Date: [libreoffice-l10n] Re: [IMPORTANT] Documentation team CONFERENCE CALL Wednesday May 18th
- Next by Date: [libreoffice-l10n] Documentation hangout in 10 minutes
- Previous by thread: [libreoffice-l10n] [IMPORTANT] Documentation team CONFERENCE CALL Wednesday May 18th
- Next by thread: [libreoffice-l10n] Documentation hangout in 10 minutes