

Hi Khaled,

On Monday, 2015-06-29 20:40:46 +0200, Khaled Hosny wrote:

The script is generally implicit in the text.

You want to rely on automatic detection of scripts depending on the
language chosen? Do you plan to implement that? However, even then the
resulting tag would include the script code if it wasn't the default
script of the language.

Almost every character in Unicode has a script property; the exceptions
are characters that have Inherited (usually combining marks) or Common
(mostly punctuation), but there is a simple and pretty reliable way to
resolve the script of those characters from the context.

We already handle this at the text shaping level in VCL for platforms
where HarfBuzz is used.
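
To illustrate the kind of context-based resolution described above, here
is a minimal sketch. It is not the VCL or HarfBuzz code. The range table
is a tiny hand-written excerpt (an assumption for illustration; real code
would use the full Unicode script data), and the rule "take the script of
the preceding character, or of the following one at the start of the text"
is a simplification.

```python
# Hypothetical, heavily abbreviated script table: (first, last, ISO 15924 code).
SCRIPT_RANGES = [
    (0x0041, 0x005A, "Latn"),
    (0x0061, 0x007A, "Latn"),
    (0x0300, 0x036F, "Zinh"),  # combining marks, treated as Inherited here
    (0x0391, 0x03A9, "Grek"),
    (0x03B1, 0x03C9, "Grek"),
    (0x05D0, 0x05EA, "Hebr"),
    (0x0621, 0x063A, "Arab"),
    (0x0641, 0x064A, "Arab"),
]

UNRESOLVED = ("Zyyy", "Zinh")  # Common and Inherited


def raw_script(ch):
    """Look up the script of one character in the excerpt table."""
    cp = ord(ch)
    for first, last, script in SCRIPT_RANGES:
        if first <= cp <= last:
            return script
    return "Zyyy"  # everything not in the excerpt counts as Common in this sketch


def resolve_scripts(text):
    """Return one script code per character, resolving Common/Inherited."""
    scripts = [raw_script(ch) for ch in text]
    # Forward pass: unresolved characters take the script of what precedes them.
    last = None
    for i, script in enumerate(scripts):
        if script in UNRESOLVED:
            if last is not None:
                scripts[i] = last
        else:
            last = script
    # Backward pass: unresolved characters at the start take the script
    # of the first resolved character that follows them.
    following = None
    for i in range(len(scripts) - 1, -1, -1):
        if scripts[i] in UNRESOLVED:
            if following is not None:
                scripts[i] = following
        else:
            following = scripts[i]
    return scripts


def script_runs(text):
    """Group the text into (substring, script) runs."""
    runs = []
    for ch, script in zip(text, resolve_scripts(text)):
        if runs and runs[-1][1] == script:
            runs[-1] = (runs[-1][0] + ch, script)
        else:
            runs.append((ch, script))
    return runs
```

With this table, script_runs("abc, αβγ!") yields one Latn run "abc, " and
one Grek run "αβγ!"; text consisting only of Common characters stays
unresolved as Zyyy.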

I think we are talking about two different things here. My view is that
of correct language tag attribution, which we need anyway for document
storage, spell-checkers and locale-dependent representation. When I
mention "language tag" I'm always talking about BCP 47 language tags.
You, and possibly Richard, have the runtime view of what could be
detected automatically. So, even if the script is detected automatically,
we'll have to assign a language tag that, for the non-default script of
a language, includes the ISO 15924 script code.
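
As a rough sketch of that rule: the tag carries the script subtag only
when the script differs from the language's default. The DEFAULT_SCRIPT
table and the language_tag() helper are hypothetical names for
illustration, and the table is just a small excerpt of default scripts,
not what LibreOffice actually uses.

```python
# Small illustrative excerpt of default scripts per language (assumption:
# real code would take these from the IANA language subtag registry).
DEFAULT_SCRIPT = {
    "en": "Latn",
    "de": "Latn",
    "ru": "Cyrl",
    "ar": "Arab",
    "he": "Hebr",
}


def language_tag(language, script, region=None):
    """Build a BCP 47 style tag, omitting the script if it is the default."""
    parts = [language]
    # Languages without a known default always keep their script subtag.
    if DEFAULT_SCRIPT.get(language) != script:
        parts.append(script)
    if region:
        parts.append(region)
    return "-".join(parts)
```

For example, language_tag("ru", "Cyrl", "RU") gives "ru-RU", while
language_tag("ru", "Latn") gives "ru-Latn".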

We could of course come up with a huge table of all scripts and the
possible languages associated with each and present things entirely
differently in the UI. But for MSO interoperability and ODF conformance
we'll have to stick with the (to me) unfortunate and arbitrary
"Western"/CTL/CJK classification, into which we'll have to squeeze all
language tags. The correct route to go is probably to assign known
scripts to these classes, whether detected automatically or not, and
distribute language tags over those classes according to their (implied
or not) script.
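
A minimal sketch of that route, under assumptions: SCRIPT_CLASS and
IMPLIED_SCRIPT are hypothetical, abbreviated tables I made up for
illustration (not existing LibreOffice data), and the tag parsing only
looks for a four-letter script subtag instead of doing full BCP 47
parsing.

```python
# Hypothetical assignment of ISO 15924 script codes to the three classes.
SCRIPT_CLASS = {
    "Latn": "Western", "Grek": "Western", "Cyrl": "Western",
    "Arab": "CTL", "Hebr": "CTL", "Thai": "CTL", "Deva": "CTL",
    "Hani": "CJK", "Hans": "CJK", "Hant": "CJK",
    "Jpan": "CJK", "Kore": "CJK", "Hang": "CJK",
}

# Hypothetical excerpt of the script implied by a bare language subtag.
IMPLIED_SCRIPT = {
    "en": "Latn", "ru": "Cyrl", "ar": "Arab",
    "he": "Hebr", "ja": "Jpan", "ko": "Kore",
}


def script_of(tag):
    """Return the explicit script subtag of a tag, else the implied script."""
    subtags = tag.split("-")
    for sub in subtags[1:]:
        # A script subtag is four letters; regions and variants are not.
        if len(sub) == 4 and sub.isalpha():
            return sub.title()
    return IMPLIED_SCRIPT.get(subtags[0].lower())


def script_class(tag):
    """Map a language tag to "Western", "CTL" or "CJK"; None if unknown."""
    return SCRIPT_CLASS.get(script_of(tag))
```

So "ar-EG" lands in CTL through its implied script, while "ru-Latn" and
"sr-Cyrl-RS" land in Western through their explicit script subtags.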

  Eike

-- 
LibreOffice Calc developer. Number formatter stricken i18n transpositionizer.
GPG key "ID" 0x65632D3A - 2265 D7F3 A7B0 95CC 3918  630B 6A6C D5B7 6563 2D3A
Better use 64-bit 0x6A6CD5B765632D3A here is why: https://evil32.com/
Care about Free Software, support the FSFE https://fsfe.org/support/?erack
