A little more information:
On Aug 29, 2011, at 10:22 , James Wilde wrote:
Can't think of a better subject line.
This is about the Ubuntu repository version of LibO running under Ubuntu 11.04 linux. The
repository version is described as follows:
LibreOffice 3.3.3
OOO 330m 19 (Build:301)
LibreOffice 3.3.3.1 Ubuntu package 1.3.3.3ubuntu2
I have been testing Gnucash on linux, and in particular the ability to export data to an html
file and read that into other programs, specifically LibreOffice Calc. The whole process went
excellently with one small exception: the translation of the Swedish characters, å, ä and ö (and
of course the capitals, Å, Ä and Ö which I have not tested).
The html file quite clearly includes the following at the beginning:
<meta http-equiv="content-type" content="text-html; charset=utf-8">
However, I have had the explanation that the conversion below is the result of LibO taking the
input as ISO-8859-1 and converting it to UTF-8. If someone in here can confirm this, I will write
this up as a bug. As mentioned below in the original mail, this does not occur on Windows, nor on
the Mac. (I just tested it on the Mac).
Instead of seeing each of the three letters, in each case I saw two characters, å, for example
instead of å. I checked the three characters in a hex editor, and found the following:
å was represented by C3A5 in other applications, but as C383C2A5 in LibO
ä was represented by C3A4 in other applications, but as C383C2A4 in LibO
ö was represented by C3B6 in other applications, but as C383C2B6 in LibO
I checked the original data in Gnucash, the intermediary form in the html file and the contents
of content.xml in LibO. The first two had the short form, LibO had the second form.
I have also checked the same information in Windows, although I have not yet used a hex editor in
Windows. Both Excel and LibO Calc imported the data correctly, or rather, displayed it correctly.
Before I report this as a bug, I'd like to be sure it's not something I'm missing in my
configuration. All programs are set to use UTF-8, insofar as one can set it in the program
rather than in the operating system.
A late thought occurs to me. I use LibO Writer for writing Mandarin and have set 'Enabled for
Asian languages' under Languages. Can this have affected the way LibO translates the non-English
characters? Although LibO has actually _added_ two bytes to the characters.
//James
--
For unsubscribe instructions e-mail to: users+help@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted
Context
Privacy Policy |
Impressum (Legal Info) |
Copyright information: Unless otherwise specified, all text and images
on this website are licensed under the
Creative Commons Attribution-Share Alike 3.0 License.
This does not include the source code of LibreOffice, which is
licensed under the Mozilla Public License (
MPLv2).
"LibreOffice" and "The Document Foundation" are
registered trademarks of their corresponding registered owners or are
in actual use as trademarks in one or more countries. Their respective
logos and icons are also subject to international copyright laws. Use
thereof is explained in our
trademark policy.