On 2/5/2014 6:26 PM, Keith Bates wrote:
Here is the official bug report:
On 06/02/14 10:21, null wrote:
On 2/5/2014 6:16 PM, Keith Bates wrote:
Well, actually, there was an HTML tag in the file before. You may
not have seen it because the source was crammed into one line. I
think I have enough information to file a bug!
Yes!! It works!
I see that as well as the other changes relating to the DOCTYPE line
you've added a <html> tag.
Thank you for helping with this. Feel free to use the file as an
example if needed.
Thanks for reporting the issue. Now that it has been reported, it might
I hope that the devs can fix this soon,
Congrats and regards from
On 06/02/14 09:56, null wrote:
On 2/5/2014 5:29 PM, Keith Bates wrote:
Does this file work?:
It is a revised version of the Moneydance file you sent me. It
renders fine in LibreOffice. The reason I am asking you to try so
many things is so that I can attempt to gather enough information
to file a bug.
I've just tried all of those changes and none of them work.
Additionally, I've tried opening from a blank Writer/Web document,
from file manager, from the opening screen- all with the same result.
Just for reference, as we seem to be getting very different
results my version of LO reports as Version: 188.8.131.52 Build ID:
Mine reports as:
Build ID: 05dceb5d363845f2cf968344d7adab8dcfb2ba71
On 06/02/14 09:07, null wrote:
I just discovered something. If the DOCTYPE of the HTML file
is not the topmost line (line 1), LibreOffice renders the file
fine (instead of showing the source). However, If there is a
blank line before the DOCTYPE (like the HTML file Keith
uploaded), the HTML parser is tripped and shows the source. So
maybe LibreOffice expects the DOCTYPE to be on line 1, and if not
on line 1, panics and doesn't render the HTML. But still,
LibreOffice shouldn't have trouble rendering the file, even if
the person who wrote the HTML (or the generator) didn't follow
every strict standard. Web browsers know how to adapt to that
kind of stuff.
So, the file that Keith uploaded renders successfully in
LibreOffice 4.2 if the <!DOCTYPE ...> syntax is corrected to:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"
, and the DOCTYPE is on line 1. Is anyone able to confirm this?
Not to mention the unclosed <P> (paragraph) tag in the HTML file
(luckily that didn't seem to influence whether Writer/Web
rendered the page correctly).
On 2/5/2014 2:45 PM, firstname.lastname@example.org wrote:
Uploading the document to the W3C validator
(http://validator.w3.org/) it appears the DOCTYPE is not valid.
It should be:
-//W3C//DTD HTML 4.0 Transitional//EN
(the file is missing "//EN" from the end).
The original document doesn't fail with the version of LO I
have, so cannot check, but try correcting the DOCTYPE. If that
allows the file to open as expected, it appears the problem is
with moneydance producing invalid output, rather than with LO.
Arbitrarily changing the DOCTYPE to indicate HTML5 may well also
fail, as the rest of the document is not valid HTML5.
It looks like the problem is with LibreOffice interpreting
DOCTYPEs. You see, every HTML is supposed to have a DOCTYPE at
of the file, explaining what version of HTML (HTML5, HTML4.01,
file uses. The file you uploaded has a DOCTYPE of HTML4.01
Transitional. Anyway, if I remove the <!DOCTYPE ...> part of
file, and reopen it in LibreOffice, it works just fine, and
file's contents (not the source). Even switching the DOCTYPE to
<!DOCTYPE html> (HTML5) causes the problem. So, in theory, if
to view an HTML file in LibreOffice 4.2, you have to remove its
<!DOCTYPE ...>, and it will work fine. Would you like an
to be filed so that the developers know of the problem and can
On 2/5/2014 12:05 AM, Keith Bates wrote:
I've produced a document that doesn't work with LO. It is at
Strangely it does open in Abiword.
Earlier documents produced by moneydance as recently as last
open fine in LO. I have not upgraded moneydance in that time.
Having quickly opened about 20 html's the problem is limited
produced lately by moneydance.
I haven't had time to look closely at the html source. I
possible that LO is now being more strict in its interpretation.
The work around in the short term might be to use Abiword.
To unsubscribe e-mail to: email@example.com
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted
- Re: [libreoffice-users] Importing HTML in LO 4.2 (continued)
- (message not available)
Impressum (Legal Info)
: Unless otherwise specified, all text and images
on this website are licensed under the
Creative Commons Attribution-Share Alike 3.0 License
This does not include the source code of LibreOffice, which is
licensed under the Mozilla Public License (MPLv2
"LibreOffice" and "The Document Foundation" are
registered trademarks of their corresponding registered owners or are
in actual use as trademarks in one or more countries. Their respective
logos and icons are also subject to international copyright laws. Use
thereof is explained in our trademark policy