Date: prev next · Thread: first prev next last
2014 Archives by date, by thread · List index


Hello,
I just discovered something. If the DOCTYPE of the HTML file is not the topmost line (line 1), LibreOffice renders the file fine (instead of showing the source). However, If there is a blank line before the DOCTYPE (like the HTML file Keith uploaded), the HTML parser is tripped and shows the source. So maybe LibreOffice expects the DOCTYPE to be on line 1, and if not on line 1, panics and doesn't render the HTML. But still, LibreOffice shouldn't have trouble rendering the file, even if the person who wrote the HTML (or the generator) didn't follow every strict standard. Web browsers know how to adapt to that kind of stuff. So, the file that Keith uploaded renders successfully in LibreOffice 4.2 if the <!DOCTYPE ...> syntax is corrected to: <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/1998/REC-html40-19980424/loose.dtd";>
, and the DOCTYPE is on line 1.  Is anyone able to confirm this?
Not to mention the unclosed <P> (paragraph) tag in the HTML file (luckily that didn't seem to influence whether Writer/Web rendered the page correctly).

Regards,
xmlhttprequest.open@gmail.com

On 2/5/2014 2:45 PM, libreoffice-ml.mbourne@spamgourmet.com wrote:
Uploading the document to the W3C validator (http://validator.w3.org/) it appears the DOCTYPE is not valid. It should be:
-//W3C//DTD HTML 4.0 Transitional//EN
(the file is missing "//EN" from the end).

The original document doesn't fail with the version of LO I have, so cannot check, but try correcting the DOCTYPE. If that allows the file to open as expected, it appears the problem is with moneydance producing invalid output, rather than with LO.

Arbitrarily changing the DOCTYPE to indicate HTML5 may well also fail, as the rest of the document is not valid HTML5.

Mark.


null wrote:
Hello,
     It looks like the problem is with LibreOffice interpreting
DOCTYPEs.  You see, every HTML is supposed to have a DOCTYPE at the top
of the file, explaining what version of HTML (HTML5, HTML4.01, etc) the
file uses.  The file you uploaded has a DOCTYPE of HTML4.01
Transitional.  Anyway, if I remove the <!DOCTYPE ...> part of the HTML
file, and reopen it in LibreOffice, it works just fine, and shows the
file's contents (not the source). Even switching the DOCTYPE to
<!DOCTYPE html> (HTML5) causes the problem.  So, in theory, if you want
to view an HTML file in LibreOffice 4.2, you have to remove its
<!DOCTYPE ...>, and it will work fine.  Would you like an official bug
to be filed so that the developers know of the problem and can try to
fix it?

Regards,
xmlhttprequest.open@gmail.com

On 2/5/2014 12:05 AM, Keith Bates wrote:
I've produced a document that doesn't work with LO. It is at
https://www.dropbox.com/s/mbylcso4r2qm2r2/test.html

Strangely it does open in Abiword.

Earlier documents produced by moneydance as recently as last month
open fine in LO. I have not upgraded moneydance in that time.

Having quickly opened about 20 html's the problem is limited to those
produced lately by moneydance.

I haven't had time to look closely at the html source. I suppose it's
possible that LO is now being more strict in its interpretation.

The work around in the short term might be to use Abiword.




.



--
To unsubscribe e-mail to: users+unsubscribe@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted

Context


Privacy Policy | Impressum (Legal Info) | Copyright information: Unless otherwise specified, all text and images on this website are licensed under the Creative Commons Attribution-Share Alike 3.0 License. This does not include the source code of LibreOffice, which is licensed under the Mozilla Public License (MPLv2). "LibreOffice" and "The Document Foundation" are registered trademarks of their corresponding registered owners or are in actual use as trademarks in one or more countries. Their respective logos and icons are also subject to international copyright laws. Use thereof is explained in our trademark policy.