Hello,I just discovered something. If the DOCTYPE of the HTML file is not the topmost line (line 1), LibreOffice renders the file fine (instead of showing the source). However, If there is a blank line before the DOCTYPE (like the HTML file Keith uploaded), the HTML parser is tripped and shows the source. So maybe LibreOffice expects the DOCTYPE to be on line 1, and if not on line 1, panics and doesn't render the HTML. But still, LibreOffice shouldn't have trouble rendering the file, even if the person who wrote the HTML (or the generator) didn't follow every strict standard. Web browsers know how to adapt to that kind of stuff. So, the file that Keith uploaded renders successfully in LibreOffice 4.2 if the <!DOCTYPE ...> syntax is corrected to: <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/1998/REC-html40-19980424/loose.dtd">
, and the DOCTYPE is on line 1. Is anyone able to confirm this?Not to mention the unclosed <P> (paragraph) tag in the HTML file (luckily that didn't seem to influence whether Writer/Web rendered the page correctly).
Regards, xmlhttprequest.open@gmail.com On 2/5/2014 2:45 PM, libreoffice-ml.mbourne@spamgourmet.com wrote:
Uploading the document to the W3C validator (http://validator.w3.org/) it appears the DOCTYPE is not valid. It should be:-//W3C//DTD HTML 4.0 Transitional//EN (the file is missing "//EN" from the end).The original document doesn't fail with the version of LO I have, so cannot check, but try correcting the DOCTYPE. If that allows the file to open as expected, it appears the problem is with moneydance producing invalid output, rather than with LO.Arbitrarily changing the DOCTYPE to indicate HTML5 may well also fail, as the rest of the document is not valid HTML5.Mark. null wrote:Hello, It looks like the problem is with LibreOffice interpreting DOCTYPEs. You see, every HTML is supposed to have a DOCTYPE at the top of the file, explaining what version of HTML (HTML5, HTML4.01, etc) the file uses. The file you uploaded has a DOCTYPE of HTML4.01 Transitional. Anyway, if I remove the <!DOCTYPE ...> part of the HTML file, and reopen it in LibreOffice, it works just fine, and shows the file's contents (not the source). Even switching the DOCTYPE to <!DOCTYPE html> (HTML5) causes the problem. So, in theory, if you want to view an HTML file in LibreOffice 4.2, you have to remove its <!DOCTYPE ...>, and it will work fine. Would you like an official bug to be filed so that the developers know of the problem and can try to fix it? Regards, xmlhttprequest.open@gmail.com On 2/5/2014 12:05 AM, Keith Bates wrote:I've produced a document that doesn't work with LO. It is at https://www.dropbox.com/s/mbylcso4r2qm2r2/test.html Strangely it does open in Abiword. Earlier documents produced by moneydance as recently as last month open fine in LO. I have not upgraded moneydance in that time. Having quickly opened about 20 html's the problem is limited to those produced lately by moneydance. I haven't had time to look closely at the html source. I suppose it's possible that LO is now being more strict in its interpretation. The work around in the short term might be to use Abiword..
-- To unsubscribe e-mail to: users+unsubscribe@global.libreoffice.org Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/ Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette List archive: http://listarchives.libreoffice.org/global/users/ All messages sent to this list will be publicly archived and cannot be deleted