Date: prev next · Thread: first prev next last
2018 Archives by date, by thread · List index


Thanks Noel!

I’ve poked through

  https://github.com/LibreOffice/core/tree/master/sw/source/filter/html

and couldn’t find anything hinting at expanding HTML entities.  Looking
at the parser you indicated, looks like this code would cover HTML
entities:

  https://github.com/LibreOffice/core/blob/master/svtools/source/svhtml/parhtml.cxx#L394-L622

which are listed here:

  https://dev.w3.org/html5/html-author/charref

I’ll take a closer look…  

Cheers,
Jens


On Thu, Sep 27, 2018 at 11:24:40AM +0200, Noel Grandin wrote:

On 2018/09/27 11:10 AM, Jens Tröger wrote:
I’ve been poking through the HTML reader (rather superficially, I admit) in
search for the code that expands HTML entities to Unicode. I did that to


Probably
    HTMLParser::ScanText
at
    svtools/source/svhtml/parhtml.cxx:394

-- 
Jens Tröger
http://savage.light-speed.de/

Context


Privacy Policy | Impressum (Legal Info) | Copyright information: Unless otherwise specified, all text and images on this website are licensed under the Creative Commons Attribution-Share Alike 3.0 License. This does not include the source code of LibreOffice, which is licensed under the Mozilla Public License (MPLv2). "LibreOffice" and "The Document Foundation" are registered trademarks of their corresponding registered owners or are in actual use as trademarks in one or more countries. Their respective logos and icons are also subject to international copyright laws. Use thereof is explained in our trademark policy.