On 28/06/11 00:06, Séamas Ó Brógáin wrote:
I wonder if some regex wizard knows of a way to delete HTML (or XML or
whatever) tags from a text file: in other words, selecting everything
from each occurrence of< to the next occurrence of>.
If you are trying to do this from within LibreOffice use the
"Alternative Find and Replace for Writer" Plugin.
There is an option to remove HTML tags.
You can also google html tag removal using bash and Python. Both work.
--
Cheers Simon
Simon Cropper
Website Administrator
http://www.fossworkflowguides.com
The fossWorkflow Guides
--
Unsubscribe instructions: E-mail to users+help@global.libreoffice.org
In case of problems unsubscribing, write to postmaster@documentfoundation.org
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted
Context
- Re: [libreoffice-users] Regular expressions · Simon Cropper (The foss Workflow Guides)
Privacy Policy |
Impressum (Legal Info) |
Copyright information: Unless otherwise specified, all text and images
on this website are licensed under the
Creative Commons Attribution-Share Alike 3.0 License.
This does not include the source code of LibreOffice, which is
licensed under the Mozilla Public License (
MPLv2).
"LibreOffice" and "The Document Foundation" are
registered trademarks of their corresponding registered owners or are
in actual use as trademarks in one or more countries. Their respective
logos and icons are also subject to international copyright laws. Use
thereof is explained in our
trademark policy.