Date: prev next · Thread: first prev next last
2011 Archives by date, by thread · List index



I need some "guidance" with the extent of my dictionary files for LibreOffice and OOo.

My largest dictionaries are about 638,000 words in the spelling word .dic file. I need to know how large it too large.

I found out this morning that if I compare that word list with a combined list for chemical and medical words, over 98,000 words from that combined list is not in the current .dic word list[s].

Now here it the issue, how far should I take this project?

I am going to add all the "missing" words that are part of the open-source community's lexicon that are not in the current lists, but where do I stop, and how should I format the "finalized" files?

Should there be one super large list, or should I break it up into sub-lists? Should the "standard" words go into one .dic file, while medical, chemistry, and computer/tech words each have their own .dic file within the .oxt file?

Right now, there is an English dictionary [default one?] that includes US, British, Canadian, and some other versions of English put together as one .oxt file, but separate .dic files. I was wondering if that would be the route I should go with my super-size dictionaries.

To be honest, 20 years ago the spelling dictionary project I was working on has about 177,000 words and I was told that the English language was about 250,000 words. Now I have looked at a combined word list and it has about 737K words in it and there are more words/terms still needing to be checked. The largest book style dictionary now has 25+ volumes to it when it was only 15 about 15-20 years ago. So I really think the final super-sized dictionary word list could one day go over one million in the next year or two. I just have to figure out if it is worth building a list for LO to that size.

Your input would help me make the best US, British, and Canadian English dictionaries out there for LibreOffice. This is for our users to use, so it would be nice for users to let me know what they think.


--
For unsubscribe instructions e-mail to: users+help@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted

Context


Privacy Policy | Impressum (Legal Info) | Copyright information: Unless otherwise specified, all text and images on this website are licensed under the Creative Commons Attribution-Share Alike 3.0 License. This does not include the source code of LibreOffice, which is licensed under the Mozilla Public License (MPLv2). "LibreOffice" and "The Document Foundation" are registered trademarks of their corresponding registered owners or are in actual use as trademarks in one or more countries. Their respective logos and icons are also subject to international copyright laws. Use thereof is explained in our trademark policy.