Hi Cyril,
I have compiled a CSV file for a Slovenian (SL) dictionary from a huge corpus of the Slovenian language, sorted by frequency in descending order.
The file consists of the 63,000+ most frequently used words in Slovenian.
I've attached a zip file with 3 files in it: an XLS frequency reference (CP1250) sorted descending, an XLS word list (CP1250) sorted descending, and a CSV list of words (UTF-8) sorted descending.
If you need to cut those files, do it from the bottom, as that is where the words with the lowest frequency are, but I would prefer to keep as many words as possible.
v.1
Thank you.
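For anyone preparing a similar list, cutting from the bottom can be sketched as below. This is only a rough illustration, assuming a UTF-8 CSV with one word per row in the first column, already sorted by descending frequency. The function names and the sample words are made up for the example and are not taken from the attached files.

```python
import csv
import io


def read_words(text):
    """Parse CSV text (one word per row, first column) and skip empty rows."""
    return [row[0] for row in csv.reader(io.StringIO(text)) if row and row[0].strip()]


def trim_word_list(words, max_words):
    """Keep the first max_words entries of a list sorted by descending frequency.

    Because the least frequent words sit at the bottom, slicing from the top
    drops only the rarest entries.
    """
    return words[:max_words]


# Hypothetical sample standing in for the real frequency-sorted CSV.
sample = "in\nje\nda\nna\nse\n"
words = read_words(sample)
trimmed = trim_word_list(words, 3)
print(trimmed)
```

With a real file, the text could be read with `open(path, encoding="utf-8")` before being passed to `read_words`.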
Slovenian dictionary
Last edited by fvs114 on Sun Jun 05, 2011 8:14 pm, edited 1 time in total.
Re: Slovenian dictionary
Hi,
I'm a bit confused.
I saw this in the description or change log for the Portuguese dictionary:
UPDATE:
- 130000 words added!-
Is 63K words a good starting point for the Slovenian dictionary, or should I start working on a 265K project?
What is the average number of words in the other dictionaries?


- cyril
- Developer
- Posts: 2079
- Joined: Tue Feb 02, 2010 4:02 pm
- Phone: Nexus One 2.3
- Location: Nice, France
Re: Slovenian dictionary
Hi
The only limitation is the final size of the apk, which is difficult to predict. I think 265k words would be too much, but anyway if the list is sorted you can give me everything and I will trim it if necessary.
Cyril
Re: Slovenian dictionary
Hi Cyril,
I’ve made a 265K file with the words sorted in descending order, so you’ll be able to trim it, but I hope that won’t be necessary.
I think it is an excellent selection for this number of words, but if you have to trim it heavily, I’ll make another, slightly different one.
v.2
If you do have to trim it, let me know how many lines (words) are included in the dictionary once it is finished.
Thank you.


Last edited by fvs114 on Sun Jun 05, 2011 8:30 pm, edited 1 time in total.
Re: Slovenian dictionary
I just built the dictionary here.
There was no need to trim it as it has a good compression factor, so the whole list is there. Let me know if it's ok before I put it on the market.
Cyril
Re: Slovenian dictionary
Thanks Cyril, great news!
I'm testing it now and it's good, but I have to change a few things and apply some filters before the official launch.
I've found some errors: I have to add, change, and delete a few words, check personal names, and fix a problem with some capitalization, etc., none of which I was able to see without a beta version.


Re: Slovenian dictionary
Hi Cyril,
Here is the updated list with all corrections; I think it is good enough for the official launch.
v.3
260K words!!! (The 267,257 rows from the previous version were reduced to 266,146 rows by filtering out unneeded values.)
The current beta version is indeed very, very good, with some minor exceptions, which will be solved with this new list.
FVS114
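The row counts above imply that 1,111 rows were removed between versions. The post does not say which values counted as unneeded, so the following is only a rough sketch, assuming the filter dropped blank rows and repeated entries. The function name and the sample rows are hypothetical.

```python
def filter_word_list(words):
    """Drop blank and repeated entries while keeping the original order.

    Assumption: "unneeded values" means blanks and duplicates; the actual
    filters applied to the v.3 list are not described in the thread.
    """
    seen = set()
    kept = []
    for word in words:
        cleaned = word.strip()
        if not cleaned or cleaned in seen:
            continue
        seen.add(cleaned)
        kept.append(cleaned)
    return kept


# Hypothetical sample rows, not taken from the real list.
rows = ["in", "je", "", "in", " da "]
filtered = filter_word_list(rows)
print(len(rows) - len(filtered), "rows removed")
```

Keeping the first occurrence of each word preserves the frequency ordering, so the list can still be trimmed from the bottom afterwards.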
Re: Slovenian dictionary
The Slovenian dictionary is now on the Market, based on your latest list.
Let me know if things have to be changed.
Cyril
Re: Slovenian dictionary
You’re the man, Cyril!!! Thank you!!!
If anything comes up, I’ll let you know.


