I've just found out that the Korean language has also an official list of words by frequency. The one thing that seems confusing to me is the fact that the database files have made separate entries for homonyms, based on their frequency. For example, the database has two separate entries for "bear", marking "bear1", frequency 15000 for "the animal bear" and "bear2", frequency 12000 for "to bear, or carry".
So, I'm not sure how this database can be used for making a dictionary. I'm thinking that maybe merging the numbers of the different entries for the homonyms might be the best solution, but I'm no excel guru, so I'm not sure if this work might require a whole lot of manual work. If this would be considered to be the requirement, I might need some help.
Or maybe it might be possible to simply leave the homonyms, delete the "differentiating number marks", and simply build up the dictionary? I really don't know. If this is possible, and making a list with redundant words, I think I'll be able to do this myself, but it might not be the best solution. I need help!
I've attached the list of the words in an exel file. The left is the frequency, the right is the list of words marked with the "number" for homonyms. I hope this works! And I'll be welcome to suggestions and any help from anyone.
...looks like I cannot upload excel files. Please check the file http://dl.dropbox.com/u/52727/korean%20words.xlsx here