
User talk:AKA MBG/English Simple Wikipedia 20080214 freq wordlist

From Wiktionary

Thanks for that! It's a very interesting list. Just looking quickly at the numbers, it strikes me as an unusually flat distribution. Would you agree? That pretty much matches my expectations. I think I would predict that something written in simple English would be flatter in the first 1000 words and then drop off faster than normal beyond that. Does that make sense? --Brett 00:45, 24 February 2008 (UTC)

Yes, the comparison of the Russian and Simple English Wikipedia corpora in the paper [1] (Fig. 3, page 10) shows that Simple English has a steeper curve than Russian. -- AKA MBG 18:11, 14 August 2008 (UTC)

Since this page was compiled in 2008, is it possible to run a new count for the wiki as it is now? I think it might be interesting to see if the numbers or rankings have changed in the last 2 years. 152.131.9.69 (talk) 22:21, 29 November 2010 (UTC)

Sorry, but I am now working on a machine-readable dictionary built upon Wiktionary. You are welcome to test it: wiwordik. -- AKA MBG (talk) 18:44, 20 February 2011 (UTC)

Hi all. I've noticed that there are capitalisation errors on this page. Because it says specifically not to change the page, I have not corrected these errors. jont76

Remark: there are no links for words that appeared because of imperfections in the current version of the wiki-to-text parser, e.g. "tr", "jpg", "ref", etc. These words are part of the HTML code; the same goes for "bgcolor" and some others (it is only strange that it is "ref" and not "href"). 2A02:1812:D29:7200:E40A:FB6A:9BBC:399 (talk) 12:45, 6 July 2020 (UTC)
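If someone reruns the count, such parser leftovers could be dropped from the list afterwards. The snippet below is just a rough sketch under assumptions: the residue set contains only the four tokens named in the remark above (a real list would need more), and `drop_markup_residue` is a hypothetical helper, not part of the original wiki-to-text parser.

```python
# Assumed residue set: only the tokens named in the remark above.
MARKUP_RESIDUE = {"tr", "jpg", "ref", "bgcolor"}


def drop_markup_residue(freq_list, residue=MARKUP_RESIDUE):
    """Remove (word, count) pairs whose word is known markup residue.

    The comparison is case-insensitive; the order of the remaining
    entries is preserved.
    """
    return [(word, count) for word, count in freq_list
            if word.lower() not in residue]
```

A blunt filter like this would also remove any genuine use of a word such as "ref", so it is a trade-off rather than a fix for the parser itself.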