Jump to content

User talk:AKA MBG/English Simple Wikipedia 20080214 freq wordlist

Page contents not supported in other languages.
Add topic
From Wiktionary
Latest comment: 5 years ago by 2A02:1812:D29:7200:E40A:FB6A:9BBC:399

Thanks for that! It's a very interesting list. Just looking quickly at the numbers, it strikes me as an unusually flat distribution. Would you agree? That pretty much matches my expectations. I think I would predict that something written in simple English would be flatter in the first 1000 words and then drop off faster than normal beyond that. Does that make sense?--Brett 00:45, 24 February 2008 (UTC)Reply

Yes, the comparison of Russian and Simple English Wikipedia corpora in the paper (Fig. 3, page 10) shows that Simple English has the more steep sloping curve than Russian. -- AKA MBG 18:11, 14 August 2008 (UTC)Reply

Since this page was compiled in 2008, is it possible to run a count for the wiki at the current time ? I think it might be interesting to see if the numbers or rankings have changed in the last 2 years. 152.131.9.69 (talk) 22:21, 29 November 2010 (UTC)Reply

Sorry, but now I am working with the machine-readable dictonary build upon Wiktionary, welcome to test it: wiwordik. -- AKA MBG (talk) 18:44, 20 February 2011 (UTC)Reply

Hi all. I've noticed that there are capitilisation errors on this page: Because it says specifically not to change the page, I have not corrected these errors. jont76

Remark: there is no link for words which appeared due to the imperfection of current version of wiki-to-text-parser, e.g. "tr", "jpg", "ref", etc. These words are part of the HTML code same for bgcolor and some others (only strange that it is "ref" and not "href") 2A02:1812:D29:7200:E40A:FB6A:9BBC:399 (talk) 12:45, 6 July 2020 (UTC)Reply