Nature Publishing Group, publisher of Nature, and other science journals and reference works
Nature
my account e-alerts subscribe register
   
Monday 06 July 2009
Journal Home
Current Issue
AOP
Archive
Download PDF
References
Export citation
Export references
Send to a friend
More articles like this

Letters to Nature
Nature 179, 595 (16 March 1957); doi:10.1038/179595a0

Distribution of Word Frequencies

I. J. GOOD

25 Scott House, Princess Elizabeth Way, Cheltenham.

THE purpose of this communication is to explain, in terms of the theory of information, the implications of the Zipf distribution of word frequencies1. The distribution is formally identical with the Pareto income and Willis taxonomic distributions, but the present discussion is restricted to word frequencies. The discussion resembles that of Mandelbrot2 but is simpler. The discussion by Parker-Rhodes and Joyce3 also resembles Mandelbrot's, but is fallacious.

  1. Zipf, G. K. , "The Psychobiology of Language" (Boston, 1935). "Human Behaviour and the Principle of Least Effort" (Cambridge, Mass., 1949).
  2. Mandelbrot, B. , in "Communication Theory", 486, edit. by Willis Jackson (London, 1953); "Information Theory: Third London Symposium", 135 (edit. by E. C. Cherry, London, 1956).
  3. Parker-Rhodes, A. F. , and Joyce, T. , Nature, 178, 1308 (1956). | Article |
  4. Corbet, A. S. , Fisher, R. A. , and Williams, C. B. , J. Anim. Ecol., 12, 42 (1943). Good, I. J. , Biometrika, 40, 237 (1953).
  5. Shannon, C. E. , Bell System Tech. J., 27, 379 and 623 (1948). Wiener, N. , "Cybernetics", 75 (New York and Paris, 1948). Good, I. J. , "Probability and the Weighing of Evidence", 75 (London and New York, 1950); Proc. Inst. Elect. Eng., C, 103, 200 (1956). | ISI |
  6. Herdan, G. , "Language as Choice and Chance" (Groningen, 1956).



© 1957 Nature Publishing Group
Privacy Policy