Distribution of Word Frequencies

Abstract

THE purpose of this communication is to explain, in terms of the theory of information, the implications of the Zipf distribution of word frequencies1. The distribution is formally identical with the Pareto income and Willis taxonomic distributions, but the present discussion is restricted to word frequencies. The discussion resembles that of Mandelbrot2 but is simpler. The discussion by Parker-Rhodes and Joyce3 also resembles Mandelbrot's, but is fallacious.

Access options

Rent or Buy article

Get time limited or full article access on ReadCube.

from$8.99

All prices are NET prices.

References

  1. 1

    Zipf, G. K., “The Psychobiology of Language” (Boston, 1935). “Human Behaviour and the Principle of Least Effort” (Cambridge, Mass., 1949).

  2. 2

    Mandelbrot, B., in “Communication Theory”, 486, edit. by Willis Jackson (London, 1953); “Information Theory: Third London Symposium”, 135 (edit. by E. C. Cherry, London, 1956).

  3. 3

    Parker-Rhodes, A. F., and Joyce, T., Nature, 178, 1308 (1956).

  4. 4

    Corbet, A. S., Fisher, R. A., and Williams, C. B., J. Anim. Ecol., 12, 42 (1943). Good, I. J., Biometrika, 40, 237 (1953).

  5. 5

    Shannon, C. E., Bell System Tech. J., 27, 379 and 623 (1948). Wiener, N., “Cybernetics”, 75 (New York and Paris, 1948). Good, I. J., “Probability and the Weighing of Evidence”, 75 (London and New York, 1950); Proc. Inst. Elect. Eng., C, 103, 200 (1956).

  6. 6

    Herdan, G., “Language as Choice and Chance” (Groningen, 1956).

Download references

Author information

Rights and permissions

Reprints and Permissions

About this article

Cite this article

GOOD, I. Distribution of Word Frequencies. Nature 179, 595 (1957) doi:10.1038/179595a0

Download citation

Further reading

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.