USE OF THREE MODELS IN ANALYSIS OF DATA ON THE COUNTS OF WORDS IN A DOCUMENT
There is a class of processes in which some quantity is distributed among the individuals of a society according to how much each individual already has. Such processes are characterized by power-law distributions. In this context, we derive a stochastic process model for the counts of words in a document from Simon’s [3] assumption II, solve Simon’s [3] and de Solla Price’s [2] models in closed form, and fit our model, four variants of Simon’s [3] model, and two variants of de Solla Price’s [2] model to the counts of words in two poems: Commitment and Paradise Lost, Book I. For both poems, our model and Simon’s [3] model gave very similar results, in contrast with de Solla Price’s [2] model.
Keywords: stochastic process model, Simon’s model, de Solla Price’s model, closed-form solution.