Language Evolution and Computation Bibliography

Our site (www.isrl.uiuc.edu/amag/langev) retired, please use https://langev.com instead.
Kosmas Kosmidis
2009
Journal of Statistical Mechanics: Theory and Experiment, pages P12008, 2009
Since language is tied to cognition, we expect the linguistic structures to reflect patterns we encounter in nature and analyzed by physics. Within this realm we investigate the process of protolanguage acquisition, using analytical and tractable methods developed within physics. ...MORE ⇓
Since language is tied to cognition, we expect the linguistic structures to reflect patterns we encounter in nature and analyzed by physics. Within this realm we investigate the process of protolanguage acquisition, using analytical and tractable methods developed within physics. A protolanguage is a mapping between sounds and objects (or concepts) of the perceived world. This mapping is represented by a matrix and the linguistic interaction among individuals is described by a random matrix model. There are two essential parameters in our approach. The strength of the linguistic interaction $\beta$, which following Chomsky's tradition, we consider as a genetically determined ability, and the number $N$ of employed sounds (the lexicon size). Our model of linguistic interaction is analytically studied using methods of statistical physics and simulated by Monte Carlo techniques. The analysis reveals an intricate relationship between the innate propensity for language acquisition $\beta$ and the lexicon size $N$, $N \sim \exp(\beta)$. Thus a small increase of the genetically determined $\beta$ may lead to an incredible lexical explosion. Our approximate scheme offers an explanation for the biological affinity of different species and their simultaneous linguistic disparity.
2007
Physica A: Statistical Mechanics and its Applications 379(2):665-671, 2007
We examine the evolution of the vocabulary of a group of individuals (linguistic agents) on a scale-free network, using Monte Carlo simulations and assumptions from evolutionary game theory. It is known that when the agents are arranged in a two-dimensional lattice structure and ...MORE ⇓
We examine the evolution of the vocabulary of a group of individuals (linguistic agents) on a scale-free network, using Monte Carlo simulations and assumptions from evolutionary game theory. It is known that when the agents are arranged in a two-dimensional lattice structure and interact by diffusion and encounter, then their final vocabulary size is the maximum possible. Knowing all available words is essential in order to increase the probability to 'survive' by effective reproduction. On scale-free networks we find a different result. It is not necessary to learn the entire vocabulary available. Survival chances are increased by using the vocabulary of the 'hubs' (nodes with high degree). The existence of the 'hubs' in a scale-free network is the source of an additional important fitness generating mechanism. (C) 2007 Elsevier B.V. All rights reserved.
2006
Physica A: Statistical Mechanics and its Applications 366:495-502, 2006
We use the formulation of equilibrium statistical mechanics in order to study some important characteristics of language. Using a simple expression for the Hamiltonian of a language system, which is directly implied by the Zipf law, we are able to explain several characteristic ...MORE ⇓
We use the formulation of equilibrium statistical mechanics in order to study some important characteristics of language. Using a simple expression for the Hamiltonian of a language system, which is directly implied by the Zipf law, we are able to explain several characteristic features of human language that seem completely unrelated, such as the universality of the Zipf exponent, the vocabulary size of children, the reduced communication abilities of people suffering from schizophrenia, etc. While several explanations are necessarily only qualitative at this stage, we have, nevertheless, been able to derive a formula for the vocabulary size of children as a function of age, which agrees rather well with experimental data.
Physica A: Statistical Mechanics and its Applications 370(2):808-816, 2006
We use the detrended fluctuation analysis (DFA) and the Grassberger-Proccacia analysis (GP) methods in order to study language characteristics. Despite that we construct our signals using only word lengths or word frequencies, excluding in this way huge amount of information from ...MORE ⇓
We use the detrended fluctuation analysis (DFA) and the Grassberger-Proccacia analysis (GP) methods in order to study language characteristics. Despite that we construct our signals using only word lengths or word frequencies, excluding in this way huge amount of information from language, the application of GP analysis indicates that linguistic signals may be considered as the manifestation of a complex system of high dimensionality, different from random signals or systems of low dimensionality such as the Earth climate. The DFA method is additionally able to distinguish a natural language signal from a computer code signal. This last result may be useful in the field of cryptography.
2005
Physica A: Statistical Mechanics and its Applications 353:595-612, 2005
We use Monte Carlo simulations and assumptions from evolutionary game theory in order to study the evolution of words and the population dynamics of a system made of two interacting species which initially speak two different languages. The species are characterized by their ...MORE ⇓
We use Monte Carlo simulations and assumptions from evolutionary game theory in order to study the evolution of words and the population dynamics of a system made of two interacting species which initially speak two different languages. The species are characterized by their identity, vocabulary, and have different initial fitness, i.e. reproduction capability. We investigate how different initial fitness affects the vocabulary of the species or the population dynamics by leading to a permanent populational advantage. We further find that the spatial distributions of the species may cause the system to exhibit pattern formation or segregation. We show that an initial fitness advantage, even though very quickly balanced, leads to better spatial arrangement and enhances survival probabilities of the species. In most cases the system will arrive at a final state where both languages coexist. However, in cases where one species greatly outnumbers the other in population and fitness, then only one species survives with its 'final' language having a slightly richer vocabulary than its initial language. Thus, our results offer an explanation for the existence and origin of synonyms in spoken languages.