Language Evolution and Computation Bibliography

Our site (www.isrl.uiuc.edu/amag/langev) retired, please use https://langev.com instead.
Vittorio Loreto
2015
PloS one 10:489-509, 2015
Language universals have long been attributed to an innate Universal Grammar. An alternative explanation states that linguistic universals emerged independently in every language in response to shared cognitive or perceptual biases. A computational model has recently shown how ...MORE ⇓
Language universals have long been attributed to an innate Universal Grammar. An alternative explanation states that linguistic universals emerged independently in every language in response to shared cognitive or perceptual biases. A computational model has recently shown how this could be the case, focusing on the paradigmatic example of the universal properties of colour naming patterns, and producing results in quantitative agreement with the experimental data. Here we investigate the role of an individual perceptual bias in the framework of the model. We study how, and to what extent, the structure of the bias influences the corresponding linguistic universal patterns. We show that the cultural history of a group of speakers introduces population-specific constraints that act against the pressure for uniformity arising from the individual bias, and we clarify the interplay between these two forces.
Journal of Memory and Language 84:205-223, 2015
Several recent theories have suggested that an increase in the number of non-native speakers in a language can lead to changes in morphological rules. We examine this experimentally by contrasting the performance of native and non-native English speakers in a simple Wug-task, ...MORE ⇓
Several recent theories have suggested that an increase in the number of non-native speakers in a language can lead to changes in morphological rules. We examine this experimentally by contrasting the performance of native and non-native English speakers in a simple Wug-task, showing that non-native speakers are significantly more likely to provide non -ed (i.e., irregular) past-tense forms for novel verbs than native speakers. Both groups are sensitive to sound similarities between new words and existing words (i.e., are more likely to provide irregular forms for novel words which sound similar to existing irregulars). Among both natives and non-natives, irregularizations are non-random; that is, rather than presenting as truly irregular inflectional strategies, they follow identifiable sub-rules present in the highly frequent set of irregular English verbs. Our results shed new light on how native and non-native learners can affect language structure.
2014
Physics of life reviews 11 2:311-2, 2014
The debate on language origin and evolution has benefited from a largely interdisciplinary effort, involving linguists, anthropologists, sociologist as well as physicists, mathematicians and computer scientists. A fundamental question is whether a shared communication system can ...MORE ⇓
The debate on language origin and evolution has benefited from a largely interdisciplinary effort, involving linguists, anthropologists, sociologist as well as physicists, mathematicians and computer scientists. A fundamental question is whether a shared communication system can emerge from repeated interactions among individuals, not relying on any a priori or innate language-specific structure. Modeling, and in particular language games, proved to be a powerful tool to gain insight on this beautiful mystery. In particular, fruitful investigations has been done concerning the possibility for a population of individuals to exploit local communication acts to build up a shared vocabulary [1] or a system of linguistic categories reproducing the universality and the hierarchies observed in anthropological data [2–4]. A particular effort has been also devoted to the origin of the complex organization of syntax in hierarchical structures, one of the core design features of human language. As Gong and coauthors highlighted in this review [5], a combinatorial and compositional structure can emerge out of a holistic language due to communication purposes, and explaining how this could possibly happen still represents an intriguing challenge [6–11]. It is important to remark how theoretical investigations should be, and are more and more, paralleled by a growing attention to a careful comparison with data on language formation. Different kind of data can be exploited to shed light on different questions. Diachronic and historical data related to migration patterns have been used for instance to study the timescales of language evolution. Anthropological studies on pre-industrialized populations [12,13] have been crucial in the understanding language universals. At the same times, experiments in cognitive science helped in shading light on the mechanisms emerging when individuals are called to perform communicative tasks [14]. It is worth mentioning in this perspective how advances in information and communication technologies allow nowadays the realization of focused experiments also in the framework of the emergence of linguistic structures exploiting the huge basin of web users. In particular, a general trend is emerging for the adoption of web-games as a very interesting laboratory to run experiments in the social-sciences and whenever the contribution of human beings is crucially required for research purposes. This is opening tremendous opportunities to monitor the emergence of specific linguistic features and their co-evolution with the structure of our conceptual spaces.
PloS one 9:839-862, 2014
Human languages are rule governed, but almost invariably these rules have exceptions in the form of irregularities. Since rules in language are efficient and productive, the persistence of irregularity is an anomaly. How does irregularity linger in the face of internal ...MORE ⇓
Human languages are rule governed, but almost invariably these rules have exceptions in the form of irregularities. Since rules in language are efficient and productive, the persistence of irregularity is an anomaly. How does irregularity linger in the face of internal (endogenous) and external (exogenous) pressures to conform to a rule? Here we address this problem by taking a detailed look at simple past tense verbs in the Corpus of Historical American English. The data show that the language is open, with many new verbs entering. At the same time, existing verbs might tend to regularize or irregularize as a consequence of internal dynamics, but overall, the amount of irregularity sustained by the language stays roughly constant over time. Despite continuous vocabulary growth, and presumably, an attendant increase in expressive power, there is no corresponding growth in irregularity. We analyze the set of irregulars, showing they may adhere to a set of minority rules, allowing for increased stability of irregularity over time. These findings contribute to the debate on how language systems become rule governed, and how and why they sustain exceptions to rules, providing insight into the interplay between the emergence and maintenance of rules and exceptions in language.
2013
Emergence of fast agreement in an overhearing population: The case of the naming gamePDF
Europhysics Letters 101(6):68004, 2013
The naming game (NG) describes the agreement dynamics of a population of N agents interacting locally in pairs leading to the emergence of a shared vocabulary. This model has its relevance in the novel fields of semiotic dynamics and specifically to opinion formation and language ...MORE ⇓
The naming game (NG) describes the agreement dynamics of a population of N agents interacting locally in pairs leading to the emergence of a shared vocabulary. This model has its relevance in the novel fields of semiotic dynamics and specifically to opinion formation and language evolution. The application of this model ranges from wireless sensor networks as spreading algorithms, leader election algorithms to user-based social tagging systems. In this paper, we introduce the concept of overhearing (i.e., at every time step of the game, a random set of Nδ individuals are chosen from the population who overhear the transmitted word from the speaker and accordingly reshape their inventories). When δ = 0 one recovers the behavior of the original NG. As one increases δ, the population of agents reaches a faster agreement with a significantly low-memory requirement. The convergence time to reach global consensus scales as logN as δ approaches 1.
2012
Advances in Complex Systems 15(03n04):1203002, 2012
Thirty authors of different disciplines, ranging from cognitive science and linguistics to mathematics and physics, address the topic of language origin and evolution. Language dynamics is investigated through an interdisciplinary effort, involving field and synthetic ...MORE ⇓
Thirty authors of different disciplines, ranging from cognitive science and linguistics to mathematics and physics, address the topic of language origin and evolution. Language dynamics is investigated through an interdisciplinary effort, involving field and synthetic experiments, modelling and comparison of the theoretical predictions with empirical data. The result consists in new insights that significantly contribute to the ongoing debate on the origin and the evolution of language. In this Topical Issue the state of the art of this novel and fertile approach is reported by major experts of the field.
Advances in Complex Systems 15(03n04):1250054, 2012
We investigate the directed and weighted complex network of free word associations in which players write a word in response to another word given as input. We analyze in details two large datasets resulting from two very different experiments: On the one hand the massive ...MORE ⇓
We investigate the directed and weighted complex network of free word associations in which players write a word in response to another word given as input. We analyze in details two large datasets resulting from two very different experiments: On the one hand the massive multiplayer web-based Word Association Game known as Human Brain Cloud, and on the other hand the South Florida Free Association Norms experiment. In both cases, the networks of associations exhibit quite robust properties like the small world property, a slight assortativity and a strong asymmetry between in-degree and out-degree distributions. A particularly interesting result concerns the existence of a characteristic scale for the word association process, arguably related to specific conceptual contexts for each word. After mapping, the Human Brain Cloud network onto the WordNet semantics network, we point out the basic cognitive mechanisms underlying word associations when they are represented as paths in an underlying semantic network. We derive in particular an expression describing the growth of the HBC graph and we highlight the existence of a characteristic scale for the word association process.
PNAS 109(18):6819--6824, 2012
Abstract One of the fundamental problems in cognitive science is how humans categorize the visible color spectrum. The empirical evidence of the existence of universal or recurrent patterns in color naming across cultures is paralleled by the observation that color names ...
Advances in Complex Systems 15(03n04):1150016, 2012
It is widely known that color names across the world's languages tend to be organized into a neat hierarchy with a small set of 'basic names' featuring in a comparatively fixed order across linguistic societies. However, to date, the basic names have only been defined through a ...MORE ⇓
It is widely known that color names across the world's languages tend to be organized into a neat hierarchy with a small set of 'basic names' featuring in a comparatively fixed order across linguistic societies. However, to date, the basic names have only been defined through a set of linguistic principles. There is no statistical definition that quantitatively separates the basic names from the rest of the color words across languages. Here we present a rigorous statistical analysis of the World Color Survey database hosting color word information from 110 non-industrialized languages. The central result is that those names for which a population of individuals show a larger overall agreement across languages turn out to be the basic ones exactly reproducing the color name hierarchy and, thereby, providing, for the first time, an empirical definition of the basic color names.
Artificial Life 18(1):107--121, 2012
This article adopts the category game model, which simulates the origins and evolution of linguistic categories in a group of artificial agents, to evaluate the effect of social structure on linguistic categorization. Based on the simulation results in a number of typical ...MORE ⇓
This article adopts the category game model, which simulates the origins and evolution of linguistic categories in a group of artificial agents, to evaluate the effect of social structure on linguistic categorization. Based on the simulation results in a number of typical networks, we examine the isolating and collective effects of some structural features, including average degree, shortcuts, and level of centrality, on the categorization process. This study extends the previous simulations mainly on lexical evolution, and illustrates a general framework to systematically explore the effect of social structure on language evolution.
Naming a Structured World: A Cultural Route to Duality of PatterningPDF
PLoS ONE 7(6):e37744, 2012
The lexicons of human languages organize their units at two distinct levels. At a first combinatorial level, meaningless forms (typically referred to as phonemes) are combined into meaningful units (typically referred to as morphemes). Thanks to this, many ...
2011
Journal of Statistical Mechanics: Theory and Experiment, 2011
Language dynamics is a rapidly growing field that focuses on all processes related to the emergence, evolution, change and extinction of languages. Recently, the study of self-organization and evolution of language and meaning has led to the idea that a community of language ...MORE ⇓
Language dynamics is a rapidly growing field that focuses on all processes related to the emergence, evolution, change and extinction of languages. Recently, the study of self-organization and evolution of language and meaning has led to the idea that a community of language users can be seen as a complex dynamical system, which collectively solves the problem of developing a shared communication framework through the back-and-forth signaling between individuals.

We shall review some of the progress made in the past few years and highlight potential future directions of research in this area. In particular, the emergence of a common lexicon and of a shared set of linguistic categories will be discussed, as examples corresponding to the early stages of a language. The extent to which synthetic modeling is nowadays contributing to the ongoing debate in cognitive science will be pointed out. In addition, the burst of growth of the web is providing new experimental frameworks. It makes available a huge amount of resources, both as novel tools and data to be analyzed, allowing quantitative and large-scale analysis of the processes underlying the emergence of a collective information and language dynamics.

Physics of Life Reviews 8:371--372, 2011
Three ingredients play a central role in the study of origins and evolution of language and meaning: biological constraints, knowledge transmission between successive generations (vertical transmission) and achievement of a common knowledge within a single ...
PLoS ONE 6(2):e16677, 2011
Human languages evolve continuously, and a puzzling problem is how to reconcile the apparent robustness of most of the deep linguistic structures we use with the evidence that they undergo possibly slow, yet ceaseless, changes. Is the state in which we observe languages today ...MORE ⇓
Human languages evolve continuously, and a puzzling problem is how to reconcile the apparent robustness of most of the deep linguistic structures we use with the evidence that they undergo possibly slow, yet ceaseless, changes. Is the state in which we observe languages today closer to what would be a dynamical attractor with statistically stationary properties or rather closer to a non-steady state slowly evolving in time? Here we address this question in the framework of the emergence of shared linguistic categories in a population of individuals interacting through language games. The observed emerging asymptotic categorization, which has been previously tested - with success - against experimental data from human languages, corresponds to a metastable state where global shifts are always possible but progressively more unlikely and the response properties depend on the age of the system. This aging mechanism exhibits striking quantitative analogies to what is observed in the statistical mechanics of glassy systems. We argue that this can be a general scenario in language dynamics where shared linguistic conventions would not emerge as attractors, but rather as metastable states.
PLoS ONE 6(6):e20109, 2011
Historical linguistics aims at inferring the most likely language phylogenetic tree starting from information concerning the evolutionary relatedness of languages. The available information are typically lists of homologous (lexical, phonological, syntactic) features or ...MORE ⇓
Historical linguistics aims at inferring the most likely language phylogenetic tree starting from information concerning the evolutionary relatedness of languages. The available information are typically lists of homologous (lexical, phonological, syntactic) features or characters for many different languages: a set of parallel corpora whose compilation represents a paramount achievement in linguistics.

From this perspective the reconstruction of language trees is an example of inverse problems: starting from present, incomplete and often noisy, information, one aims at inferring the most likely past evolutionary history. A fundamental issue in inverse problems is the evaluation of the inference made. A standard way of dealing with this question is to generate data with artificial models in order to have full access to the evolutionary process one is going to infer. This procedure presents an intrinsic limitation: when dealing with real data sets, one typically does not know which model of evolution is the most suitable for them. A possible way out is to compare algorithmic inference with expert classifications. This is the point of view we take here by conducting a thorough survey of the accuracy of reconstruction methods as compared with the Ethnologue expert classifications. We focus in particular on state-of-the-art distance-based methods for phylogeny reconstruction using worldwide linguistic databases.

In order to assess the accuracy of the inferred trees we introduce and characterize two generalizations of standard definitions of distances between trees. Based on these scores we quantify the relative performances of the distance-based algorithms considered. Further we quantify how the completeness and the coverage of the available databases affect the accuracy of the reconstruction. Finally we draw some conclusions about where the accuracy of the reconstructions in historical linguistics stands and about the leading directions to improve it.

Journal of Computational Science 2(4):316--323, 2011
Article history: Received 21 December 2010 Received in revised form 20 September 2011 Accepted 3 October 2011 Available online xxx Keywords: Category game Metastable states No-rejection algorithms Agent-based simulation abstract
2010
PNAS 107(6):2403-2407, 2010
The empirical evidence that human color categorization exhibits some universal patterns beyond superficial discrepancies across different cultures is a major breakthrough in cognitive science. As observed in the World Color Survey (WCS), indeed, any two groups of individuals ...MORE ⇓
The empirical evidence that human color categorization exhibits some universal patterns beyond superficial discrepancies across different cultures is a major breakthrough in cognitive science. As observed in the World Color Survey (WCS), indeed, any two groups of individuals develop quite different categorization patterns, but some universal properties can be identified by a statistical analysis over a large number of populations. Here, we reproduce the WCS in a numerical model in which different populations develop independently their own categorization systems by playing elementary language games. We find that a simple perceptual constraint shared by all humans, namely the human Just Noticeable Difference (JND), is sufficient to trigger the emergence of universal patterns that unconstrained cultural interaction fails to produce. We test the results of our experiment against real data by performing the same statistical analysis proposed to quantify the universal tendencies shown in the WCS [Kay P & Regier T. (2003) Proc. Natl. Acad. Sci. USA 100: 9085-9089], and obtain an excellent quantitative agreement. This work confirms that synthetic modeling has nowadays reached the maturity to contribute significantly to the ongoing debate in cognitive science.
Evolution of Communication and Language in Embodied Agents, pages 263-281, 2010
In this chapter we explore several language games of increasing complexity. We first consider the so-called Naming Game, possibly the simplest example of the complex processes leading progressively to the establishment of human-like languages. In this framework, a globally shared ...MORE ⇓
In this chapter we explore several language games of increasing complexity. We first consider the so-called Naming Game, possibly the simplest example of the complex processes leading progressively to the establishment of human-like languages. In this framework, a globally shared vocabulary emerges as a result of local adjustments of individual word-meaning association. The emergence of a common vocabulary only represents a first stage while it is interesting to investigate the emergence of higher forms of agreement, e.g., compositionality, categories, syntactic or grammatical structures. As an example in this direction we consider the so-called Category Game. Here one focuses on the process by which a population of individuals manages to categorize a single perceptually continuous channel. The problem of the emergence of a discrete shared set of categories out of a continuous perceptual channel is a notoriously difficult problem relevant for color categorization, vowels formation, etc. The central result here is the emergence of a hierarchical category structure made of two distinct levels: a basic layer, responsible for fine discrimination of the environment, and a shared linguistic layer that groups together perceptions to guarantee communicative success.
Evolution of Communication and Language in Embodied Agents, pages 67-81, 2010
Statistical physics has proven to be a very fruitful framework to describe phenomena outside the realm of traditional physics. In social phenomena, the basic constituents are not particles but humans and every individual interacts with a limited number of peers, usually ...MORE ⇓
Statistical physics has proven to be a very fruitful framework to describe phenomena outside the realm of traditional physics. In social phenomena, the basic constituents are not particles but humans and every individual interacts with a limited number of peers, usually negligible compared to the total number of people in the system. In spite of that, human societies are characterized by stunning global regularities that naturally call for a statistical physics approach to social behavior, i.e., the attempt to understand regularities at large scale as collective effects of the interaction among single individuals, considered as relatively simple entities. This is the paradigm of Complex Systems: an assembly of many interacting (and simple) units whose collective behavior is not trivially deducible from the knowledge of the rules governing their mutual interactions. In this chapter we review the main theoretical concepts and tools that physics can borrow to socially-motivated problems. Despite their apparent diversity, most research lines in social dynamics are actually closely connected from the point of view of both the methodologies employed and, more importantly, of the general phenomenological questions, e.g., what are the fundamental interaction mechanisms leading to the emergence of consensus on an issue, a shared culture, a common language or a collective motion?
Evolution of Communication and Language in Embodied Agents, pages 283-288, 2010
This chapter draws some conclusions from the computational and mathematical models of emergent symbolic communication systems reported in the earlier chapters. It also strongly pleads for a stronger interaction between linguistics and other human sciences studying similar issues.
2009
Reviews of Modern Physics 81(2):591-646, 2009
Statistical physics has proven to be a fruitful framework to describe phenomena outside the realm of traditional physics. Recent years have witnessed an attempt by physicists to study collective phenomena emerging from the interactions of individuals as elementary units in social ...MORE ⇓
Statistical physics has proven to be a fruitful framework to describe phenomena outside the realm of traditional physics. Recent years have witnessed an attempt by physicists to study collective phenomena emerging from the interactions of individuals as elementary units in social structures. A wide list of topics are reviewed ranging from opinion and cultural and language dynamics to crowd behavior, hierarchy formation, human dynamics, and social spreading. The connections between these problems and other, more traditional, topics of statistical physics are highlighted. Comparison of model results with empirical data from social systems are also emphasized.
Consensus and ordering in language dynamicsPDF
European Physical Journal B 71(4):557-564, 2009
We consider two social consensus models, the AB-model and the Naming Game restricted to two conventions, which describe a population of interacting agents that can be in either of two equivalent states (A or B) or in a third mixed (AB) state. Proposed in the context of language ...MORE ⇓
We consider two social consensus models, the AB-model and the Naming Game restricted to two conventions, which describe a population of interacting agents that can be in either of two equivalent states (A or B) or in a third mixed (AB) state. Proposed in the context of language competition and emergence, the AB state was associated with bilingualism and synonymy respectively. We show that the two models are equivalent in the mean field approximation, though the differences at the microscopic level have non-trivial consequences. To point them out, we investigate an extension of these dynamics in which confidence/trust is considered, focusing on the case of an underlying fully connected graph, and we show that the consensus-polarization phase transition taking place in the Naming Game is not observed in the AB model. We then consider the interface motion in regular lattices. Qualitatively, both models show the same behavior: a diffusive interface motion in a one-dimensional lattice, and a curvature driven dynamics with diffusing stripe-like metastable states in a two-dimensional one. However, in comparison to the Naming Game, the AB-model dynamics is shown to slow down the diffusion of such configurations.
PNAS 106(26):10511-10515, 2009
The enormous increase of popularity and use of the worldwide web has led in the recent years to important changes in the ways people communicate. An interesting example of this fact is provided by the now very popular social annotation systems, through which users annotate ...MORE ⇓
The enormous increase of popularity and use of the worldwide web has led in the recent years to important changes in the ways people communicate. An interesting example of this fact is provided by the now very popular social annotation systems, through which users annotate resources (such as web pages or digital photographs) with keywords known as ``tags.'' Understanding the rich emergent structures resulting from the uncoordinated actions of users calls for an interdisciplinary effort. In particular concepts borrowed from statistical physics, such as random walks (RWs), and complex networks theory, can effectively contribute to the mathematical modeling of social annotation systems. Here, we show that the process of social annotation can be seen as a collective but uncoordinated exploration of an underlying semantic space, pictured as a graph, through a series of RWs. This modeling framework reproduces several aspects, thus far unexplained, of social annotation, among which are the peculiar growth of the size of the vocabulary used by the community and its complex network structure that represents an externalization of semantic structures grounded in cognition and that are typically hard to access.
2008
International Journal of Modern Physics C 19(5):785-812, 2008
Language emergence and evolution have recently gained growing attention through multi-agent models and mathematical frameworks to study their behavior. Here we investigate further the Naming Game, a model able to account for the emergence of a shared vocabulary of form-meaning ...MORE ⇓
Language emergence and evolution have recently gained growing attention through multi-agent models and mathematical frameworks to study their behavior. Here we investigate further the Naming Game, a model able to account for the emergence of a shared vocabulary of form-meaning associations through social/cultural learning. Due to the simplicity of both the structure of the agents and their interaction rules, the dynamics of this model can be analyzed in great detail using numerical simulations and analytical arguments. This paper first reviews some existing results and then presents a new overall understanding.
Proceedings of 2008 IEEE Congress on Evolutionary Computation, pages 1686-1693, 2008
The language game approach is widely adopted to study conventionalization of linguistic knowledge. Most of contemporary models concentrate on the dynamics of language games in random or predefined social structures, but neglect the role of communicative constraints. This paper ...MORE ⇓
The language game approach is widely adopted to study conventionalization of linguistic knowledge. Most of contemporary models concentrate on the dynamics of language games in random or predefined social structures, but neglect the role of communicative constraints. This paper adopts one form of language games, the category game, to discuss whether some simple distance-related communicative constraint may affect the conventionalization of linguistic categories. By comparing the simulation results with those based on another form of language games, the naming game, we point out some essential differences between these two games which cause their distinct performances under the same communicative constraint. This study fills the gap between the dynamics of language games in random structures and that in complex networks, and suggests that internal properties of language games may reversely influence communicative constraints and social structures.
Biological Theory: Integrating Development, Evolution, and Cognition 3(2):154-163, 2008
The language game approach has recently been adopted to explore the conventionalization of linguistic knowledge in a social environment. Most contemporary studies focus on the dynamics of language games in random or predefined social networks, but neglect the reverse roles of ...MORE ⇓
The language game approach has recently been adopted to explore the conventionalization of linguistic knowledge in a social environment. Most contemporary studies focus on the dynamics of language games in random or predefined social networks, but neglect the reverse roles of communicative constraints in language evolution and social structures. This article, based on two forms of language games (the naming game and the category game), examines whether a simple, distance-based communicative constraint can affect the conventionalization of linguistic knowledge. The study bridges the gap between random networks and complex social structures, and illustrates that the internal properties of language games can influence the effects of communicative constraints and social structures.
PNAS 105(23):7936-7940, 2008
Categories provide a coarse-grained description of the world. A fundamental question is whether categories simply mirror an underlying structure of nature or instead come from the complex interactions of human beings among themselves and with the environment. Here, we address ...MORE ⇓
Categories provide a coarse-grained description of the world. A fundamental question is whether categories simply mirror an underlying structure of nature or instead come from the complex interactions of human beings among themselves and with the environment. Here, we address this question by modeling a population of individuals who co-evolve their own system of symbols and meanings by playing elementary language games. The central result is the emergence of a hierarchical category structure made of two distinct levels: a basic layer, responsible for fine discrimination of the environment, and a shared linguistic layer that groups together perceptions to guarantee communicative success. Remarkably, the number of linguistic categories turns out to be finite and small, as observed in natural languages.
2007
Chaos 17(2):026111, 2007
We review the behavior of a recently introduced model of agreement dynamics, called the ``Naming Game.'' This model describes the self-organized emergence of linguistic conventions and the establishment of simple communication systems in a population of agents with pairwise local ...MORE ⇓
We review the behavior of a recently introduced model of agreement dynamics, called the ``Naming Game.'' This model describes the self-organized emergence of linguistic conventions and the establishment of simple communication systems in a population of agents with pairwise local interactions. The mechanisms of convergence towards agreement strongly depend on the network of possible interactions between the agents. In particular, the mean-field case in which all agents communicate with all the others is not efficient, since a large temporary memory is requested for the agents. On the other hand, regular lattice topologies lead to a fast local convergence but to a slow global dynamics similar to coarsening phenomena. The embedding of the agents in a small-world network represents an interesting tradeoff: a local consensus is easily reached, while the long-range links allow to bypass coarsening-like convergence. We also consider alternative adaptive strategies which can lead to faster global convergence.(c) 2007 American Institute of Physics.
PNAS 104(5):1461-1464, 2007
Collaborative tagging has been quickly gaining ground because of its ability to recruit the activity of web users into effectively organizing and sharing vast amounts of information. Here we collect data from a popular system and investigate the statistical properties of tag ...MORE ⇓
Collaborative tagging has been quickly gaining ground because of its ability to recruit the activity of web users into effectively organizing and sharing vast amounts of information. Here we collect data from a popular system and investigate the statistical properties of tag cooccurrence. We introduce a stochastic model of user behavior embodying two main aspects of collaborative tagging: (i) a frequency-bias mechanism related to the idea that users are exposed to each other's tagging activity; (ii) a notion of memory, or aging of resources, in the form of a heavy-tailed access to the past state of the system. Remarkably, our simple modeling is able to account quantitatively for the observed experimental features with a surprisingly high accuracy. This points in the direction of a universal behavior of users who, despite the complexity of their own cognitive processes and the uncoordinated and selfish nature of their tagging activity, appear to follow simple activity patterns.
Nature Physics 3:758-760, 2007
Our social behaviour has evolved primarily through contact with a limited number of other individuals. Yet as a species we exhibit uniformities on a global scale. This kind of emergent behaviour is familiar territory for statistical physicists.
2006
Strategies for fast convergence in semiotic dynamicsPDF
Artificial Life X, pages 480-485, 2006
Semiotic dynamics is a novel field that studies how semiotic conventions spread and stabilize in a population of agents. This is a central issue both for theoretical and technological reasons since large system made up of communicating agents, like web communities or artificial ...MORE ⇓
Semiotic dynamics is a novel field that studies how semiotic conventions spread and stabilize in a population of agents. This is a central issue both for theoretical and technological reasons since large system made up of communicating agents, like web communities or artificial embodied agents teams, are getting widespread. In this paper we discuss a recently introduced simple multi-agent model which is able to account for the emergence of a shared vocabulary in a population of agents. In particular we introduce a new deterministic agents' playing strategy that strongly improves the performance of the game in terms of faster convergence and reduced cognitive effort for the agents.
J. Stat. Mech., 2006
What processes can explain how very large populations are able to converge on the use of a particular word or grammatical construction without global coordination? Answering this question helps to understand why new language constructs usually propagate along an S-shaped curve ...MORE ⇓
What processes can explain how very large populations are able to converge on the use of a particular word or grammatical construction without global coordination? Answering this question helps to understand why new language constructs usually propagate along an S-shaped curve with a rather sudden transition towards global agreement. It also helps to analyze and design new technologies that support or orchestrate self-organizing communication systems, such as recent social tagging systems for the web. The article introduces and studies a microscopic model of communicating autonomous agents performing language games without any central control. We show that the system undergoes a disorder/order transition, going trough a sharp symmetry breaking process to reach a shared set of conventions. Before the transition, the system builds up non-trivial scale-invariant correlations, for instance in the distribution of competing synonyms, which display a Zipf-like law. These correlations make the system ready for the transition towards shared conventions, which, observed on the time-scale of collective behaviors, becomes sharper and sharper with system size. This surprising result not only explains why human language can scale up to very large populations but also suggests ways to optimize artificial semiotic dynamics.
Bootstrapping communication in language games: strategy, topology and all thatPDF
Proceedings of the 6th International Conference on the Evolution of Language, pages 11-18, 2006
Semiotic dynamics is a fast growing field according to which language can be seen as an evolving and self-organizing system. In this paper we present a simple multi-agent framework able to account for the emergence of shared conventions in a population. Agents perform pairwise ...MORE ⇓
Semiotic dynamics is a fast growing field according to which language can be seen as an evolving and self-organizing system. In this paper we present a simple multi-agent framework able to account for the emergence of shared conventions in a population. Agents perform pairwise games and final consensus is reached without any outside control nor any global knowledge of the system. In particular we discuss how embedding the population in a non trivial interaction topology affects the behavior of the system and forces to carefully consider agents selection strategies. These results cast an interesting framework to address and study more complex issues in semiotic dynamics.
Europhysics Letters 73(6):969-975, 2006
In this paper we analyze the effect of a non-trivial topology on the dynamics of the so-called Naming Game, a recently introduced model which addresses the issue of how shared conventions emerge spontaneously in a population of agents. We consider in particular the small-world ...MORE ⇓
In this paper we analyze the effect of a non-trivial topology on the dynamics of the so-called Naming Game, a recently introduced model which addresses the issue of how shared conventions emerge spontaneously in a population of agents. We consider in particular the small-world topology and study the convergence towards the global agreement as a function of the population size $N$ as well as of the parameter $p$ which sets the rate of rewiring leading to the small-world network. As long as $p \gg 1/N$ there exists a crossover time scaling as $N/p^2$ which separates an early one-dimensional-like dynamics from a late stage mean-field-like behavior. At the beginning of the process, the local quasi one-dimensional topology induces a coarsening dynamics which allows for a minimization of the cognitive effort (memory) required to the agents. In the late stages, on the other hand, the mean-field like topology leads to a speed up of the convergence process with respect to the one-dimensional case.
Physical Review E 74:036105, 2006
The Naming Game is a model of non-equilibrium dynamics for the self-organized emergence of a linguistic convention or a communication system in a population of agents with pairwise local interactions. We present an extensive study of its dynamics on complex networks, that can be ...MORE ⇓
The Naming Game is a model of non-equilibrium dynamics for the self-organized emergence of a linguistic convention or a communication system in a population of agents with pairwise local interactions. We present an extensive study of its dynamics on complex networks, that can be considered as the most natural topological embedding for agents involved in language games and opinion dynamics. Except for some community structured networks on which metastable phases can be observed, agents playing the Naming Game always manage to reach a global consensus. This convergence is obtained after a time generically scaling with the population's size $N$ as $t\_{conv} \sim N^{1.4 \pm 0.1}$, i.e. much faster than for agents embedded on regular lattices. Moreover, the memory capacity required by the system scales only linearly with its size. Particular attention is given to heterogenous networks, in which the dynamical activity pattern of a node depends on its degree. High degree nodes have a fundamental role, but require larger memory capacity. They govern the dynamics acting as spreaders of (linguistic) conventions. The effects of other properties, such as the average degree and the clustering, are also discussed.
2005
Physical Review E 73:015102, 2005
We investigate how very large populations are able to reach a global consensus, out of local ``microscopic'' interaction rules, in the framework of a recently introduced class of models of semiotic dynamics, the so-called Naming Game. We compare in particular the convergence ...MORE ⇓
We investigate how very large populations are able to reach a global consensus, out of local ``microscopic'' interaction rules, in the framework of a recently introduced class of models of semiotic dynamics, the so-called Naming Game. We compare in particular the convergence mechanism for interacting agents embedded in a low-dimensional lattice with respect to the mean-field case. We highlight that in low-dimensions consensus is reached through a coarsening process which requires less cognitive effort of the agents, with respect to the mean-field case, but takes longer to complete. In 1-d the dynamics of the boundaries is mapped onto a truncated Markov process from which we analytically computed the diffusion coefficient. More generally we show that the convergence process requires a memory per agent scaling as N and lasts a time N^{1+2/d} in dimension d<5 (d=4 being the upper critical dimension), while in mean-field both memory and time scale as N^{3/2}, for a population of N agents. We present analytical and numerical evidences supporting this picture.