Language Evolution and Computation Bibliography

Our site (www.isrl.uiuc.edu/amag/langev) retired, please use https://langev.com instead.
Damian E Blasi
2018
Journal of Language Evolution 3(2):94-129, 2018
The increasing availability of large digital corpora of cross-linguistic data is revolutionizing many branches of linguistics. Overall, it has triggered a shift of attention from detailed questions about individual features to more global patterns amenable to rigorous, but ...MORE ⇓
The increasing availability of large digital corpora of cross-linguistic data is revolutionizing many branches of linguistics. Overall, it has triggered a shift of attention from detailed questions about individual features to more global patterns amenable to rigorous, but statistical, analyses. This engenders an approach based on successive approximations where models with simplified assumptions result in frameworks that can then be systematically refined, always keeping explicit the methodological commitments and the assumed prior knowledge. Therefore, they can resolve disputes between competing frameworks quantitatively by separating the support provided by the data from the underlying assumptions. These methods, though, often appear as a ‘black box’ to traditional practitioners. In fact, the switch to a statistical view complicates comparison of the results from these newer methods with traditional understanding, sometimes leading to misinterpretation and overly broad claims. We describe here this evolving methodological shift, attributed to the advent of big, but often incomplete and poorly curated data, emphasizing the underlying similarity of the newer quantitative to the traditional comparative methods and discussing when and to what extent the former have advantages over the latter. In this review, we cover briefly both randomization tests for detecting patterns in a largely model-independent fashion and phylolinguistic methods for a more model-based analysis of these patterns. We foresee a fruitful division of labor between the ability to computationally process large volumes of data and the trained linguistic insight identifying worthy prior commitments and interesting hypotheses in need of comparison.
2017
Nature Human Behaviour 1:723-729, 2017
Most languages of the world are taken to result from a combination of a vertical transmission process from older to younger generations of speakers or signers and (mostly) gradual changes that accumulate over time. In contrast, creole languages emerge within a few generations out ...MORE ⇓
Most languages of the world are taken to result from a combination of a vertical transmission process from older to younger generations of speakers or signers and (mostly) gradual changes that accumulate over time. In contrast, creole languages emerge within a few generations out of highly multilingual societies in situations where no common first language is available for communication (as, for instance, in plantations related to the Atlantic slave trade). Strikingly, creoles share a number of linguistic features (the ‘creole profile’), which is at odds with the striking linguistic diversity displayed by non-creole languages 1–4 . These common features have been explained as reflecting a hardwired default state of the possible grammars that can be learned by humans 1 , as straightforward solutions to cope with the pressure for efficient and successful communication 5 or as the byproduct of an impoverished transmission process 6 . Despite their differences, these proposals agree that creoles emerge from a very limited and basic communication system (a pidgin) that only later in time develops the characteristics of a natural language, potentially by innovating linguistic structure. Here we analyse 48 creole languages and 111 non-creole languages from all continents and conclude that the similarities (and differences) between creoles can be explained by genealogical and contact processes 7,8 , as with non-creole languages, with the difference that creoles have more than one language in their ancestry. While a creole profile can be detected statistically, this stems from an over-representation of Western European and West African languages in their context of emergence. Our findings call into question the existence of a pidgin stage in creole development and of creole-specific innovations. In general, given their extreme conditions of emergence, they lend support to the idea that language learning and transmission are remarkably resilient processes. There are striking similarities among creole languages. Blasi et al. show that these similarities can in fact be explained by the same processes as for non-creole languages, the difference being that creoles have more than one language in their ancestry.
2015
PNAS 112(5):1322-7, 2015
We summarize a number of findings in laryngology demonstrating that perturbations of phonation, including increased jitter and shimmer, are associated with desiccated ambient air. We predict that, given the relative imprecision of vocal fold vibration in desiccated versus humid ...MORE ⇓
We summarize a number of findings in laryngology demonstrating that perturbations of phonation, including increased jitter and shimmer, are associated with desiccated ambient air. We predict that, given the relative imprecision of vocal fold vibration in desiccated versus humid contexts, arid and cold ecologies should be less amenable, when contrasted to warm and humid ecologies, to the development of languages with phonemic tone, especially complex tone. This prediction is supported by data from two large independently coded databases representing 3,700+ languages. Languages with complex tonality have generally not developed in very cold or otherwise desiccated climates, in accordance with the physiologically based predictions. The predicted global geographic-linguistic association is shown to operate within continents, within major language families, and across language isolates. Our results offer evidence that human sound systems are influenced by environmental factors.