Noah A Smith
2012
Mapping the geographical diffusion of new wordsPDF
ArXiv Preprint, pages 1210.5268, 2012
Language in social media is rich with linguistic innovations, most strikingly in the new words and spellings that constantly enter the lexicon. Despite assertions about the power of social media to connect people across the world, we find that many of these neologisms are ...MORE ⇓
Language in social media is rich with linguistic innovations, most strikingly in the new words and spellings that constantly enter the lexicon. Despite assertions about the power of social media to connect people across the world, we find that many of these neologisms are restricted to geographically compact areas. Even for words that become ubiquituous, their growth in popularity is often geographical, spreading from city to city. Thus, social media text offers a unique opportunity to study the diffusion of lexical change. In this paper, we show how an autoregressive model of word frequencies in social media can be used to induce a network of linguistic influence between American cities. By comparing the induced network with the geographical and demographic characteristics of each city, we can measure the factors that drive the spread of lexical innovation.