Measuring Rhetoric: Statistical Language Models in Social Science

CONFERENCE: Law, Economics, & Business Workshop

Publication Year:
2015
Repository URL:
http://scholarship.law.berkeley.edu/law_econ/Spring2015/Schedule/10
Author(s):
Taddy, Matt
Tags:
Law and Economics
Abstract:
Social scientists are embracing the idea of using 'text as data' as a way to quantify and evaluate social theories. I'll discuss a brief history of how this strategy has worked and evolved, and pitch some new approaches for combining social measurement with state-of-the-art natural language processing. We'll focus on the massive multinomial regression models that serve as a basis for text analysis and the distributed computing strategies that allow inference on truly Big Data. I'll then work through a number of examples of social science questions being asked and answered via statistical NLP, with data from online reviews on Yelp, the US congressional record, and communications between buyers and sellers on eBay.