Extensions to Embedding Regression: Models for Context-Specific Description and Inference

dc.contributor.authorSpirling, Arthur
dc.date.accessioned2024-02-02T20:27:09Z
dc.date.available2024-02-02T20:27:09Z
dc.date.issued2024-02-02
dc.descriptionArthur Spirling is the Class of 1987 Professor of Politics at Princeton University. He received a bachelor’s and master’s degree from the London School of Economics, and a master’s degree and PhD from the University of Rochester. Previously, he served on the faculties of Harvard University and New York University. Spirling’s research centers on quantitative methods for analyzing political behavior, especially institutional development and the use of text-as-data. His work on these subjects has appeared in outlets such as the American Political Science Review, the American Journal of Political Science and the Journal of the American Statistical Association. Currently he is active on problems at the intersection of data science and social science, including those related to machine learning, and large language models. He previously won teaching and mentoring awards at Harvard and NYU, along with the “Emerging Scholar” prize from the Society for Political Methodology.
dc.description.abstractSocial scientists commonly seek to make statements about how word use varies over circumstances—including time, partisan identity, or some other document-level covariate. For example, researchers might wish to know how Republicans and Democrats diverge in their understanding of the term “immigration.” Building on the success of pretrained language models, we introduce the a la carte on text (conText) embedding regression model for this purpose. This fast and simple method produces valid vector representations of how words are used—and thus what words “mean”—in different contexts. We show that it outperforms slower, more complicated alternatives and works well even with very few documents. The model also allows for hypothesis testing and statements about statistical significance. We also provide extensions of the model to non-English languages and demonstrate applications for the same.
dc.identifier.urihttps://hdl.handle.net/2022/29604
dc.language.isoen
dc.publisherIndiana University Workshop in Methods
dc.relation.urihttps://purl.dlib.indiana.edu/iudl/media/t05346mf0m
dc.titleExtensions to Embedding Regression: Models for Context-Specific Description and Inference
dc.typePresentation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2024_02_02_wim_spirling_embeddings_slides.pdf
Size:
4.26 MB
Format:
Adobe Portable Document Format
Can’t use the file because of accessibility barriers? Contact us