Repository for paper "Embedding Regression: Models for Context-Specific Description and Inference"

sophiaaknight/EmbeddingRegression

 
 

Embedding Regression: Models for Context-Specific Description and Inference

Paper and related materials for Rodriguez, Spirling and Stewart (2022). The abstract for the paper is as follows:

Social scientists commonly seek to make statements about how a word’s use varies over circumstances—whether that be time, partisan identity, or some other document-level covariate. A promising avenue is the use of domain-specific word embeddings, that simultaneously allow for statements of uncertainty and statistical inference. We introduce the a la Carte on Text (conText) embedding regression model for this purpose. We extend and validate a simple linear method of refitting pre-trained embeddings to local contexts that requires minimal input data. It outperforms well-known competitors for studying changes in meaning across groups and time. Our approach allows us to speak descriptively of systematic differences across covariates in the context in which words appear. It also allows comments about whether a particular use is statistically significantly different to another. We provide open-source software for fitting the model.
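
As a rough illustration of the idea in the abstract, the sketch below builds a context-specific embedding for each use of a target word by averaging the pre-trained embeddings of its surrounding words and applying a linear transformation. It then compares two groups of uses by the difference in their mean embeddings, which is what a regression on a single binary covariate would estimate. This is a toy sketch in plain Python, not the authors' R software: the vocabulary, the vector values, the identity transformation matrix `A`, and all function names are assumptions made up for this example. It also omits the uncertainty estimates that the paper's inference relies on.

```python
import math

# Toy 2-dimensional "pre-trained" embeddings (made-up values, illustration only).
PRETRAINED = {
    "border": [1.0, 0.0],
    "wall": [0.8, 0.2],
    "visa": [0.0, 1.0],
    "family": [0.2, 0.8],
}

# Hypothetical transformation matrix. The identity is used here for simplicity;
# in practice such a matrix would be estimated from a large corpus.
A = [[1.0, 0.0], [0.0, 1.0]]


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    if not vectors:
        raise ValueError("cannot average an empty list of vectors")
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]


def context_embedding(context_words):
    """Embed one use of the target word: A times the mean context embedding."""
    vectors = [PRETRAINED[w] for w in context_words if w in PRETRAINED]
    return matvec(A, mean_vector(vectors))


def group_difference(instances):
    """Difference in mean context embeddings between group 1 and group 0.

    `instances` is a list of (context_words, group) pairs with group in {0, 1}.
    Returns the difference vector and its Euclidean norm.
    """
    g0 = [context_embedding(c) for c, g in instances if g == 0]
    g1 = [context_embedding(c) for c, g in instances if g == 1]
    m0, m1 = mean_vector(g0), mean_vector(g1)
    beta = [b - a for a, b in zip(m0, m1)]
    return beta, math.sqrt(sum(b * b for b in beta))


# Each instance lists the words around one occurrence of a target word,
# plus a binary document-level covariate (for example, a party indicator).
instances = [
    (["border", "wall"], 0),
    (["wall", "border", "border"], 0),
    (["visa", "family"], 1),
    (["family", "visa", "visa"], 1),
]

beta, norm = group_difference(instances)
print(beta, norm)
```

A larger norm suggests the target word appears in more different contexts across the two groups. Judging whether such a difference is statistically significant requires an inference step (for instance resampling) that this sketch leaves out.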

You can find the paper here and a non-technical explainer here.

R software for fitting our models is here, along with a vignette and links to data sets.

Comments are very welcome: please send us an email, or open an "Issue" here.
