A fast, flexible and transparent framework to estimate context-specific word and short document embeddings using the 'a la carte'
embeddings approach developed by Khodak et al. (2018) <doi:10.48550/arXiv.1805.05388> and evaluate hypotheses about covariate effects on embeddings using
the regression framework developed by Rodriguez et al. (2021)<doi:10.1017/S0003055422001228>. New version of the package applies a new estimator to measure the distance between word embeddings as described in Green et al. (2025) <doi:10.1017/pan.2024.22>.
Version: |
3.0.0 |
Depends: |
R (≥ 3.6.0) |
Imports: |
dplyr, Matrix (≥ 1.3-2), quanteda (≥ 3.0.0), text2vec (≥
0.6), reshape2 (≥ 1.4.4), fastDummies (≥ 1.6.3), stringr (≥
1.4.0), tidyr (≥ 1.1.3), ggplot2, broom, foreach, methods, estimatr, utils, stats, tidyselect |
Suggests: |
SnowballC (≥ 0.7.0), hunspell, knitr, rmarkdown, formatR, testthat (≥ 3.0.0), doParallel, data.table, RcppParallel |
Published: |
2025-09-03 |
DOI: |
10.32614/CRAN.package.conText |
Author: |
Pedro L. Rodriguez
[aut, cph],
Arthur Spirling
[aut],
Brandon Stewart
[aut],
Christopher Barrie
[ctb],
Sofia Avila [cre,
aut] |
Maintainer: |
Sofia Avila <sofiaavila at princeton.edu> |
BugReports: |
https://github.com/prodriguezsosa/ConText/issues |
License: |
GPL-3 |
URL: |
https://github.com/prodriguezsosa/conText |
NeedsCompilation: |
no |
Language: |
en-US |
Materials: |
README |
CRAN checks: |
conText results |