R package to run estimations with weighted replicate samples and plausible values
Rrepest
estimates statistics using replicate weights
(balanced repeated replication/brr weights,
jackknife replicate weights,…), thus accounting for complex
survey designs in the estimation of sampling variances. It is especially
designed for use with the international education datasets produced by
the OECD and the IEA, and also allows
for analyses with multiply imputed variables (plausible values); where
plausible values are used, the average estimator across plausible values
is reported and the imputation error is added to the variance
estimator.
A “cheat sheet” including an overview of the syntax and uses of
Rrepest
is available here
Run the following code:
install.packages("Rrepest")
Download Rrepest, then run
Run the following code replacing “You_R_Name” with your username:
install.packages("C:/Users/You_R_Name/Downloads/Rrepest.tar.gz",
repos = NULL,
type ="source")
Run:
library(Rrepest)
Run the following code replacing “MY_TOKEN” with your gitlab token:
::install_gitlab("edu_data/rrepest", host = "https://algobank.oecd.org:4430", upgrade = "never", auth_token = "MY_TOKEN") remotes
Note: It will take a few minutes to install
Run:
library(Rrepest)
Note: Ensure you have the package data.table
installed.
For a complete list of the dependencies used, consult the Description
file.
The current version supports uni-variate statistics (e.g. mean, variance, standard deviation, quantiles), frequencies, linear regression and covariance.
# PISA 2018 Data
# df.qqq <- readRDS("//oecdmain/asgenedu/EDUCATION_DATALAKE/sources/PISA/PISA 2018/R/STU/CY07_MSU_STU_QQQ.rds")
::Rrepest(data = df.qqq,
Rrepestsvy = "PISA2015",
est = est(c("mean","var","quant",0.5,"iqr",c(.9,.1)),"CNTSCHID"),
by = c("cnt"))
# TALIS 2018 Data
# df.t <- readRDS(file = "V:/TALIS/BACKUP/DATA/TALIS2018/R/PUF/TTGINTT3_demo.rds")
::Rrepest(data = df.t,
Rrepestsvy = "TALISTCH",
est = est("freq","TT3G06I2"),
over = c("TT3G06A2","TT3G52J"),
by = "cntry",
test = T,
isced = 2)
# TALIS 2018 Data
# df.t <- readRDS(file = "V:/TALIS/BACKUP/DATA/TALIS2018/R/PUF/TTGINTT3_demo.rds")
<- df.t %>%
df.t mutate(TT3G01_rec = case_when(TT3G01 == 2 ~ 1,
== 1 ~ 0))
TT3G01
::Rrepest(data = df.t,
Rrepestsvy = "TALISTCH",
est = est("lm","TT3G01_rec",'TT3G39C'),
by = "cntry")
# PISA 2018 Data
# df.qqq <- readRDS("//oecdmain/asgenedu/EDUCATION_DATALAKE/sources/PISA/PISA 2018/R/STU/CY07_MSU_STU_QQQ.rds")
::Rrepest(data = df.qqq,
Rrepestsvy = "PISA2015",
est = est("corr",c("pv@math","pv@read")),
by = c("CNT"))
Further examples can be found in the Examples.R file
To incorporate analyses that are not pre-programmed into Rrepest, you
can utilize the ‘gen’ option within the
est()
function of Rrepest. More about it is presented in
the following wiki.
Francesco Avvisati, Rodolfo Ilizaliturri and François Keslair.
Contact us if you want to join!
Do you have suggestions or comments? Please open an issue
First public release (30 June 2023).