Another Look at Inference After Prediction

This repository contains the code and results for the paper "Another Look at Inference After Prediction" by Jessica Gronsbell, Jianhui Gao, Yaqi Shi, Zachary R. McCaw, and David Cheng. You can find the preprint here.

Overview

Our paper investigates the statistical efficiency of prediction-based (PB) inference methods. We analyze and compare several PB inference approaches, including the prediction-powered inference (PPI) method of Angelopoulos et al. (2023) and the Chen and Chen (2000) estimator, through theoretical and numerical evaluations.

The repository includes:

Implementation of PB inference methods discussed in the paper.
Simulations and analyses used to generate the results in the paper.
Code to reproduce our UK Biobank analysis. Access to UK Biobank is required because the data cannot be released.

The simulations require the R packages dplyr, broom, and parallel.
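Before running anything, it can help to confirm that these packages are available. The snippet below is a minimal sketch rather than part of the repository; the variable names `required`, `is_installed`, and `to_install` are introduced here for illustration. Note that parallel ships with base R, so only dplyr and broom would normally need installing.

```r
# Packages named in this README
required <- c("dplyr", "broom", "parallel")

# requireNamespace() returns TRUE if a package can be loaded, FALSE otherwise
is_installed <- vapply(required, requireNamespace, logical(1), quietly = TRUE)
to_install <- required[!is_installed]

if (length(to_install) > 0) {
  message("Missing packages: ", paste(to_install, collapse = ", "))
  # Uncomment the next line to install the missing packages from CRAN
  # install.packages(to_install)
} else {
  message("All required packages are available.")
}
```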

Simple Example

# Set the working directory
# Update this path based on your local setup
setwd('~/PBInference/Scripts')

# Load dependencies
library(dplyr)
library(broom)
library(parallel)

# Load functions
source('estimators.R')
source('data_gen.R')

# Set up parameters
n_train <- 10000
n_test <- 1000
n_val <- 9000

# Scenario details are described in the paper
scenario <- "1a"

# Set seed
set.seed(2025)

# Generate data
sim_dat <- data_gen(n_train, n_test, n_val, scenario)

# Estimate the coefficient and its standard error with each method
ppi <- predpowinf(sim_dat)[, c("ppi_beta", "ppi_se")]
ppi_full <- predpowinf_full(sim_dat)[, c("ppi_full_beta", "ppi_full_se")]
cc <- chen_chen(sim_dat)[, c("cc_beta", "cc_se")]
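Each estimator above returns a coefficient estimate and a standard error, so one natural next step is a Wald-type confidence interval. The helper below is a sketch under the assumption that a normal approximation is appropriate; `wald_ci` is a hypothetical function that is not part of this repository, and the numbers used to demonstrate it are made up for illustration, not results from the paper.

```r
# Hypothetical helper (not part of the repository): Wald confidence interval
# from a point estimate and its standard error, using a normal approximation
wald_ci <- function(beta, se, level = 0.95) {
  z <- qnorm(1 - (1 - level) / 2)  # two-sided normal critical value
  data.frame(
    estimate = beta,
    lower = beta - z * se,
    upper = beta + z * se
  )
}

# Made-up values for illustration only
example_beta <- 1.0
example_se <- 0.1

ci <- wald_ci(example_beta, example_se)
print(ci)
```

With the objects from the example, a call such as `wald_ci(ppi[, "ppi_beta"], ppi[, "ppi_se"])` should work, assuming those columns are numeric. Comparing the interval widths across `ppi`, `ppi_full`, and `cc` gives a quick view of the relative efficiency of the methods.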

Acknowledgments

This repository builds on the code from the PredictionBasedInference repository. We thank Motwani and Witten (2023) for publicly sharing their code, which allowed us to further investigate those methods in this manuscript.

Contact

For questions or issues, please contact Yaqi Shi or open an issue on this repository.
