Pinned Loading
Repositories
Showing 10 of 45 repositories
- model-vs-human Public
Benchmark your model on out-of-distribution datasets with carefully collected human comparison data (NeurIPS 2021 Oral)
- sort-and-search Public
Code for the paper: "Efficient Lifelong Model Evaluation in an Era of Rapid Progress" [NeurIPS'24]
- frequency_determines_performance Public
Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurIPS'24]
- DataTypeIdentification Public
Code for the ICLR'24 paper: "Visual Data-Type Understanding does not emerge from Scaling Vision-Language Models"
- robustness Public
Robustness and adaptation of ImageNet scale models. Pre-Release, stay tuned for updates.