This is the official codebase for Atari-GPT, a new benchmark for Large Language Models (LLMs) on Atari games. To see more results see our project webpage.
In order to run the code you need to have an API key for the respective model. To reproduce results from the paper you will need all 3 API keys.
For Google you can get an API key here. Once you have this API key put it in a file called GOOGLE_API_KEY.txt.
For Anthropic you can get an API key here. Once you have this API key put it in a file called ANTHROPIC_API_KEY.txt.
For OpenAI you can get an API key here. Once you have this API key put it in a file called OPENAI_API_KEY.txt.
To run the code you will need to have Anaconda and run the following commands:
conda env create --file=environment.yaml
conda activate atari_gpt
python full_evaluation.py
@misc{waytowich2024atarigptinvestigatingcapabilitiesmultimodal,
title={Atari-GPT: Investigating the Capabilities of Multimodal Large Language Models as Low-Level Policies for Atari Games},
author={Nicholas R. Waytowich and Devin White and MD Sunbeam and Vinicius G. Goecks},
year={2024},
eprint={2408.15950},
archivePrefix={arXiv},
primaryClass={cs.AI},
url={https://arxiv.org/abs/2408.15950},
}