Does In-Context Learning Really Learn? Rethinking How Large Language Models Respond and Solve Tasks via In-Context Learning
This repository contains the code for the COLM 2024 paper: Does In-Context Learning Really Learn? Rethinking How Large Language Models Respond and Solve Tasks via In-Context Learning
The code was tested with:
Python >= 3.10
transformers >= 4.36
datasets >= 2.20.0
pytorch >= 2.1.0
faiss-cpu >= 1.7.4
pandas
scikit-learn
numpy
Optional dependencies include:
accelerate >= 0.30.1 (for distributed inference, used together with deepspeed)
deepspeed >= 0.12.6
We do not own the datasets evaluated in our experiments; please download them via huggingface. The script `scripts/prepare_datasets/generate_dataset.sh` serves this purpose: it downloads the datasets from huggingface and performs some pre-processing, including sampling the examples used as in-context demonstrations.
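For reference, the demonstration-sampling part of the pre-processing can be sketched as follows. This is a minimal illustration, not code from the repository: the `sample_demonstrations` helper and the toy example pool are hypothetical stand-ins for the actual script's logic.

```python
import random

def sample_demonstrations(pool, k, seed=42):
    # Hypothetical helper: draw k demonstration examples from a dataset pool,
    # with a fixed seed so the same demonstrations are sampled on every run.
    rng = random.Random(seed)
    return rng.sample(pool, k)

# Toy stand-in for a downloaded dataset split.
pool = [{"text": f"example {i}", "label": i % 2} for i in range(100)]
demos = sample_demonstrations(pool, k=4)
print(len(demos))  # 4
```

Fixing the random seed is what makes the sampled demonstration set reproducible across inference runs.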
In the `scripts` folder, the `section5`, `section6`, and `section7` folders contain the bash files used for the experiments in the corresponding sections of our paper. To perform single-GPU inference, replace `accelerate launch --config_file "./acc_config_dist.yaml"` with `python`; you need to set the batch size accordingly. The evaluation scripts, which are included in the corresponding bash files, will be run after inference finishes.
@inproceedings{
long2024does,
title={Does In-Context Learning Really Learn? Rethinking How Large Language Models Respond and Solve Tasks via In-Context Learning},
author={Quanyu Long and Yin Wu and Wenya Wang and Sinno Jialin Pan},
booktitle={First Conference on Language Modeling},
year={2024},
url={https://openreview.net/forum?id=i2oJjC0ESQ}
}