mdrpanwar/icl-bayesian-prism
This repository contains the code for our ICLR 2024 paper:

In-Context Learning through the Bayesian Prism
Madhur Panwar*, Kabir Ahuja*, Navin Goyal
Paper: https://openreview.net/forum?id=HX5ujdsSon

    @inproceedings{
        panwar2024incontext,
        title={In-Context Learning through the Bayesian Prism},
        author={Madhur Panwar and Kabir Ahuja and Navin Goyal},
        booktitle={The Twelfth International Conference on Learning Representations},
        year={2024},
        url={https://openreview.net/forum?id=HX5ujdsSon}
    }

Code has been adapted from https://github.com/dtsip/in-context-learning:

    @InProceedings{garg2022what,
        title={What Can Transformers Learn In-Context? A Case Study of Simple Function Classes},
        author={Shivam Garg and Dimitris Tsipras and Percy Liang and Gregory Valiant},
        year={2022},
        booktitle={arXiv preprint}
    }

Getting started

You can start by cloning our repository and following the steps below.

  1. Install the dependencies for our code using Conda. You may need to adjust the environment YAML file depending on your setup.

    conda env create -f environment.yml
    conda activate in-context-learning
    
  2. [Optional] If you plan to train, populate conf/wandb.yaml with your wandb info.
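A sketch of what conf/wandb.yaml might contain; the exact keys depend on how the file is structured in this repo, so match the placeholders already present there. The values below are illustrative only:

```yaml
# Illustrative values only -- replace with your own wandb account details.
entity: your-wandb-username
project: icl-bayesian-prism
```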

Reproducing Experiments in the Paper

The code for all the experiments lives in the src/ directory. Change into it before running any of the commands below.

cd src

Linear inverse problems

To train models on Dense Regression (DR) function class run:

python train.py --config conf/linear_regression.yaml

To train models on Sparse Regression (SR) function class run:

python train.py --config conf/sparse_regression.yaml

To train models on Sign-Vector Regression (SVR) function class run:

python train.py --config conf/sign_vec_cs.yaml

To train models on Low-Rank Regression function class run:

python train.py --config conf/low_rank_cs.yaml

Non-linear functions

For Fourier Series, run the following:

python train.py --config conf/fourier_series.yaml
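As a rough illustration of this function class (not the repo's exact sampler), a random Fourier series up to a maximum frequency can be drawn as below; the coefficient distribution and frequency cap are assumptions for the sketch:

```python
import numpy as np

def sample_fourier_series(max_freq=10, rng=None):
    """Return f(x) = a0 + sum_k (a_k cos(kx) + b_k sin(kx)) with random coefficients."""
    rng = np.random.default_rng() if rng is None else rng
    a = rng.standard_normal(max_freq + 1)  # a[0] is the constant term
    b = rng.standard_normal(max_freq + 1)  # b[0] is unused

    def f(x):
        k = np.arange(1, max_freq + 1)
        return a[0] + (a[1:] * np.cos(np.outer(x, k))
                       + b[1:] * np.sin(np.outer(x, k))).sum(axis=-1)

    return f

f = sample_fourier_series(max_freq=3, rng=np.random.default_rng(0))
xs = np.linspace(-np.pi, np.pi, 5)
ys = f(xs)  # (x_i, f(x_i)) pairs would form the in-context prompt
```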

To evaluate inductive biases using the DFT method after training the model, run:

python fourier_series_inductive_bias.py <RUN_ID_Of_TrainedModel> <MaxFreq> interpolate Transformer True
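Independent of the script above, the core idea of a DFT probe can be sketched as: sample the learned function on a uniform grid over one period and inspect the magnitude of its Fourier coefficients. A minimal numpy sketch, with a known test signal standing in for the model's predictions:

```python
import numpy as np

def dft_frequency_profile(ys):
    """Magnitude of each frequency component of samples taken over one period."""
    coeffs = np.fft.rfft(ys) / len(ys)
    return np.abs(coeffs)

# Stand-in "prediction": a pure frequency-2 sine on a 64-point grid over [0, 2*pi).
grid = np.linspace(0, 2 * np.pi, 64, endpoint=False)
profile = dft_frequency_profile(np.sin(2 * grid))
# The energy concentrates at index 2, revealing the dominant frequency.
```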

For Monomial Regression, run the following:

python train.py --config conf/polynomials_deg2_monomials_fixed_S.yaml

For Haar wavelets, run the following:

python train.py --config conf/haar_wavelets.yaml

Task Mixtures

For DR + SR mixture:

python train.py --config conf/lr_sr.yaml

For DR + SVR mixture:

python train.py --config conf/lr_sign_vec_cs_mixer.yaml

For DR + SR + SVR mixture:

python train.py --config conf/lr_sr_sign_vec_cs_mixer.yaml

For DR + DT mixture:

python train.py --config conf/lr_dt.yaml

For DT + NN mixture:

python train.py --config conf/dt_relu_2nn_mixer.yaml

For Fourier Series Mixtures (simplicity bias) run:

python train.py --config conf/fourier_series_simpl.yaml

The DFT analysis can be performed as in the single Fourier series experiment above by running:

python fourier_series_mixture_inductive_bias.py <RUN_ID_Of_TrainedModel> <MaxFreq> interpolate Transformer True

For GMM experiments run:

python train.py --config conf/gmm_linear_regression.yaml
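The config name suggests linear regression with weights drawn from a Gaussian mixture prior. A hedged sketch of such a task sampler; the dimension, component means, and unit variances are made-up values for illustration, not the repo's settings:

```python
import numpy as np

def sample_gmm_linear_task(dim=8, means=(-2.0, 2.0), rng=None):
    """Draw w from a two-component Gaussian mixture; return the task x -> w @ x."""
    rng = np.random.default_rng() if rng is None else rng
    mu = means[rng.integers(len(means))]   # pick a mixture component uniformly
    w = mu + rng.standard_normal(dim)      # unit-variance Gaussian around its mean
    return lambda x: x @ w

task = sample_gmm_linear_task(dim=4, rng=np.random.default_rng(0))
y = task(np.ones(4))
```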

For Monomials Multitask experiments run:

python train.py --config conf/polynomials_deg2_monomials_fixed_K_d10.yaml

For Fourier Series Multitask experiments run:

python train.py --config conf/fourier_series_multitask_fixedK.yaml

For Forgetting (Noisy Linear Regression) experiments run:

python train.py --config conf/task_diversity_noisy_lr.yaml

Evaluations

For evaluation on the different tasks, refer to the notebooks provided in EvalNotebooks/.
