This repository provides an implementation of our ICML 2023 paper *Memory-Based Meta-Learning on Non-Stationary Distributions*.
Memory-based meta-learning is a technique for approximating Bayes-optimal predictors. Under fairly general conditions, minimizing sequential prediction error, measured by the log loss, leads to implicit meta-learning. The goal of this work is to investigate how far this interpretation can be realized by current sequence prediction models and training regimes. The focus is on piecewise stationary sources with unobserved switching-points, which arguably capture an important characteristic of natural language and action-observation sequences in partially observable environments. We show that various types of memory-based neural models, including Transformers, LSTMs, and RNNs, can learn to accurately approximate known Bayes-optimal algorithms and behave as if performing Bayesian inference over the latent switching-points and the latent parameters governing the data distribution within each segment.
The codebase is written in JAX and Haiku and contains all the code, datasets, and models necessary to reproduce the paper's results.
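For intuition, here is a minimal, self-contained sketch (not code from this repository) of the setting studied in the paper: a piecewise stationary Bernoulli source with unobserved switching-points, and the sequential log loss that a memory-based predictor is trained to minimize.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_piecewise_bernoulli(length: int, switch_prob: float = 0.02) -> np.ndarray:
  """Samples a binary sequence whose Bernoulli bias is redrawn uniformly
  at unobserved switching-points occurring with probability `switch_prob`."""
  bias = rng.uniform()
  seq = np.empty(length, dtype=np.int64)
  for t in range(length):
    if rng.uniform() < switch_prob:  # latent switching-point
      bias = rng.uniform()           # redraw the segment's latent parameter
    seq[t] = rng.uniform() < bias
  return seq

def log_loss(probs: np.ndarray, seq: np.ndarray) -> float:
  """Sequential log loss: -sum_t log p_t(x_t), where p_t is the predictor's
  probability of the observed symbol given the past."""
  p = np.where(seq == 1, probs, 1.0 - probs)
  return float(-np.sum(np.log(p)))

seq = sample_piecewise_bernoulli(1000)
# A predictor that always outputs 0.5 incurs log(2) per step; a good
# (meta-)learner instead tracks the current segment's bias and does better.
print(log_loss(np.full(len(seq), 0.5), seq))  # ~ 1000 * log(2) ≈ 693.1
```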
```
.
├── experiments
│   ├── config.py - Experiment configurations
│   ├── constants.py - Experiment constants
│   ├── distributions.py - Probability distributions
│   ├── evaluator.py - Evaluation loop
│   ├── live_and_die_predictors.py - LAD (Willems, 1996)
│   ├── local_launch.py - Local launch script
│   ├── ptw_predictors.py - PTW (Veness et al., 2013)
│   └── trajectory_generators.py - Trajectory generators
│
├── models
│   ├── basic.py - CNNs, MLPs, RNNs
│   ├── positional_encodings.py - ALiBi (Press et al., 2022), relative (Dai et al., 2019), sin/cos (Vaswani et al., 2017)
│   ├── stack_rnn.py - Stack-RNN (Joulin & Mikolov, 2015)
│   └── transformer.py - Transformer (Vaswani et al., 2017)
│
├── README.md
├── predictor_factories.py - Factories to initialize predictors
├── predictors.py - Predictor interface
├── base_config.py - Base configurations
├── base_constants.py - Base constants
├── requirements.txt - Dependencies
└── train.py - Training loop
```
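The exact Bayes-optimal baselines above (LAD and PTW) combine a within-segment estimator with a mixture over possible switching-points. As a flavour of the former, here is a sketch of the Krichevsky–Trofimov estimator for a binary alphabet; this is for illustration only, not the repository's implementation.

```python
def kt_probability_of_one(num_ones: int, num_zeros: int) -> float:
  """Krichevsky–Trofimov estimator: posterior-predictive probability of a 1
  under a Beta(1/2, 1/2) prior on the Bernoulli bias."""
  return (num_ones + 0.5) / (num_ones + num_zeros + 1.0)

# Sequential prediction with KT on a single stationary segment:
ones = zeros = 0
for symbol in [1, 1, 0, 1]:
  p_one = kt_probability_of_one(ones, zeros)
  # ... score -log(p) of the observed symbol here ...
  ones += symbol
  zeros += 1 - symbol
```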
Clone the source code into a local directory:

```bash
git clone https://github.com/deepmind/nonstationary_mbml.git
cd nonstationary_mbml
```

Then

```bash
pip install -r requirements.txt
```

will install all required dependencies.
This is best done inside a conda environment. To that end, install Anaconda. Then, create and activate the conda environment:

```bash
conda create --name nonstationary_mbml
conda activate nonstationary_mbml
```

Install pip and use it to install all the dependencies:

```bash
conda install pip
pip install -r requirements.txt
```
If you have a GPU available (highly recommended for fast training), you can install JAX with CUDA support:

```bash
pip install --upgrade "jax[cuda12_pip]" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
```

Note that the JAX version must correspond to the existing CUDA installation you wish to use (CUDA 12 in the example above). Please see the JAX documentation for more details.
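To verify that JAX can see the GPU, a quick check using standard JAX calls is:

```python
import jax

# Lists the devices JAX will use; expect one or more CUDA device entries
# if the CUDA-enabled installation succeeded.
print(jax.devices())
print(jax.default_backend())  # 'gpu' when running on CUDA, 'cpu' otherwise
```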
Before running any code, make sure to activate the conda environment and set the `PYTHONPATH`:

```bash
conda activate nonstationary_mbml
export PYTHONPATH=$(pwd)/..
```
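Since `PYTHONPATH` points at the repository's parent directory, modules are imported under the `nonstationary_mbml` package name. A quick sanity check (assuming the package imports cleanly from your shell) is:

```bash
python -c "import nonstationary_mbml.train"
```

If this fails, check that `PYTHONPATH` contains the directory that *contains* the cloned repository.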
We provide an example of a training and evaluation run at:

```bash
python experiments/local_launch.py
```

The experiment configurations can be adjusted in `base_config.py` and `experiments/config.py`.
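Both files are plain Python, so hyperparameters are changed by editing them directly. As a purely hypothetical illustration (the attribute names below are placeholders, not the repository's actual field names):

```python
# Hypothetical placeholders -- consult base_config.py and
# experiments/config.py for the actual attribute names.
config.model = 'transformer'   # e.g. 'transformer', 'lstm', 'rnn'
config.sequence_length = 256   # length of training sequences
config.batch_size = 64
```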
```bibtex
@inproceedings{genewein2023memory,
  author    = {Tim Genewein and
               Gr{\'{e}}goire Del{\'{e}}tang and
               Anian Ruoss and
               Li Kevin Wenliang and
               Elliot Catt and
               Vincent Dutordoir and
               Jordi Grau-Moya and
               Laurent Orseau and
               Marcus Hutter and
               Joel Veness},
  title     = {Memory-Based Meta-Learning on Non-Stationary Distributions},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
}
```
Copyright 2023 DeepMind Technologies Limited
All software is licensed under the Apache License, Version 2.0 (Apache 2.0); you may not use this file except in compliance with the Apache 2.0 license. You may obtain a copy of the Apache 2.0 license at: https://www.apache.org/licenses/LICENSE-2.0
All other materials are licensed under the Creative Commons Attribution 4.0 International License (CC-BY). You may obtain a copy of the CC-BY license at: https://creativecommons.org/licenses/by/4.0/legalcode
Unless required by applicable law or agreed to in writing, all software and materials distributed here under the Apache 2.0 or CC-BY licenses are distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the licenses for the specific language governing permissions and limitations under those licenses.
This is not an official Google product.