diff --git a/README.md b/README.md index c3baf21f..66cef5c2 100644 --- a/README.md +++ b/README.md @@ -13,8 +13,9 @@ Catalog: - [x] Download of bootstrapped pre-training datasets -### Inference demo (Image Captioning and VQA): -Run our interactive demo using Colab notebook (no GPU needed): +### Inference demo: +Run our interactive demo using Colab notebook (no GPU needed). +The demo includes code for: (1) image captioning, (2) open-ended visual question answering, (3) multimodal / unimodal feature extraction. ### Pre-trained checkpoints: Num. pre-train images | BLIP w/ ViT-B | BLIP w/ ViT-B and CapFilt-L | BLIP w/ ViT-L