diff --git a/wikiweb2m.md b/wikiweb2m.md index 5585e39..6cc6c28 100644 --- a/wikiweb2m.md +++ b/wikiweb2m.md @@ -7,6 +7,7 @@ section each text and image element comes from. This dataset is a contribution from our upcoming paper `A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding`. + The dataset is stored as gzipped TFRecord files which can be downloaded via these links. **Train**