Skip loading extra data for export #1051

mwu1993 · 2019-10-14T23:31:22Z

Summary: When performing a standalone model export, we just need one batch of data to pass through. But we currently get the data via data.batches(...) which could use PoolingBatcher, which loads many many examples, or could have in_memory=True, which loads all examples. This can take hours for a large dataset (especially since we also tokenize the data). Add a flag for batches() that forcibly skips loading all data into memory and using pooling.

Differential Revision: D17920179

Summary: When performing a standalone model export, we just need one batch of data to pass through. But we currently get the data via `data.batches(...)` which could use PoolingBatcher, which loads many many examples, or could have `in_memory=True`, which loads all examples. This can take hours for a large dataset (especially since we also tokenize the data). Add a flag for `batches()` that forcibly skips loading all data into memory and using pooling. Differential Revision: D17920179 fbshipit-source-id: b878c9594d4f41752a8f0fb835d706602e68c1eb

facebook-github-bot · 2019-10-14T23:31:38Z

This pull request was exported from Phabricator. Differential Revision: D17920179

facebook-github-bot · 2019-10-16T22:44:23Z

This pull request has been merged in a0f6002.

facebook-github-bot added the CLA Signed Do not delete this pull request or issue due to inactivity. label Oct 14, 2019

facebook-github-bot closed this in a0f6002 Oct 16, 2019

facebook-github-bot added the Merged label Oct 16, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Skip loading extra data for export #1051

Skip loading extra data for export #1051

mwu1993 commented Oct 14, 2019

facebook-github-bot commented Oct 14, 2019

facebook-github-bot commented Oct 16, 2019

Skip loading extra data for export #1051

Skip loading extra data for export #1051

Conversation

mwu1993 commented Oct 14, 2019

facebook-github-bot commented Oct 14, 2019

facebook-github-bot commented Oct 16, 2019