We advertise bulk data downloads via /api/document/zip. However, I noticed that:
- Browser responds with a Gateway Timeout error when visiting the link
- Looking at the cached file in the factoid container at
downloads/factoid_bulk, it appears to be recreated each time a request is made
This creates a barrier to data download.
- Cannot reproduce on unstable instance (works fine here)
- Able to reproduce on beta instance: Gateway Time-out (1); cannot reproduce (2)
Aside:
- I wonder if this endpoint is being hit by bots/Google
- We should probably not generate this file on the fly; other optimizations may be worth considering too
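
To make the "not on the fly" idea concrete, here is a rough sketch of one option: serve the cached archive while it is fresh, and only rebuild it when it is stale. This is not the existing implementation. It is written in Python purely for illustration, and `get_bulk_zip`, `is_fresh`, `load_documents`, and the one-day `max_age_seconds` default are all hypothetical names and values.

```python
import os
import tempfile
import time
import zipfile
from typing import Callable, Dict


def is_fresh(path: str, max_age_seconds: float) -> bool:
    """Return True if the file exists and was modified less than max_age_seconds ago."""
    try:
        return (time.time() - os.path.getmtime(path)) < max_age_seconds
    except OSError:
        # Missing or unreadable file counts as stale.
        return False


def get_bulk_zip(
    cache_path: str,
    load_documents: Callable[[], Dict[str, str]],  # hypothetical: returns {file name: contents}
    max_age_seconds: float = 24 * 60 * 60,         # assumed refresh interval
) -> str:
    """Return the path to the bulk archive, rebuilding it only when stale."""
    if is_fresh(cache_path, max_age_seconds):
        return cache_path  # reuse the cached archive; no regeneration per request

    directory = os.path.dirname(os.path.abspath(cache_path))
    os.makedirs(directory, exist_ok=True)

    # Build into a temporary file in the same directory, then swap it in,
    # so a concurrent reader never sees a half-written archive.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    os.close(fd)
    try:
        with zipfile.ZipFile(tmp_path, "w", zipfile.ZIP_DEFLATED) as archive:
            for name, content in load_documents().items():
                archive.writestr(name, content)
        os.replace(tmp_path, cache_path)
    except BaseException:
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
    return cache_path
```

With something like this, the request handler would only stream the file at the returned path. The rebuild could also be moved to a scheduled job so that no request ever waits on it, which is probably the safer route if the timeout comes from generation time.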