Description
Opened on Apr 21, 2021
Source Site
https://www.aucklandmuseum.com/discover/collections/our-data
Provider API Endpoint / Documentation
Tutorial: https://github.com/AucklandMuseum/API/wiki/Tutorial#fields-available-for-query-string-searches
Endpoint: https://api.aucklandmuseum.com/
Provider description
The Auckland Museum displays a wide variety of historical artifacts and collections, and also provides education and research resources.
Provider API Technical info
Rate Limit: 10 requests per second / 1000 requests per day.
This issue has been migrated from the CC Search Catalog repository
Author: akshgpt7
Date: Mon Feb 24 2020
Labels: Hacktoberfest,help wanted,providers,✨ goal: improvement,🙅 status: discontinued
Original Comments:
akshgpt7 commented on Tue Feb 25 2020:
@annatuma, @mathemancer I found out that the Auckland Museum's online collection has CC licensed images, which can be identified by the copyright field in the response JSON.
mathemancer commented on Tue Feb 25 2020:
@akshgpt7 That's great, thank you! Further info to gather would be:
- What is the overall volume of objects to be queried?
- Is it possible to catalog all CC-licensed items via API calls in some reasonable way? (I.e., there must be some way to systematically loop through all objects, gathering their metadata)
akshgpt7 commented on Fri Feb 28 2020:
- The current volume of objects with CC in the copyright field is a bit over 100k.
- To catalog the CC-licensed items, the following endpoint can be used: https://api.aucklandmuseum.com/search/collectionsonline/_search?q=copyright:CC&has_image=true. For each object, there's a primaryRepresentation field that contains the image URL. I believe it's very much possible to extract the items through the same.
Most of the images are licensed under this license: http://creativecommons.org/licenses/by/4.0/ (Needs confirmation).
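As a rough illustration of the approach described above, the sketch below builds the search URL and pulls primaryRepresentation out of a parsed response. It is not taken from the eventual provider script. The `from`/`size` paging parameters and the Elasticsearch-style `hits.hits[]._source` response layout are assumptions suggested by the `_search` path and are not confirmed anywhere in this thread; the function names are hypothetical.

```python
from urllib.parse import urlencode

# Endpoint quoted in the comment above.
BASE_URL = "https://api.aucklandmuseum.com/search/collectionsonline/_search"


def build_search_url(start=0, size=100):
    """Build the CC-only, image-only search URL for one page of results.

    ASSUMPTION: `from` and `size` are Elasticsearch-style paging
    parameters; the thread does not confirm that the API accepts them.
    """
    params = {
        "q": "copyright:CC",
        "has_image": "true",
        "from": start,
        "size": size,
    }
    # Keep ":" unescaped so the query matches the URL quoted in the thread.
    return f"{BASE_URL}?{urlencode(params, safe=':')}"


def extract_image_urls(response_json):
    """Collect primaryRepresentation values from a parsed JSON response.

    ASSUMPTION: records sit under hits.hits[]._source, as in a typical
    Elasticsearch response. Records without an image URL are skipped.
    """
    hits = response_json.get("hits", {}).get("hits", [])
    urls = []
    for hit in hits:
        image_url = hit.get("_source", {}).get("primaryRepresentation")
        if image_url:
            urls.append(image_url)
    return urls
```

Looping `start` upward in steps of `size` until a page comes back empty would be one way to walk the whole result set, provided the paging assumption holds.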
mathemancer commented on Mon Mar 02 2020:
This is great info @akshgpt7 , thank you!
annatuma commented on Thu Mar 05 2020:
@akshgpt7 you are welcome to tackle this integration if you're interested in doing so. Let us know; for now, this ticket is assigned to you to work on.
akshgpt7 commented on Fri May 29 2020:
@mathemancer I have been working on this script.
I have one question moving forward. There is a rate limit of 1000 requests per day. How do I go about handling checks to not exceed that in a day?
I was thinking something like getting the time of the day at the start of the script, and maintaining a request_count. Then getting the time of the day on each request and stopping the script if request_count hits 1000 (on the same day), or refreshing it if the day passes before completing 1000 requests.
However, I'm not sure if this is the right way to go about handling the 1000 requests/day limit. Moreover, how do we make sure to start off the next day from the same page where we left off on the previous day?
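The counter idea proposed in this comment could look roughly like the sketch below. It is only an illustration of the question being asked, not the approach the project adopted. The class name, the checkpoint file format, and the choice of the local calendar date as the reset boundary are all assumptions; the thread does not say when the API's daily quota actually resets.

```python
import datetime
import json
from pathlib import Path


class DailyRequestBudget:
    """Count requests and refuse new ones once the daily limit is reached.

    ASSUMPTION: the quota resets when the local calendar date changes.
    """

    def __init__(self, limit=1000, today=datetime.date.today):
        self.limit = limit
        self._today = today  # injectable so the reset logic can be tested
        self._day = today()
        self.request_count = 0

    def try_acquire(self):
        """Return True and count one request, or False if the budget is spent."""
        current_day = self._today()
        if current_day != self._day:
            # A new day has started: refresh the counter.
            self._day = current_day
            self.request_count = 0
        if self.request_count >= self.limit:
            return False
        self.request_count += 1
        return True


def save_checkpoint(path, next_page):
    """Record the page to fetch next (hypothetical JSON checkpoint file)."""
    Path(path).write_text(json.dumps({"next_page": next_page}))


def load_checkpoint(path):
    """Return the saved page, or 0 when no checkpoint exists yet."""
    checkpoint = Path(path)
    if not checkpoint.exists():
        return 0
    return json.loads(checkpoint.read_text())["next_page"]
```

A script would call `try_acquire()` before each request, save the checkpoint after each successfully processed page, and on the next run start from `load_checkpoint(...)`.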
mathemancer commented on Mon Jun 15 2020:
@akshgpt7 I suggest using the DelayedRequester class with a delay of 87 seconds. This will keep the overall number of requests under the limit.
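The 87-second figure follows from the daily limit: a day has 86,400 seconds, so 1000 evenly spaced requests need at least 86.4 seconds between them, which rounds up to 87. The sketch below reproduces that arithmetic and shows the general idea of spacing calls by a fixed delay. `SimpleDelayedCaller` is a hypothetical stand-in written for this illustration; it is not the project's DelayedRequester class, whose actual interface is not shown in this thread.

```python
import math
import time

SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def min_delay_seconds(requests_per_day):
    """Smallest whole-second delay that keeps evenly spaced calls under the limit."""
    return math.ceil(SECONDS_PER_DAY / requests_per_day)


class SimpleDelayedCaller:
    """Hypothetical helper: enforce a minimum delay between successive calls."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self._clock = clock  # injectable for testing
        self._sleep = sleep
        self._last_call = None

    def call(self, func, *args, **kwargs):
        """Wait out the remaining delay, if any, then invoke func."""
        if self._last_call is not None:
            remaining = self.delay - (self._clock() - self._last_call)
            if remaining > 0:
                self._sleep(remaining)
        self._last_call = self._clock()
        return func(*args, **kwargs)
```

With `delay=min_delay_seconds(1000)`, at most 993 calls fit into 24 hours, and the 10 requests per second limit is never approached. The trade-off is that a full crawl is slow, but the script needs no per-day counter.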
Status: ✅ Done