Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Geograph #1405

Open
1 task
cogdog opened this issue Oct 14, 2022 · 4 comments
Open
1 task

Geograph #1405

cogdog opened this issue Oct 14, 2022 · 4 comments
Labels
💻 aspect: code Concerns the software code in the repository 🌟 goal: addition Addition of new feature 🟩 priority: low Low priority and doesn't need to be rushed 🧱 stack: catalog Related to the catalog and Airflow DAGs

Comments

@cogdog
Copy link

cogdog commented Oct 14, 2022

Source Site

https://www.geograph.org.uk/

Value Provided

Over 7 million photos of places in Ireland and the UK, aiming o cover every piece of land in a grid fashion "project aims to collect geographically representative photographs and information for every square kilometre of Great Britain and Ireland" they have an API

Licenses Provided

CC BY-SA required

Implementation

  • 🙋 I would be interested in implementing this feature.
@cogdog cogdog added 🚦 status: awaiting triage Has not been triaged & therefore, not ready for work 🧹 status: ticket work required Needs more details before it can be worked on labels Oct 14, 2022
@AetherUnbound
Copy link
Contributor

@obulat had mentioned we have some of this data in Openverse already, possibly via commoncrawl? Example: https://wordpress.org/openverse/search/image/?q=Cat&source=geographorguk

But given that they have an API, it would be great to add an ingester for it!

@rwidom
Copy link
Collaborator

rwidom commented Oct 14, 2022

Yes! And they even have tsv dumps if we wanted to go the SQL route.

@cogdog
Copy link
Author

cogdog commented Oct 14, 2022

Plus a German version! http://geo-en.hlipp.de/

@obulat
Copy link
Contributor

obulat commented Oct 15, 2022

I didn't notice that we do not ingest the new data from Geograph. It is an amazing source! It would be great to add the TSV loaders for the UK and the German versions.

@dhruvkb dhruvkb added 🟩 priority: low Low priority and doesn't need to be rushed 🌟 goal: addition Addition of new feature and removed 🧹 status: ticket work required Needs more details before it can be worked on 🚦 status: awaiting triage Has not been triaged & therefore, not ready for work labels Oct 17, 2022
@krysal krysal added the 💻 aspect: code Concerns the software code in the repository label Nov 18, 2022
@obulat obulat added the 🧱 stack: catalog Related to the catalog and Airflow DAGs label Feb 23, 2023
@obulat obulat transferred this issue from WordPress/openverse-catalog Apr 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
💻 aspect: code Concerns the software code in the repository 🌟 goal: addition Addition of new feature 🟩 priority: low Low priority and doesn't need to be rushed 🧱 stack: catalog Related to the catalog and Airflow DAGs
Projects
Status: 📋 Backlog
Development

No branches or pull requests

6 participants