-
conciliator Public
Forked from codeforkjeff/conciliatorOpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
Java GNU General Public License v3.0 UpdatedFeb 19, 2025 -
openlibrary Public
Forked from internetarchive/openlibraryOne webpage for every book ever published!
-
ia-web-commons Public
Forked from commoncrawl/ia-web-commonsWeb archiving utility library
Java Apache License 2.0 UpdatedFeb 12, 2025 -
-
OpenRefine Public
Forked from OpenRefine/OpenRefineJava BSD 3-Clause "New" or "Revised" License UpdatedNov 8, 2024 -
simile-butterfly Public
Automatically exported from code.google.com/p/simile-butterfly
-
simile-vicino Public
Forked from OpenRefine/simile-vicinoAutomatically exported from code.google.com/p/simile-vicino
Java Other UpdatedSep 21, 2024 -
dmgbuild Public
Forked from dmgbuild/dmgbuildmacOS command line utility to build disk images
Python MIT License UpdatedSep 21, 2024 -
openrefine-sample-extension Public template
Forked from OpenRefine/sample-extensionOpenRefine sample extension provided for demonstration purposes
JavaScript BSD 3-Clause "New" or "Revised" License UpdatedAug 22, 2024 -
openrefine.org Public
Forked from OpenRefine/openrefine.orgGithub pages repository for OpenRefine account
TypeScript UpdatedJul 26, 2024 -
simile-butterfly-new Public
Forked from OpenRefine/simile-butterflyOpenRefine fork of the MIT Simile Butterfly server - our changes are on branch openrefine
Java Apache License 2.0 UpdatedMay 30, 2024 -
pdf2table Public
PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz
-
simile-vicino-original Public
Automatically exported from code.google.com/p/simile-vicino
-
webpy Public
Forked from webpy/webpyweb.py is a web framework for python that is as simple as it is powerful.
Python Other UpdatedFeb 16, 2024 -
-
-
cellranger Public
Forked from 10XGenomics/cellranger10x Genomics Single Cell Analysis
Rust Other UpdatedNov 10, 2023 -
-
tessdata_fast Public
Forked from tesseract-ocr/tessdata_fastFast integer versions of trained LSTM models
Apache License 2.0 UpdatedOct 25, 2023 -
cjworkbench Public
Forked from CJWorkbench/cjworkbenchThe data journalism platform with built in training
Python Other UpdatedOct 22, 2023 -
OpenRefineFlatpak Public
Forked from mbugni/OpenRefineFlatpak -
warc-specifications Public
Forked from iipc/warc-specificationsCentralised repository for WARC usage specifications.
HTML UpdatedSep 2, 2023 -
surt Public
Forked from internetarchive/surtSort-friendly URI Reordering Transform (SURT) python module
Python GNU Affero General Public License v3.0 UpdatedAug 28, 2023 -
mreid-resolver Public
Forked from AngryLoki/mreid-resolverTool for showing Freebase and Google Knowledge Graph entries
Svelte MIT License UpdatedAug 3, 2023 -
archive-pdf-tools Public
Forked from internetarchive/archive-pdf-toolsFast PDF generation and compression. Deals with millions of pages daily.
Python GNU Affero General Public License v3.0 UpdatedJul 13, 2023 -
pattypan Public
Forked from yarl/pattypanUpload files to Wikimedia Commons. The Spreadsheet Way.
Java MIT License UpdatedJul 11, 2023 -
openrefineder Public
Forked from betatim/openrefineder💠 + 📚 OpenRefine on Binder!
Jupyter Notebook Other UpdatedJul 2, 2023 -
pylsd Public
Forked from OCR-D/pylsdpython bindings for LSD - Line Segment Detector.
C++ Other UpdatedJun 27, 2023 -
-
Data-Quality-Rule-Engine Public
Forked from microsoft/Data-Quality-Rule-EngineScala Other UpdatedApr 21, 2023