Skip to content
Change the repository type filter

All

    Repositories list

    • Seeder

      Public
      Seeder - Czech webarchive curating tool and public site
      Python
      MIT License
      215282Updated Oct 25, 2024Oct 25, 2024
    • Aplikace slouží jako automatizované řešení pro identifikaci a popis mrtvých webů. Následně je ukládá do vlastní databáze a zpřístupňuje kurátorům, kteří s informacemi v ní dále nakládají, interpretují je a obsah klasifikují.
      PHP
      0240Updated Aug 28, 2024Aug 28, 2024
    • pywb

      Public
      Nový věk zpřístupnění českého webového archivu.
      Shell
      0081Updated Jul 28, 2024Jul 28, 2024
    • Continuous heritrix shell suite (CHSS)
      Shell
      GNU General Public License v3.0
      1000Updated Feb 6, 2024Feb 6, 2024
    • Dokumentace k Databázi mrtvých webových zdrojů.
      0000Updated Dec 19, 2023Dec 19, 2023
    • WebBEAT

      Public
      WebBEAT website data extractor
      Shell
      GNU General Public License v3.0
      2000Updated Nov 22, 2023Nov 22, 2023
    • WA-KAT

      Public
      Catalogization tool for the czech webarchive.
      JavaScript
      MIT License
      02202Updated Jul 19, 2023Jul 19, 2023
    • naki

      Public
      NAKI informační stránka
      HTML
      1100Updated Jul 19, 2023Jul 19, 2023
    • grainery

      Public
      Keeping knowledge about harvested ARC/WARCs and related files such as logs, CDX files etc.
      HTML
      MIT License
      0004Updated May 1, 2023May 1, 2023
    • User documentation for WACloud
      GNU General Public License v3.0
      1000Updated Feb 7, 2023Feb 7, 2023
    • Analytical component of WACloud
      Python
      0000Updated Jan 18, 2023Jan 18, 2023
    • WACloud

      Public
      Centralised interface for Webarchive data extraction and analysis
      TypeScript
      GNU General Public License v3.0
      1000Updated Jan 4, 2023Jan 4, 2023
    • WARC Export application
      Python
      GNU General Public License v3.0
      0000Updated Dec 28, 2022Dec 28, 2022
    • WebArchiv.cz crawler configuration.
      2400Updated May 3, 2021May 3, 2021
    • Katalogizační manuál pro popis elektronických online zdrojů ve formátu MARC 21 podle pravidel RDA
      CSS
      MIT License
      0000Updated Mar 13, 2020Mar 13, 2020
    • Custom Elasticsearch for webarchiv.cz services.
      Dockerfile
      0000Updated Apr 23, 2019Apr 23, 2019
    • Webarchiv's Wayback Machine
      Java
      11100Updated Nov 2, 2018Nov 2, 2018
    • WWW

      Public archive
      Legacy version of website of Czech web archive
      HTML
      1000Updated Nov 1, 2017Nov 1, 2017
    • CSS
      0000Updated Jun 21, 2017Jun 21, 2017
    • machines

      Public
      Code for web archiving infrastructure
      Ruby
      0030Updated Feb 16, 2017Feb 16, 2017
    • Openwayback CDX Server build
      0000Updated Feb 9, 2016Feb 9, 2016
    • wa-tools

      Public
      Scripts for managing web archive
      Shell
      0300Updated Jan 7, 2016Jan 7, 2016
    • va2mods

      Public archive
      XSLT
      0000Updated Jan 13, 2014Jan 13, 2014
    • WA-Admin

      Public
      PHP
      Other
      0310Updated Jul 11, 2013Jul 11, 2013
    • WAmetadataHarvest

      Public archive
      Project focused on harvesting metadata from Heritrix logs/archives and WA-admin tool.
      PHP
      1200Updated Dec 19, 2012Dec 19, 2012
    • webanalyzer

      Public archive
      webanalyzer
      Java
      GNU General Public License v3.0
      0200Updated Apr 26, 2012Apr 26, 2012
    • WadminKonspekt

      Public archive
      Java
      Other
      0000Updated Apr 24, 2012Apr 24, 2012
    • Opensearch to SRU/SRW gate
      Java
      Other
      0100Updated Apr 24, 2012Apr 24, 2012
    • WAHarvester

      Public archive
      Tool for managing crawls with Heritrix and WAdmin tool.
      Groovy
      Other
      0300Updated Feb 5, 2012Feb 5, 2012