earkweb is a repository for archiving digital objects. It offers basic functions for ingest, management and dissemination of information packages.
earkweb consists of a frontend web application together with a task execution system based on Celery which allows synchronous and asynchronous processing of information packages by means of processing units which are called “tasks”.
The following diagram illustrates the component architecture.
The user interface represented by the box on top of the diagram is a Python/Django-based web application which supports creation, management and exploration of information packages. Tasks can be assigned to Celery workers (green boxes with a "C") which share the same storage area and the result of the package transformation is stored in the information package’s working directory based on files. Full-text content included in information packages is indexed by SolR. A ResourceSync interface exposes the changelist of information packages managed by the repository.