Retry peeruserimport task on Database or connection errors #13821

base: develop
Conversation
Branch updated from 4e2fbc9 to 236f654
I think we can maintain the current separation of concerns, and it may be worth the effort of adding a new column to track the retries rather than keeping it in `extra_metadata`.
Adding alembic as a dependency just so we can migrate the SQLAlchemy table feels a bit heavy duty. So perhaps the answer is to clear the jobs table of any finished tasks, dump the remainder to a temporary CSV, clear the table, recreate it, and then reload the data?
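A minimal sketch of what that rebuild could look like with SQLAlchemy Core; the column names, the terminal state values, and the new `retries` column here are illustrative placeholders, not the actual job table schema:

```python
import csv
import tempfile

from sqlalchemy import (
    Column,
    Integer,
    MetaData,
    String,
    Table,
    delete,
    insert,
    select,
)

# Illustrative schema only -- the real jobs table has more columns, and the
# state values and "retries" column name are assumptions for this sketch.
new_metadata = MetaData()
new_jobs = Table(
    "jobs",
    new_metadata,
    Column("id", String, primary_key=True),
    Column("state", String),
    Column("retries", Integer, nullable=False, server_default="0"),
)

FINISHED_STATES = ("COMPLETED", "FAILED", "CANCELED")  # assumed terminal states


def recreate_jobs_table(engine):
    """Dump unfinished jobs to a temp CSV, rebuild the table, and reload the rows."""
    old_jobs = Table("jobs", MetaData(), autoload_with=engine)

    with engine.begin() as conn:
        # 1. Clear finished jobs; only pending/running rows need to survive.
        conn.execute(delete(old_jobs).where(old_jobs.c.state.in_(FINISHED_STATES)))
        rows = conn.execute(select(old_jobs)).mappings().all()

    # 2. Dump the remainder to a temporary CSV.
    with tempfile.NamedTemporaryFile("w", newline="", suffix=".csv", delete=False) as f:
        csv_path = f.name
        if rows:
            writer = csv.DictWriter(f, fieldnames=list(rows[0].keys()))
            writer.writeheader()
            writer.writerows([dict(r) for r in rows])

    # 3. Drop the old table and recreate it with the new column in place.
    old_jobs.drop(engine)
    new_metadata.create_all(engine)

    # 4. Reload; columns missing from the CSV (e.g. "retries") fall back to defaults.
    with engine.begin() as conn, open(csv_path, newline="") as f:
        for row in csv.DictReader(f):
            conn.execute(
                insert(new_jobs).values({k: v for k, v in row.items() if k in new_jobs.c})
            )
```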
permission_classes=None,
long_running=False,
status_fn=None,
retry_on=None,
Good job avoiding a classic Python gotcha! Passing mutable values as default arguments, such as `[]`, is a very common mistake that can cause issues.
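For anyone following along, the gotcha in question looks like this (a generic illustration, not code from this PR):

```python
def register_bad(name, retry_on=[]):  # the default list is created once, at definition time
    retry_on.append(name)
    return retry_on


def register_good(name, retry_on=None):  # a None sentinel gets a fresh list per call
    if retry_on is None:
        retry_on = []
    retry_on.append(name)
    return retry_on


print(register_bad("a"))   # ['a']
print(register_bad("b"))   # ['a', 'b']  <- state leaks between calls
print(register_good("a"))  # ['a']
print(register_good("b"))  # ['b']
```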
total_progress=0,
result=None,
long_running=False,
retry_on=None,
I feel like we don't need to store this in the job object - we're not allowing this to be customized per job, only per task - so I think we can just reference this from the task itself, rather than having to pass it in at job initialization. This also saves us having to coerce the exception classes to import paths.
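A rough sketch of that shape, with `RegisteredTask` and `JobRegistry` as hypothetical stand-ins for the real registry, just to show the lookup going through the task rather than the job:

```python
# Hypothetical sketch: retry_on lives only on the registered task, so the job
# never has to serialize exception classes into import paths.
class RegisteredTask:
    def __init__(self, func, retry_on=None):
        self.func = func
        self.retry_on = tuple(retry_on or ())


class JobRegistry:
    tasks = {}

    @classmethod
    def register(cls, func, retry_on=None):
        cls.tasks[func.__name__] = RegisteredTask(func, retry_on=retry_on)
        return func


def exception_is_retryable(func_name, exception):
    """Look the retryable exceptions up from the task registration, not the job row."""
    task = JobRegistry.tasks.get(func_name)
    return task is not None and isinstance(exception, task.retry_on)


def sync_users():
    raise ConnectionError("peer unreachable")


JobRegistry.register(sync_users, retry_on=[ConnectionError, TimeoutError])
assert exception_is_retryable("sync_users", ConnectionError("peer unreachable"))
```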
)
setattr(current_state_tracker, "job", None)

def should_retry(self, exception):
I think I'd rather defer all this logic to the `reschedule_finished_job_if_needed` method on the storage class, rather than having it in the job class.
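Roughly, the storage method would then own both the check and the bookkeeping. This is only a sketch of the shape; the retry cap, the `error_retries` attribute, and the requeue step are stand-ins, not the real storage internals:

```python
import logging

logger = logging.getLogger(__name__)

MAX_RETRIES = 3  # assumed default cap so a failing job cannot requeue itself forever


class Storage:
    """Illustrative stand-in for the jobs storage class; only the retry decision is shown."""

    def __init__(self):
        self.queue = []

    def reschedule_finished_job_if_needed(self, job, task, exception=None):
        """Requeue a failed job when its registered task declares the exception retryable."""
        retryable = tuple(getattr(task, "retry_on", ()) or ())
        if exception is None or not isinstance(exception, retryable):
            return False
        if getattr(job, "error_retries", 0) >= MAX_RETRIES:
            logger.info("Job %s exhausted its retries", job.job_id)
            return False
        job.error_retries = getattr(job, "error_retries", 0) + 1
        self.queue.append(job)  # stand-in for resetting state to QUEUED and persisting
        return True
```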
def should_retry(self, exception):
    retries = self.extra_metadata.get("retries", 0) + 1
    self.extra_metadata["retries"] = retries
I am a bit iffy about using `extra_metadata` for tracking this - if we want to hack the existing schema, `repeat` is probably a better place for it, but I wonder if instead we should add an `error_retries` column to the job table schema, so that we can put a sensible default in place for failing tasks and they don't endlessly repeat.
I also think I'd rather have the retry interval defined by the task registration (we could also set a sensible default if retryable exceptions are set).
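Concretely, the registration could carry both knobs with defaults, something along these lines (the parameter names and default values are suggestions, not the PR's current API):

```python
DEFAULT_ERROR_RETRIES = 3    # assumed cap, stored per job in a new error_retries column
DEFAULT_RETRY_INTERVAL = 30  # seconds; assumed default once retryable exceptions are set


def register_task(func=None, retry_on=None, retry_interval=None):
    """Decorator sketch: the retry policy lives with the task registration, not the job."""

    def decorator(fn):
        fn.retry_on = tuple(retry_on or ())
        # Only give a retry interval to tasks that actually declare retryable exceptions.
        if retry_interval is not None:
            fn.retry_interval = retry_interval
        else:
            fn.retry_interval = DEFAULT_RETRY_INTERVAL if fn.retry_on else None
        fn.max_error_retries = DEFAULT_ERROR_RETRIES
        return fn

    return decorator(func) if func is not None else decorator
```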
Summary

Adds a `retry_on` argument to the `@task` decorator to specify a list of potential non-deterministic exceptions; if the job failed because of one of them, it can be retried.
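A minimal illustration of the call shape (the decorator body below is a stand-in for the real `@task`, and `peeruserimport`'s signature and exception list are invented for the example):

```python
from sqlite3 import OperationalError


def task(retry_on=None):
    """Stand-in for the real @task decorator, just to show how retry_on is passed."""

    def decorator(fn):
        fn.retry_on = tuple(retry_on or ())
        return fn

    return decorator


@task(retry_on=[OperationalError, ConnectionError])
def peeruserimport(peer_id):
    """Import users from a peer device; requeued if one of the listed exceptions is raised."""
```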
References

Closes #11836.
Reviewer guidance