Improve error message for duplicate pipeline run names #3701

strickvl · 2025-05-24T17:30:07Z

Summary

- Improve the error message when users try to run a pipeline with a duplicate run_name
- Handle duplicate run name errors directly in SQLZenStore where they occur
- Provide a generic, user-friendly error message with actionable solutions

Problem

When users run a pipeline with a duplicate run name (whether from a config file, programmatically, or any other method), they get a confusing database error. The error can come in two forms:

A raw SQL IntegrityError (when using REST API):

RuntimeError: (pymysql.err.IntegrityError) (1062, "Duplicate entry 'test_run_name-6e23c0466cc4411c8b9f75f0c8a1a818' for key 'pipeline_run.unique_run_name_in_project'")

An EntityExistsError with a technical message

Solution

This PR catches IntegrityError in SQLZenStore's _create_run method and provides a much more helpful error message with actionable solutions.
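The catch-and-translate idea described above can be illustrated with a small, self-contained sketch. It is not the ZenML implementation: it uses the standard-library `sqlite3` module in place of SQLAlchemy and MySQL, and `DuplicateRunNameError`, `create_run`, and the two-column table are hypothetical names chosen for illustration.

```python
import sqlite3


class DuplicateRunNameError(Exception):
    """Hypothetical stand-in for a user-facing 'entity exists' error."""


def create_run(conn: sqlite3.Connection, project: str, name: str) -> None:
    """Insert a run and translate a unique-constraint failure into a clear message."""
    try:
        # Used as a context manager, the connection commits on success
        # and rolls back if the block raises.
        with conn:
            conn.execute(
                "INSERT INTO pipeline_run (project, name) VALUES (?, ?)",
                (project, name),
            )
    except sqlite3.IntegrityError as e:
        # The only constraint on this toy table is the unique run name,
        # so any IntegrityError here means a duplicate name.
        raise DuplicateRunNameError(
            f"Pipeline run name '{name}' already exists in this project. "
            "Each pipeline run must have a unique name."
        ) from e


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE pipeline_run (project TEXT, name TEXT, UNIQUE (project, name))"
)
create_run(conn, "default", "my_run")
try:
    create_run(conn, "default", "my_run")
except DuplicateRunNameError as err:
    print(err)
```

In a real schema with several constraints, the handler would need to confirm that the violated constraint is the run-name one before rewriting the message.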

Before (Raw SQL Error)

RuntimeError: (pymysql.err.IntegrityError) (1062, "Duplicate entry 'my_run-6e23c0466cc4411c8b9f75f0c8a1a818' for key 'pipeline_run.unique_run_name_in_project'")
[SQL: INSERT INTO pipeline_run ...]

After (User-Friendly Error)

Pipeline run name 'my_run' already exists in this project. Each pipeline run must have a unique name.

To fix this, you can:
1. Use a different run name
2. Use a dynamic run name with placeholders like: "my_run_{date}_{time}"
3. Remove the run name from your configuration to auto-generate unique names

For more information on run naming, see: https://docs.zenml.io/concepts/steps_and_pipelines/yaml_configuration#run-name
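As an illustration of why option 2 avoids collisions, here is a minimal sketch of placeholder substitution. The function `format_run_name` is hypothetical, and the `{date}` and `{time}` formats below are assumptions made for this example rather than ZenML's confirmed behaviour.

```python
from datetime import datetime, timezone
from typing import Optional


def format_run_name(template: str, now: Optional[datetime] = None) -> str:
    """Fill {date} and {time} placeholders in a run name template."""
    now = now or datetime.now(timezone.utc)
    return template.format(
        date=now.strftime("%Y_%m_%d"),     # assumed date format
        time=now.strftime("%H_%M_%S_%f"),  # assumed time format, with microseconds
    )


fixed = datetime(2025, 5, 24, 17, 30, 7, 123456, tzinfo=timezone.utc)
print(format_run_name("my_run_{date}_{time}", fixed))
# → my_run_2025_05_24_17_30_07_123456
```

A template without placeholders, such as `"my_run"`, comes back unchanged, which is exactly the case that produces a duplicate name on the second run.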

Changes Made

- Enhanced error handling in SQLZenStore's _create_run() method to catch IntegrityError and provide user-friendly messages
- Moved error handling from run_utils.py to SQLZenStore for better architecture (database errors handled at database layer)
- Made error message more generic - removed specific mention of "config file" since run names can be set in multiple ways
- Simplified run_utils.py by removing 40+ lines of redundant error handling code
- Updated unit tests to reflect the new error handling location
- Updated documentation with clearer examples and warnings about run name uniqueness

Test Plan

- Unit tests verify the improved error message is shown
- Unit tests pass successfully after refactoring
- All existing tests continue to pass
- Mypy type checking passes
- Formatting and linting checks pass

🤖 Generated with Claude Code

coderabbitai · 2025-05-24T17:30:13Z

Important

Review skipped

Auto reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.


github-actions · 2025-05-24T17:31:27Z

Documentation Link Check Results

✅ Absolute links check passed
✅ Relative links check passed
Last checked: 2025-05-26 19:05:13 UTC

Copilot

Pull Request Overview

This PR enhances the user experience by providing clearer guidance when a pipeline run name conflict occurs, along with tests and documentation to support the change.

- Enhanced error handling in create_placeholder_run to catch duplicate run-name errors and surface a helpful, actionable message.
- Added unit tests to verify the new error message and ensure other EntityExistsError cases remain unchanged.
- Updated docs to warn about run-name uniqueness and suggest best practices.

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 2 comments.

| File | Description |
| --- | --- |
| src/zenml/pipelines/run_utils.py | Improved duplicate run-name error detection and messaging |
| tests/unit/pipelines/test_run_utils.py | Added tests for duplicate run-name error and non-duplicate behavior |
| docs/book/how-to/steps-pipelines/yaml_configuration.md | Warn about unique run names and provide guidance in YAML examples |
| run_alerter_tests.sh | New script to run alerter tests (scope seems unrelated) |

Comments suppressed due to low confidence (1)

run_alerter_tests.sh:1

[nitpick] This script for running alerter tests appears unrelated to the pipeline run-name improvements. Consider moving it to a separate PR or isolating it under a more relevant feature grouping to keep this change focused.

#!/bin/bash

src/zenml/pipelines/run_utils.py

docs/book/how-to/steps-pipelines/yaml_configuration.md

When users run a pipeline with a fixed `run_name` in their config.yaml and then rerun the same pipeline, they would get a confusing database error about entity existence. This change catches both EntityExistsError and RuntimeError (with IntegrityError) specifically for duplicate run names and provides a much more helpful error message.

## Changes

- Add improved error handling in `create_placeholder_run()` to catch duplicate run name errors (both EntityExistsError and raw SQL IntegrityError)
- Provide actionable guidance with 3 specific solutions:
  1. Change the run_name to a unique value
  2. Use dynamic placeholders like `run_name: "my_run_{date}_{time}"`
  3. Remove the run_name to auto-generate unique names
- Add comprehensive unit tests to verify the improved error message
- Update documentation in yaml_configuration.md to warn about run name uniqueness

## User Experience

Instead of seeing confusing database errors, users now get:

```
Pipeline run name 'my_run_name' already exists in this project. Each pipeline run must have a unique name.

To fix this, you can:
1. Change the 'run_name' in your config file to a unique value
2. Use a dynamic run name with placeholders like: run_name: "my_run_name_{date}_{time}"
3. Remove the 'run_name' from your config to auto-generate unique names

For more information on run naming, see: https://docs.zenml.io/concepts/steps_and_pipelines/yaml_configuration#run-name
```

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

As suggested in PR review, this commit adds clearer YAML comments to the run_name placeholder examples to make it more obvious what each example demonstrates.

- Use TYPE_CHECKING to handle the optional sqlalchemy import properly
- Rename to SQLIntegrityError to avoid confusion with other exceptions
- This ensures mypy doesn't complain about assigning None to a type
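One common way to express an optional import like the one this commit describes is sketched below. It is a guess at the general pattern, not the code from the commit; the helper `is_sql_integrity_error` is hypothetical.

```python
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Type checkers only see this branch, so the name always has a class type.
    from sqlalchemy.exc import IntegrityError as SQLIntegrityError
else:
    try:
        from sqlalchemy.exc import IntegrityError as SQLIntegrityError
    except ImportError:
        # sqlalchemy is not installed at runtime; fall back to None.
        SQLIntegrityError = None


def is_sql_integrity_error(error: BaseException) -> bool:
    """Hypothetical helper: True only if sqlalchemy is available and the error matches."""
    return SQLIntegrityError is not None and isinstance(error, SQLIntegrityError)


print(is_sql_integrity_error(ValueError("unrelated")))
```

The alias keeps the SQLAlchemy exception visually distinct from other `IntegrityError` classes, such as the one raised by the database driver itself.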

Copilot

Pull Request Overview

This PR enhances error handling for duplicate pipeline run names by catching both ZenML and raw SQL errors, improving the user-facing message, adding unit tests, and updating the documentation with guidance on unique run names.

- Catch and re-raise EntityExistsError with a friendlier, actionable message when run names collide
- Detect raw SQL integrity errors (duplicate entry) and convert them to the same improved error
- Add unit tests covering both error pathways and update docs with run name best practices

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

| File | Description |
| --- | --- |
| tests/unit/pipelines/test_run_utils.py | New tests for duplicate-name errors and preservation of other errors |
| src/zenml/pipelines/run_utils.py | Catch both `EntityExistsError` and raw SQL duplicates; build improved error message |
| docs/book/how-to/steps-pipelines/yaml_configuration.md | Warning section added explaining unique run name guidelines |

src/zenml/pipelines/run_utils.py

docs/book/how-to/steps-pipelines/yaml_configuration.md

- Change broad Exception catch to specific RuntimeError
- Add parentheses for clarity in boolean logic
- Align documentation wording with error message

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

schustmi · 2025-05-26T07:53:21Z

src/zenml/pipelines/run_utils.py

-    run, _ = Client().zen_store.get_or_create_run(run_request)
-    return run
+
+    try:


This should be handled in the SQLZenStore, not in this random place (which is only one occurrence where a run is created).

Specifying the run name in a config file is not the only way to do it; the message can simply be generic and talk about configuration instead of files.

…or-message

- Moved error handling from run_utils.py to sql_zen_store.py where it architecturally belongs
- Database-specific error handling now stays in the database layer
- Made error message more generic (removed specific mention of 'config file')
- Simplified run_utils.py by removing 40+ lines of error handling code
- Updated tests to reflect the new error handling location
- All code paths that create runs now benefit from improved error messages

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

Remove EntityExistsError from Raises section since this function no longer explicitly raises exceptions - they are now handled in SQLZenStore.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>

schustmi · 2025-05-26T18:13:53Z

src/zenml/zen_stores/sql_zen_store.py

            # We have to rollback the failed session first in order to
            # continue using it
            session.rollback()
+
+            # Check if this is a duplicate run name error


Actually this error should already be caught as part of the method call below your changes, which verifies the name uniqueness and raises an EntityExistsError already.

I could reproduce the weird error message that you were getting, though. It comes from SQLAlchemy's autoflush behaviour and, I think, does not happen as part of this commit().

I managed to solve it by wrapping the code inside the if pipeline_run.logs is not None with a contextmanager as follows:

    if pipeline_run.logs is not None:
        with session.no_autoflush:
            ...

    try:
        ...

If you think the error message is still too unclear after this change, I think the best way would be to add an option for a custom error message in the verify_name_uniqueness method.
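The custom-message option suggested here could look roughly like the sketch below. It is only an illustration of the idea: the real method works against the database and its signature is not shown in this thread, so the parameters, the in-memory set of names, and the local `EntityExistsError` class are all assumptions.

```python
from typing import Optional, Set


class EntityExistsError(Exception):
    """Local stand-in for the store's 'entity already exists' error."""


def verify_name_uniqueness(
    name: str,
    existing_names: Set[str],
    custom_error_message: Optional[str] = None,
) -> None:
    """Raise if the name is taken, preferring a caller-supplied message."""
    if name in existing_names:
        default_message = f"An entity with the name '{name}' already exists."
        raise EntityExistsError(custom_error_message or default_message)


# Usage: the run-creation path passes a run-specific, actionable message.
taken = {"my_run"}
try:
    verify_name_uniqueness(
        "my_run",
        taken,
        custom_error_message=(
            "Pipeline run name 'my_run' already exists in this project. "
            "Each pipeline run must have a unique name."
        ),
    )
except EntityExistsError as err:
    print(err)
```

Callers that do not pass a message keep the generic wording, so other entity types would be unaffected by such a change.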

schustmi · 2025-05-26T18:15:41Z

tests/unit/pipelines/test_run_utils.py

+
+
+@patch("zenml.pipelines.run_utils.Client")
+def test_create_placeholder_run_duplicate_name_error(mock_client):


This doesn't seem to really test anything, other than that the mocking library works. If anything, I think an integration test that actually runs a pipeline twice with the same name is the best test for this.

strickvl added the enhancement and internal labels May 24, 2025

strickvl requested a review from schustmi May 24, 2025 17:30

strickvl requested a review from Copilot May 24, 2025 17:30

strickvl force-pushed the feature/better-error-message branch from b38ae4f to 32b4a7a Compare May 24, 2025 17:31

Copilot AI reviewed May 24, 2025

strickvl force-pushed the feature/better-error-message branch from 32b4a7a to c28564c Compare May 24, 2025 20:11

strickvl added 2 commits May 24, 2025 22:16

Add more specific examples to run_name documentation

0b903f7


Fix mypy type errors for IntegrityError import

659288c


strickvl requested a review from Copilot May 24, 2025 20:25

Copilot AI reviewed May 24, 2025

schustmi requested changes May 26, 2025

strickvl and others added 3 commits May 26, 2025 09:59

Merge remote-tracking branch 'origin/develop' into feature/better-err…

22bc668

…or-message

strickvl requested a review from schustmi May 26, 2025 11:41

schustmi requested changes May 26, 2025

Merge branch 'develop' into feature/better-error-message

508a616
