Fix Airbyte pipeline filtering to use connection names instead of UUIDs #23673
Problem
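Airbyte metadata ingestion pipeline filters worked only with internal UUIDs (connectionId), not with human-readable connection names. This made regex-based filtering impractical: users had to extract UUIDs from Airbyte connection URLs, and a pattern written against UUIDs did not carry over between environments, because UUIDs change while names stay stable.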
Example of the Issue
Solution
Changed the pipeline entity creation to use the connection name (what users see in the UI) instead of the connectionId (internal UUID).
Before:
After:
Benefits
✅ Users can now filter using intuitive patterns:
- MSSQL.* matches all pipelines starting with "MSSQL"
- .*Postgres matches all pipelines ending with "Postgres"
- Production.* matches all production pipelines
✅ No need to extract UUIDs from Airbyte connection URLs
✅ Entity name matches what users see in OpenMetadata UI
✅ Filtering works consistently across environments (names are stable, UUIDs change)
✅ Aligns with how other pipeline sources handle naming (Fivetran, Databricks, Airflow)
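The effect on filtering can be illustrated with a minimal sketch. It is not the ingestion code from this PR: the connection names and UUIDs are made up, `matching_names` is a hypothetical helper, and Python's `re.match` is used as a stand-in for the filter logic, which may differ in the actual ingestion framework.

```python
import re

# Hypothetical Airbyte connections: connectionId (UUID) -> connection name.
# These values are invented for illustration only.
connections = {
    "a1b2c3d4-0000-4000-8000-000000000001": "MSSQL to Snowflake",
    "a1b2c3d4-0000-4000-8000-000000000002": "MSSQL to Postgres",
    "a1b2c3d4-0000-4000-8000-000000000003": "Production Salesforce Sync",
}


def matching_names(include_pattern: str, names: list[str]) -> list[str]:
    """Return the names that an include-style regex would keep.

    Assumption: patterns are anchored at the start of the name,
    which is what re.match does.
    """
    return [name for name in names if re.match(include_pattern, name)]


uuid_based = list(connections.keys())    # entity names before the change
name_based = list(connections.values())  # entity names after the change

# A name-oriented pattern finds nothing when entities are named by UUID...
print(matching_names("MSSQL.*", uuid_based))
# ...but selects the expected pipelines when entities use connection names.
print(matching_names("MSSQL.*", name_based))
print(matching_names("Production.*", name_based))
```

With UUID-based entity names, the only way to select the two MSSQL pipelines would be to list their UUIDs explicitly. With connection names, a single pattern covers them.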
Backward Compatibility
Existing Airbyte pipelines will retain their UUID-based names. After upgrading:
Testing
Fixes #[issue-number]