(843-update-lookup-with-train-method #854

mborodii-prog · 2025-12-27T11:45:43Z

Update Lookup (with train method)

Overview

This pull request introduces support for an action parameter to the write method for training lookup models, allowing more granular control with INSERT, UPDATE, and UPSERT operations. The implementation includes robust error handling and validation for these actions, and comprehensive tests have been added to ensure correct behavior and coverage of edge cases.

Feature: Action parameter for lookup model training

Added action parameter to the write method in wrangles/connectors/train.py, supporting INSERT, UPDATE, and UPSERT actions for lookup models. This enables explicit control over whether to create, update, or upsert models.
Implemented logic for each action:
- UPSERT: Updates existing models or creates new ones, handling duplicate keys and merging data.
- UPDATE: Updates only existing records in a model, with validation for presence of keys.
- INSERT: Creates a new model, with checks for duplicate model names and unique keys.

Added error handling for invalid combinations of parameters and unsupported actions.
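The three actions described above can be illustrated with a minimal in-memory sketch. This is not the code in wrangles/connectors/train.py: the function name `apply_action`, the row format (a list of dicts), and the `Key` column name are assumptions chosen for illustration only.

```python
from typing import Dict, List, Optional

Row = Dict[str, str]


def apply_action(
    existing: Optional[List[Row]],
    incoming: List[Row],
    action: str,
    key: str = "Key",
) -> List[Row]:
    """Hypothetical helper: combine incoming rows with an existing lookup.

    `existing` is None when no model exists yet.
    """
    action = action.upper()
    if action not in {"INSERT", "UPDATE", "UPSERT"}:
        raise ValueError(f"Unsupported action: {action}")
    if any(key not in row for row in incoming):
        raise ValueError(f"Every row must contain the '{key}' column")

    incoming_keys = [row[key] for row in incoming]

    if action == "INSERT":
        # INSERT only creates a new model and requires unique keys.
        if existing is not None:
            raise ValueError("INSERT requires a model that does not exist yet")
        if len(set(incoming_keys)) != len(incoming_keys):
            raise ValueError("INSERT requires unique keys")
        return [dict(row) for row in incoming]

    if action == "UPDATE":
        # UPDATE only touches records that are already present.
        if existing is None:
            raise ValueError("UPDATE requires an existing model")
        merged = {row[key]: dict(row) for row in existing}
        missing = [k for k in incoming_keys if k not in merged]
        if missing:
            raise ValueError(f"Keys not found in model: {missing}")
        for row in incoming:
            merged[row[key]].update(row)
        return list(merged.values())

    # UPSERT: update matching keys and append new ones.
    # A key repeated in `incoming` is merged, with the last row winning.
    merged = {row[key]: dict(row) for row in (existing or [])}
    for row in incoming:
        merged.setdefault(row[key], {}).update(row)
    return list(merged.values())
```

With an existing lookup holding keys `a` and `b`, an UPSERT of rows keyed `b` and `c` would overwrite `b` and append `c`, while the same rows passed with UPDATE would be rejected because `c` is not present.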

Usage Examples

INSERT

  write:
    - train.lookup:
        name: My Lookup Wrangle
        action: INSERT
        variant: key

UPDATE

  write:
    - train.lookup:
        model_id: test-model-id
        action: UPDATE

UPSERT

  write:
  - train.lookup:
      name: ${model_name}
      action: UPSERT
      variant: key

UPSERT (by default)

  write:
  - train.lookup:
      name: ${model_name}
      variant: key

Validation and error handling

Comprehensive error messages and checks for duplicate model names, duplicate keys, missing models, and invalid action parameters, ensuring robust and user-friendly behavior.

Test coverage

Added extensive tests in tests/connectors/test_train.py for all new behaviors:
- Successful insert and upsert operations.
- Handling duplicate model names and duplicate keys.
- Update operations for existing and non-existent models.
- Validation of invalid actions and parameters.

Documentation

Updated schema documentation to include the new action parameter and its possible values, improving clarity for users.

wrangles/connectors/train.py

mborodii-prog · 2026-01-08T15:40:43Z

@thomasstvr @ebhills Please take a look at the final version of the PR:

Implemented logic for each action:
- UPSERT: Updates existing models or creates new ones, handling duplicate keys and merging data.
- UPDATE: Updates only existing records in a model, with validation for presence of keys.
- INSERT: Creates a new model, with checks for duplicate model names and unique keys.
- OVERWRITE: Default logic (agreed with Thomas)

Mocked tests to avoid real model modification
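As a rough illustration of the mocking approach, the sketch below uses the standard library's `unittest.mock` to replace a call that would otherwise modify a real model. The `Backend` class and the `train` function are hypothetical stand-ins, not the names used in tests/connectors/test_train.py.

```python
from unittest import mock


class Backend:
    """Hypothetical stand-in for the service that stores lookup models."""

    def upload(self, model_id, rows):
        raise RuntimeError("This would modify a real model")


def train(backend, model_id, rows):
    """Hypothetical training entry point that delegates to the backend."""
    backend.upload(model_id, rows)
    return len(rows)


def run_mocked_train():
    backend = Backend()
    # Patch the upload method so that no real model is touched.
    with mock.patch.object(backend, "upload") as fake_upload:
        count = train(backend, "test-model-id", [{"Key": "a"}])
    # Return the recorded call so a test can assert on what was sent.
    return count, fake_upload.call_args
```

A test can then assert on the recorded arguments instead of inspecting a live model.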

wrangles/connectors/train.py

thomasstvr · 2026-01-13T17:34:55Z

wrangles/connectors/train.py

+            try:
+                metadata = _data.model(model_id)
+            except Exception as e:
+                raise e


When attempting to train a new model using the name parameter and action set to update, a cryptic error is raised:

RuntimeError: train.lookup - Something went wrong trying to access model None

Passing insert raises a better message, while upsert works. Users should only be able to pass overwrite when creating a new model (that is, when using the name parameter). This can be caught early, and each of the other three actions can share the same error message. Something like:

"{action} not allowed when training a new model"
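One way the early check suggested here might look is sketched below. The function name `validate_action`, its signature, the case-insensitive handling of the action, and the OVERWRITE default are assumptions for illustration; the actual structure of the write method may differ.

```python
from typing import Optional

VALID_ACTIONS = {"INSERT", "UPDATE", "UPSERT", "OVERWRITE"}


def validate_action(
    action: str = "OVERWRITE",
    name: Optional[str] = None,
    model_id: Optional[str] = None,
) -> str:
    """Hypothetical early check, run before any model is accessed."""
    normalized = action.upper()
    if normalized not in VALID_ACTIONS:
        raise ValueError(f"train.lookup - Unsupported action: {action}")

    # Assumption: a new model is being trained when only `name` is given.
    training_new_model = name is not None and model_id is None
    if training_new_model and normalized != "OVERWRITE":
        # INSERT, UPDATE, and UPSERT all share one message.
        raise ValueError(
            f"train.lookup - {normalized} not allowed when training a new model"
        )
    return normalized
```

Under these assumptions, `validate_action("update", name="My Lookup Wrangle")` would fail immediately with the shared message rather than reaching the model lookup and raising the "access model None" error.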

thomasstvr · 2026-01-13T17:36:12Z

tests/connectors/test_train.py


-
-
+    def test_upsert_missing_key_for_key_variant(self, mock_lookup_action_backend):


Let's drop mocker everywhere that a new model is not being trained.

(843-update-lookup-with-train-method

b0e2565

mborodii-prog linked an issue Dec 27, 2025 that may be closed by this pull request

Update Lookup (with train method) #843

Open

mborodii-prog requested review from ebhills and thomasstvr December 27, 2025 11:45

mborodii-prog commented Jan 7, 2026

mborodii-prog added 2 commits January 8, 2026 14:43

843-update-lookup-with-train-method

da6af4a

843-update-lookup-with-train-method

f8ad7e4

843-update-lookup-with-train-method

ba4b741

thomasstvr reviewed Jan 8, 2026

mborodii-prog added 3 commits January 11, 2026 14:59

(843-update-lookup-with-train-method

884a607

843-update-lookup-with-train-method

6eb01bd

(843-update-lookup-with-train-method

5cff43c

thomasstvr reviewed Jan 13, 2026
