Added ML Model Handling and Prophet model support #13

dominiquekleeven · 2025-03-21T09:31:23Z

Closes #8

PR adds the following:

Generic configuration schema for ML Models (Pydantic Model)
- Model uses Pydantic union discriminators to construct the concrete type based on a specified discriminator field.
- Extended configurations can override or add additional properties.
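
To illustrate the discriminator mechanism described above, here is a minimal sketch rather than the PR's actual schema. The class names (`BaseConfig`, `ProphetConfig`, `NaiveConfig`, `ConfigEnvelope`) and the `naive` model type are hypothetical; only the `type` field and the `"prophet"` value come from the sample config in this PR.

```python
from typing import Literal, Union

from pydantic import BaseModel, Field


class BaseConfig(BaseModel):
    # Fields shared by all model types (illustrative subset).
    id: str
    name: str
    enabled: bool = True


class ProphetConfig(BaseConfig):
    # The Literal value is what the discriminator is matched against.
    type: Literal["prophet"]
    weekly_seasonality: bool = True


class NaiveConfig(BaseConfig):
    # Hypothetical second model type, present only to make the union meaningful.
    type: Literal["naive"]
    window: int = 24


class ConfigEnvelope(BaseModel):
    # Pydantic reads the "type" field and validates against the matching class.
    config: Union[ProphetConfig, NaiveConfig] = Field(discriminator="type")


raw = {"id": "demo", "name": "Demo forecast", "type": "prophet"}
parsed = ConfigEnvelope(config=raw).config
print(type(parsed).__name__)
```

An extended configuration only has to declare its own `type` literal plus any extra fields; the shared fields are inherited from the base class.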
Extended configuration schema for Prophet Model
ProphetModelProvider
- Supports training, saving, loading, and forecasting with Prophet models.
- Implements multi-variable training and forecasting.
- Regressors (variables correlated with the predicted variable) have their Y values interpolated (nearest) to match the timestamps of the target variable.
- Future improvement: Make interpolation and missing data handling configurable.
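
The nearest-value alignment of regressors could be sketched as follows. This is a standard-library illustration under assumptions, not the provider's code: `align_nearest` is a hypothetical helper, timestamps are plain numbers, and ties go to the earlier point. With pandas, `merge_asof(..., direction="nearest")` expresses a similar join.

```python
from bisect import bisect_left


def align_nearest(target_timestamps, regressor_points):
    """Return (timestamp, value) pairs on the target's timestamps.

    regressor_points is a list of (timestamp, value) tuples sorted by timestamp.
    Each target timestamp receives the value of the closest regressor point.
    """
    if not regressor_points:
        raise ValueError("regressor has no datapoints to align")
    times = [ts for ts, _ in regressor_points]
    aligned = []
    for ts in target_timestamps:
        i = bisect_left(times, ts)
        # Only the neighbours just before and at/after the insertion point can be nearest.
        candidates = [j for j in (i - 1, i) if 0 <= j < len(times)]
        nearest = min(candidates, key=lambda j: abs(times[j] - ts))
        aligned.append((ts, regressor_points[nearest][1]))
    return aligned


target = [0, 60, 120]
regressor = [(10, 1.0), (100, 2.0)]
print(align_nearest(target, regressor))
```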
Simple provider factory for ModelProvider creation
- Selects the appropriate implementation based on the type field in the config.
- Easily extendable for new providers.
- Scheduler automatically applies the correct implementation based on the config.
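
A registry-based factory is one common way to get this behaviour. The sketch below assumes a dict-shaped config and hypothetical names (`ModelProvider`, `ProphetProvider`, `create_provider`); the real provider interface in this PR may differ.

```python
from typing import Callable, Protocol


class ModelProvider(Protocol):
    def train(self) -> str: ...


class ProphetProvider:
    # Stand-in provider; a real one would fit and persist a Prophet model.
    def __init__(self, config: dict) -> None:
        self.config = config

    def train(self) -> str:
        return f"training prophet model {self.config['id']}"


# Supporting a new model type means adding one entry here.
_PROVIDERS: dict[str, Callable[[dict], ModelProvider]] = {
    "prophet": ProphetProvider,
}


def create_provider(config: dict) -> ModelProvider:
    try:
        factory = _PROVIDERS[config["type"]]
    except KeyError as exc:
        raise ValueError(f"Unsupported model type: {config.get('type')!r}") from exc
    return factory(config)


provider = create_provider({"id": "demo", "type": "prophet"})
print(provider.train())
```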
ModelConfigService – CRUD service for managing model configurations.
ModelStorageService – Handles persistence for serialized ML models.
OpenRemoteDataService – Abstraction over the OpenRemote client that exposes a narrower interface tailored to the ML forecast service.
ModelScheduler
- Manages scheduling for training and forecasting jobs.
- Updates jobs when configurations change.
- Removes stale jobs when configurations are deleted.
- Uses APScheduler.
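
The job bookkeeping described above can be thought of as a reconciliation step between what is currently scheduled and what the configs say. The sketch below shows only that comparison, without APScheduler; `reconcile_jobs` and the idea of comparing a per-config fingerprint are assumptions for illustration.

```python
def reconcile_jobs(scheduled: dict[str, str], configs: dict[str, str]) -> dict[str, list[str]]:
    """Work out which jobs to add, update, or remove.

    scheduled maps job id -> fingerprint of the config the job was created from.
    configs maps config id -> fingerprint of the current config.
    """
    return {
        # Configs without a job yet.
        "add": sorted(configs.keys() - scheduled.keys()),
        # Jobs whose config changed since they were scheduled.
        "update": sorted(k for k in configs.keys() & scheduled.keys() if configs[k] != scheduled[k]),
        # Stale jobs whose config no longer exists.
        "remove": sorted(scheduled.keys() - configs.keys()),
    }


plan = reconcile_jobs(
    scheduled={"a": "v1", "b": "v1", "c": "v1"},
    configs={"a": "v1", "b": "v2", "d": "v1"},
)
print(plan)
```

Each entry in the resulting plan would then map to the scheduler's own add, reschedule, and remove operations.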
Utility classes
- Time handling.
- Filesystem handling.
- Singleton decorator for the Scheduler.
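
A singleton decorator of this kind is often written along the following lines. This is a generic sketch, not the utility added in this PR; `DemoScheduler` is a placeholder class.

```python
import threading
from typing import Any, Callable, TypeVar

T = TypeVar("T")


def singleton(cls: type[T]) -> Callable[..., T]:
    instances: dict[type, Any] = {}
    lock = threading.Lock()

    def get_instance(*args: Any, **kwargs: Any) -> T:
        # The lock prevents two threads from constructing the instance at once.
        with lock:
            if cls not in instances:
                instances[cls] = cls(*args, **kwargs)
        return instances[cls]

    return get_instance


@singleton
class DemoScheduler:
    def __init__(self) -> None:
        self.jobs: list[str] = []


first = DemoScheduler()
second = DemoScheduler()
print(first is second)
```

Note that the instance is per process: with more than one uvicorn worker, each worker process would hold its own instance.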
Other changes
- Re-ordered the startup process so that uvicorn can use more than 1 worker (when configured)

Testing

All tests utilize mocked datasets, mocked data retrieval, and various fixtures.

I have also manually tested the scheduling, training, and forecasting. I let the service run for roughly 12 hours against an OpenRemote demo instance. No issues occurred during this window.

Notes

Stricter validation of the model configuration will be part of the endpoint implementation (#14). Examples: checking whether the assets are present in the provided realm, whether a regressor has predicted datapoints, and whether timestamps are valid.

Acceptance Criteria

No linter errors
All tests pass (integration tests are allowed to skip)
Application starts and the scheduler is started

Reach out if you'd like help with manually adding a config to test against an actual OR instance.

The code structure/implementation makes sense
Implementation is well documented (comments)
Tests cover the core functionality, e.g. data preparation, training, forecast and scheduling.

Sample Config

{
  "id": "d3c119a6-1018-4ebd-932b-a509eb7ab741",
  "realm": "smartcity",
  "name": "Power Total Consumers Forecast",
  "enabled": true,
  "type": "prophet",
  "target": {
    "asset_id": "44ORIhkDVAlT97dYGUD9n5",
    "attribute_name": "powerTotalConsumers",
    "cutoff_timestamp": 1716153600000
  },
  "forecast_interval": "PT1M",
  "training_interval": "PT2M",
  "forecast_periods": 96,
  "forecast_frequency": "1h",
  "daily_seasonality": true,
  "weekly_seasonality": true,
  "yearly_seasonality": true
}
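
The `forecast_interval` and `training_interval` values above look like ISO 8601 durations. Assuming that is the intent, a small parser for the day/time subset could look like this; `parse_duration` is a hypothetical helper and may not match how the service actually interprets these fields.

```python
import re
from datetime import timedelta

# Supports a subset of ISO 8601 durations: days, hours, minutes, seconds.
_DURATION = re.compile(
    r"^P(?:(?P<days>\d+)D)?"
    r"(?:T(?:(?P<hours>\d+)H)?(?:(?P<minutes>\d+)M)?(?:(?P<seconds>\d+)S)?)?$"
)


def parse_duration(value: str) -> timedelta:
    match = _DURATION.match(value)
    # Reject strings such as "P" or "PT" that match but carry no component.
    if not match or not any(match.groupdict().values()):
        raise ValueError(f"Unsupported duration: {value!r}")
    parts = {name: int(number) for name, number in match.groupdict().items() if number}
    return timedelta(**parts)


print(parse_duration("PT1M"))
print(parse_duration("PT2M"))
```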

wborn

I noticed these small issues while checking the changes in IntelliJ.

src/service_ml_forecast/common/fs_util.py

tests/conftest.py

tests/ml/resources/prophet-tariff-config.json

tests/ml/resources/prophet-windspeed-config.json

wborn · 2025-04-09T09:00:45Z

> I am a bit unsure about using the filesystem directly for storing the config files, but it should be fine with the scale the service will be used at. I made sure to make the writes atomic, and since it will run in docker you can map the volume.

I'm fine with config files for now but it might be something to add in the future when there is a need for database anyhow. Will everything also be configurable via the API/UI?

dominiquekleeven · 2025-04-09T21:52:39Z

> I am a bit unsure about using the filesystem directly for storing the config files, but it should be fine with the scale the service will be used at. I made sure to make the writes atomic, and since it will run in docker you can map the volume.
>
> I'm fine with config files for now but it might be something to add in the future when there is a need for database anyhow. Will everything also be configurable via the API/UI?

Yes, it will be possible to create configs via a UI and via the REST API.

Thanks for the review, will make the necessary changes tomorrow.
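
For reference, the atomic writes mentioned in this thread are commonly done by writing to a temporary file in the same directory and then renaming it over the target. The sketch below shows that general pattern; `atomic_write_json` is a hypothetical name, and the actual helper in `fs_util.py` may work differently.

```python
import json
import os
import tempfile
from pathlib import Path


def atomic_write_json(path: Path, data: dict) -> None:
    path.parent.mkdir(parents=True, exist_ok=True)
    # The temp file lives in the target directory so the rename stays on one filesystem.
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as tmp:
            json.dump(data, tmp)
            tmp.flush()
            os.fsync(tmp.fileno())
        # os.replace swaps the file in one step, so readers see the old or the new content.
        os.replace(tmp_name, path)
    except BaseException:
        # Leave no partial temp file behind if anything fails.
        if os.path.exists(tmp_name):
            os.unlink(tmp_name)
        raise


with tempfile.TemporaryDirectory() as directory:
    target = Path(directory) / "configs" / "demo.json"
    atomic_write_json(target, {"id": "demo", "enabled": True})
    print(target.read_text(encoding="utf-8"))
```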

dominiquekleeven added 9 commits March 12, 2025 15:35

Environment variable setup, logging configuration adjustment.

2f95fe0

Logging moved to Python rather than YAML. Pydantic settings for dealing with environment variables and run configuration.

Work in progress

d449a0d

Retrieve historical datapoints for asset attribute

f1c16bf

Write & Retrieve predicted data points

1b588f4

Check whether an OpenRemote instance is available for API tests

e059888

Minor clean up

231eb16

Small update to Docstrings

0cf9e80

Refactor to use Pytest fixtures and added a mocked OpenRemoteClient test

fcd2d91

Update test_openremote_client.py

bd0ceb9

dominiquekleeven changed the title ~~Added support for Prophet ML model~~ Integrate Prophet ML model Mar 21, 2025

dominiquekleeven added 20 commits March 21, 2025 11:04

Directory rename

ba57298

Wip

f6c104f

Minor model config update

a2fc0a5

Include Prophet and data science libs

23963a5

Update pyproject.toml

ea205f2

Merge branch 'openremote-integration' into prophet-integration

23f15a1

Small structural tweaks

8a23af5

Merge branch 'openremote-integration' into prophet-integration

5838120

Wip

49959fc

Wip

60e8ecc

WIP

fab6aa4

Dataframe formatting for Prophet

ed1a646

Model saving and loading

f48bbda

WIP forecasting using trained model

1f2c8cb

Use cutoff timestamp for input attributes

b7dbabe

Fix prophet causing issues with MyPy linter

8dc0457

Use deployment folder for data

23917cb

Log successful forecast

fc56cb7

Add mock datapoints for testing

680cb50

Format fix

0247ecd

dominiquekleeven mentioned this pull request Apr 3, 2025

Added Model Configuration Endpoints #26

Closed

dominiquekleeven added 16 commits April 3, 2025 12:13

Renamed ml data service to be more generic

96755d0

Enabled/Disabled state for model configurations, fixed stale job clea…

d4986c6

…n up if configs are empty.

Don't warn about empty configs directory

37846c0

Add prefix to ENV variables

c96b936

Enforce UUID, path handling adjustments

efb6baf

Add missing license header

d9ee7c4

Changes to how exceptions, errors are handled.

bc07125

- Exceptions that are expected should be handled if they can be recovered from.
- Exceptions that are not expected should bubble up, since they are exceptional cases and will be logged when they are raised.

Merge branch 'main' into prophet-integration

cf0ebf1

Update uv.lock

4c42dba

Exception chaining, additional test for realm filtering

31c7a60

Minor convention changes and formatting of docs

7bdb01a

Update test model scheduler test description

fb4ab71

Format

6e60014

API improvements

69a8276

Minor model config update

548706b

Update model_scheduler.py

112f5b3

wborn reviewed Apr 9, 2025

Merge branch 'main' into prophet-integration

1c47cea

dominiquekleeven and others added 6 commits April 10, 2025 13:19

Update license headers

e5ceefb

Update src/service_ml_forecast/common/fs_util.py

a0b1a91

Co-authored-by: Wouter Born <github@maindrain.net>

Update tests/ml/resources/prophet-tariff-config.json

9dc8f4b

Co-authored-by: Wouter Born <github@maindrain.net>

Update tests/ml/resources/prophet-windspeed-config.json

a4c53f3

Co-authored-by: Wouter Born <github@maindrain.net>

openremote client fixtures can return None

041ff01

Merge branch 'prophet-integration' of https://github.com/openremote/s…

0f7481a

…ervice-ml-forecast into prophet-integration

wborn approved these changes Apr 10, 2025

wborn merged commit c5b53dc into main Apr 10, 2025
1 check passed

wborn deleted the prophet-integration branch April 10, 2025 12:19
