Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DataDriftTrigger: Support custom embedding encoder model #417

Open
jenny011 opened this issue May 5, 2024 · 0 comments
Open

DataDriftTrigger: Support custom embedding encoder model #417

jenny011 opened this issue May 5, 2024 · 0 comments

Comments

@jenny011
Copy link
Contributor

jenny011 commented May 5, 2024

DataDriftTrigger currently uses the previously trained model in the pipeline as the embedding encoder. We would like to allow using user-provided models to compute embeddings.

jenny011 added a commit that referenced this issue May 14, 2024
This is a clean version of PR#367.
1. Add DataDriftTrigger class to supervisor. Supports one configurable
Evidently metric. Launches drift detection every N new data points. Data
used in detection are data trained in the previous trigger and all the
untriggered new data.
2. Update Trigger interface. `Trigger.inform()` returns a Generator
instead of List.
3. Add a generic ModelDownloader in supervisor.
4. Add example pipelines using DataDriftTrigger.
5. Add Evidently to pylint known third party.
6. Change ModelDownloader to embedding encoder utils. The downloader
sets up and returns the model. The DataDriftTrigger owns the model.

Future
1. Support multiple configurable Evidently metric. #416
2. Support Alibi-Detect. #414 
3. Support custom embedding encoder. #417
4. Support different windowing for detection data, e.g. compare with all
previously trained data. #418
5. Common DataLoaderInfo #415
robinholzi pushed a commit that referenced this issue May 18, 2024
This is a clean version of PR#367.
1. Add DataDriftTrigger class to supervisor. Supports one configurable
Evidently metric. Launches drift detection every N new data points. Data
used in detection are data trained in the previous trigger and all the
untriggered new data.
2. Update Trigger interface. `Trigger.inform()` returns a Generator
instead of List.
3. Add a generic ModelDownloader in supervisor.
4. Add example pipelines using DataDriftTrigger.
5. Add Evidently to pylint known third party.
6. Change ModelDownloader to embedding encoder utils. The downloader
sets up and returns the model. The DataDriftTrigger owns the model.

Future
1. Support multiple configurable Evidently metric. #416
2. Support Alibi-Detect. #414 
3. Support custom embedding encoder. #417
4. Support different windowing for detection data, e.g. compare with all
previously trained data. #418
5. Common DataLoaderInfo #415
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant