Add Feature Extraction Support for API Classifiers #77

mohamedelabbas1996 · 2025-04-14T18:36:13Z

Description

This PR adds support for returning model feature vectors (embeddings) alongside classification results in the Data Companion API.
The classification pipeline now supports returning a vector embedding per classification, derived from the classification model backbone.

The changes are fully backward-compatible for models that do not implement custom get_features(), as they will fallback to returningNone from the base class.

Related Issues

#752

Screenshots

Detection features clustering visualization using K-means + PCA

sentry-io · 2025-04-14T18:36:25Z

🔍 Existing Issues For Review

Your pull request is modifying functions with the following pre-existing issues:

📄 File: trapdata/api/models/classification.py

Function	Unhandled Issue
`save_results`	ValidationError: 15 validation errors for ClassificationResponse ... `Event Count:` 2
`save_results`	ValidationError: 10 validation errors for ClassificationResponse ... `Event Count:` 2
`save_results`	AttributeError: 'NoneType' object has no attribute 'tolist' ... `Event Count:` 1
`save_results`	ValueError: not enough values to unpack (expected 3, got 2) ... `Event Count:` 1

_{Did you find this useful? React with a 👍 or 👎}

mihow · 2025-04-26T00:25:38Z

pyproject.toml

@@ -36,7 +36,8 @@ pyobjus = [
    { version = "^1.2.1", platform = "darwin" },
    { version = "^1.2.1", platform = "linux" },
 ]
-
+plotly = "^5.21.0"
+scikit-learn = "^1.3.0"


I think we should make these optional dependencies and just use numpy in the tests. unless we need to use them in the core app.

[tool.poetry.extras] dev = ["plotly", "scikit-learn"]

trapdata/api/tests/test_features_extraction.py

mihow · 2025-04-26T00:29:46Z

trapdata/ml/models/classification.py

@@ -287,6 +287,16 @@ def get_model(self):
        model.eval()
        return model

+    def get_features(self, batch_input: torch.Tensor) -> torch.Tensor:


Nice work on this method of extracting features! It seems more flexible than our current feature extractor. Perhaps we should add a comment in both feature extractors that the other one exists. And eventually update the old one to use this code.

mohamedelabbas1996 added 4 commits April 13, 2025 21:00

feat: Added features field to the classification response

368edc2

feat: add support for returning features in APIMothClassifier response

4484f2e

added fallback get_features method to the InferenceBaseClass

3cc31ad

feat: implemented get_features for Resnet50TimmClassifier class

8071168

mohamedelabbas1996 added 4 commits April 14, 2025 14:43

chore: moved features dim to constants

52f0f62

Default to None if get_features is not implemented

b4c3af7

Added features extraction tests

ae62dd5

Removed prints

88c8220

mohamedelabbas1996 marked this pull request as ready for review April 22, 2025 15:38

mohamedelabbas1996 added 3 commits April 23, 2025 10:15

Added clustering using K-Means and visualization

fa7dee8

Added plotly dependency

cce38f3

Added sklearn dependency

902331b

mihow reviewed Apr 26, 2025

View reviewed changes

trapdata/api/tests/test_features_extraction.py Show resolved Hide resolved

mihow reviewed Apr 26, 2025

View reviewed changes

chore: make plotly optional, fix type warnings

9306bd0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Feature Extraction Support for API Classifiers #77

Add Feature Extraction Support for API Classifiers #77

Uh oh!

mohamedelabbas1996 commented Apr 14, 2025 •

edited

Loading

Uh oh!

sentry-io bot commented Apr 14, 2025

Uh oh!

mihow Apr 26, 2025

Uh oh!

Uh oh!

mihow Apr 26, 2025

Uh oh!

Uh oh!

Add Feature Extraction Support for API Classifiers #77

Are you sure you want to change the base?

Add Feature Extraction Support for API Classifiers #77

Uh oh!

Conversation

mohamedelabbas1996 commented Apr 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issues

Screenshots

Uh oh!

sentry-io bot commented Apr 14, 2025

🔍 Existing Issues For Review

Uh oh!

mihow Apr 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mihow Apr 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mohamedelabbas1996 commented Apr 14, 2025 •

edited

Loading