CrucibleFeedback

Production feedback loop for ML systems: telemetry ingestion, quality assessment, drift detection, curation, retraining triggers, and export

Features

Batch-buffered ingestion with PII sanitization and telemetry events
User signal capture (thumbs up/down, regenerate, edit, copy, share, report)
Multi-check quality assessment (format, length, refusal, repetition, optional LLM-judge)
Drift detection (statistical, embedding, output)
Data curation strategies (high-quality, hard examples, diverse)
Retraining triggers (drift, quality drop, count, schedule)
Export to JSONL, HuggingFace dataset dir, Parquet placeholder, preference pairs
Crucible stage integration for pipelines

Installation

# mix.exs
{:crucible_feedback, "~> 0.2.0"}

Configuration

Database Configuration (Oban Pattern)

CrucibleFeedback uses dynamic repo injection - your host application provides the Repo:

# config/config.exs
config :crucible_feedback,
  repo: MyApp.Repo,  # Required when using Ecto storage
  storage: CrucibleFeedback.Storage.Ecto,
  embedding_client: CrucibleFeedback.EmbeddingClient.Noop,
  start_ingestion: true,
  ingestion: [
    flush_interval: :timer.seconds(5),
    max_batch_size: 1_000
  ],
  sanitizer: [
    pii_patterns: []
  ],
  quality: [
    min_length: 20,
    max_length: 4_000
  ],
  triggers: [
    drift_threshold: 0.2,
    quality_threshold: 0.7,
    data_count_threshold: 1_000,
    schedule_interval_seconds: 86_400
  ],
  export: [
    output_dir: "exports"
  ],
  curation: [
    limit: 1_000,
    min_quality_score: 0.8,
    max_hard_quality: 0.5
  ]

# Your host app's Repo configuration
config :my_app, MyApp.Repo,
  database: "my_app_dev",
  username: "postgres",
  password: "postgres",
  hostname: "localhost"

Then start your Repo in your application's supervision tree:

# lib/my_app/application.ex
children = [
  MyApp.Repo,
  # ... other children
]

Migrations

Copy migrations from deps/crucible_feedback/priv/repo/migrations/ or run:

mix crucible_feedback.install

Legacy Mode

For backwards compatibility, set start_repo: true to auto-start internal Repo:

config :crucible_feedback,
  start_repo: true,
  ecto_repos: [CrucibleFeedback.Repo]

config :crucible_feedback, CrucibleFeedback.Repo,
  database: "crucible_feedback_dev",
  username: "postgres",
  password: "postgres",
  hostname: "localhost"

Usage

Ingest Events

CrucibleFeedback.log_inference(%{
  deployment_id: "deploy-1",
  model_version_id: "model-1",
  user_id: "user-123",
  prompt: "What is 2+2?",
  response: "4",
  latency_ms: 120,
  token_count: 42,
  metadata: %{source: "api"}
})

Record Signals

CrucibleFeedback.record_signal("inference-id", :thumbs_up, %{source: "ui"})
CrucibleFeedback.record_user_edit("inference-id", "User edited response")

Quality Assessment

CrucibleFeedback.assess_quality(%{response: "{\"ok\": true}"})
CrucibleFeedback.get_quality_stats("deploy-1")

Drift Detection

CrucibleFeedback.detect_drift("deploy-1", window_size: 1_000)

Curation and Export

CrucibleFeedback.curate("deploy-1", persist: true)
CrucibleFeedback.export("deploy-1", format: :jsonl, output_path: "exports/feedback.jsonl")
CrucibleFeedback.export_preference_pairs("deploy-1")

Retraining Triggers

CrucibleFeedback.check_triggers("deploy-1")

Feedback Stages

This package provides Crucible stages for feedback loop management:

:export_feedback - Export collected feedback data for model retraining
:check_triggers - Check retraining triggers based on feedback signals and drift

All stages implement the Crucible.Stage behaviour with full describe/1 schemas.

Options (Export Feedback)

:format - Export format (:jsonl, :parquet, :csv)
:output_path - Output file path
:filters - Filters to apply to exported data

Options (Check Triggers)

:threshold - Trigger threshold
:window_hours - Time window for trigger evaluation
:trigger_types - Types of triggers to check (:drift, :accuracy, :volume, :feedback)

Stage Usage

CrucibleFeedback.Stages.ExportFeedback.run(context, format: :jsonl)
CrucibleFeedback.Stages.CheckTriggers.run(context, [])

Storage Backends

CrucibleFeedback.Storage.Ecto (Postgres)
CrucibleFeedback.Storage.Memory (tests/dev)
CrucibleFeedback.Storage.Clickhouse (placeholder)

Migrations

mix ecto.create
mix ecto.migrate

Testing

mix test

Notes

Parquet export currently writes JSONL content to the specified .parquet path as a placeholder.
To enable embedding drift and diversity selection, configure embedding_client with a real provider.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
config		config
lib		lib
priv/repo/migrations		priv/repo/migrations
test		test
.formatter.exs		.formatter.exs
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
logo.svg		logo.svg
mix.exs		mix.exs
mix.lock		mix.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

CrucibleFeedback

Features

Installation

Configuration

Database Configuration (Oban Pattern)

Migrations

Legacy Mode

Usage

Ingest Events

Record Signals

Quality Assessment

Drift Detection

Curation and Export

Retraining Triggers

Feedback Stages

Options (Export Feedback)

Options (Check Triggers)

Stage Usage

Storage Backends

Migrations

Testing

Notes

About

Uh oh!

Releases

Packages

Languages

License

North-Shore-AI/crucible_feedback

Folders and files

Latest commit

History

Repository files navigation

CrucibleFeedback

Features

Installation

Configuration

Database Configuration (Oban Pattern)

Migrations

Legacy Mode

Usage

Ingest Events

Record Signals

Quality Assessment

Drift Detection

Curation and Export

Retraining Triggers

Feedback Stages

Options (Export Feedback)

Options (Check Triggers)

Stage Usage

Storage Backends

Migrations

Testing

Notes

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages