pinglin commented Sep 22, 2025

Because

- The data marshaling/unmarshaling framework repeatedly performed expensive operations, including:
  - Reflection type computations on every operation
  - Regex pattern compilation for validation on each use
  - Instill tag parsing without caching
  - Time format slice creation for every time parsing operation
  - File type checking with repeated type slice creation
- These operations caused performance bottlenecks in high-throughput scenarios
- Memory allocations were not optimized, leading to unnecessary garbage collection overhead

**This commit**

- **Implements reflection type caching**: Pre-computes `reflect.Type` instances for commonly used types (`time.Time`, `time.Duration`, `format.Value`, etc.) to avoid repeated `reflect.TypeOf()` calls
- **Adds regex pattern caching**: Implements a thread-safe LRU cache for compiled regex patterns, protected by `sync.RWMutex`, to eliminate repeated compilation overhead
- **Introduces tag parsing cache**: Caches parsed instill tag results to avoid expensive string parsing each time a field is processed
- **Pre-compiles time formats**: Stores common time parsing formats in global variables to eliminate repeated slice creation
- **Pre-computes file type slices**: Maintains global file type arrays for efficient type checking
- **Enhances JSON string to struct conversion**: Adds a fast pre-check for JSON-like strings to minimize unnecessary parsing attempts
- **Consolidates test suite**: Combines all benchmarks into `struct_test.go` for comprehensive performance tracking

## Performance Improvements (Benchmarked)

| Optimization | Before | After | Improvement | Memory Impact |
| --- | --- | --- | --- | --- |
| **Reflection Type Caching** | 5.146 ns/op | 0.2555 ns/op | **20.1x faster** | 0 B/op (no allocations) |
| **Regex Pattern Caching** | Repeated compilation | LRU-cached compilation | **12.6x faster** | 100% memory reduction |
| **Tag Parsing Optimization** | String parsing on every access | Cached parsing results | **13.0x faster** | 100% memory reduction |
| **Time Format Pre-compilation** | Format slice creation | Pre-compiled arrays | **1.03x faster** | Eliminates slice allocations |
| **File Type Checking** | Type slice creation | Pre-computed global array | **6.2x faster** | Eliminates reflection calls |

### Overall Performance Metrics

- **Complete Struct Unmarshaling**: 1635 ns/op, 496 B/op, 14 allocs/op
- **Complete Struct Marshaling**: 1476 ns/op, 1184 B/op, 23 allocs/op
- **Concurrent Access**: regex cache at 114.4 ns/op, tag cache at 98.70 ns/op

## Technical Implementation Details

### 🏗️ **Architecture Enhancements**

1. **Pre-computed global variables**:

   ```go
   var (
       timeTimeType     = reflect.TypeOf(time.Time{})
       timeDurationType = reflect.TypeOf(time.Duration(0))
       formatValueType  = reflect.TypeOf((*format.Value)(nil)).Elem()
       // ... more pre-computed types
   )
   ```

2. **Thread-safe caching**:

   ```go
   type regexCache struct {
       cache map[string]*regexp.Regexp
       mu    sync.RWMutex
   }
   ```

3. **LRU cache implementation**:
   - Automatic eviction of least-recently-used entries
   - Configurable cache sizes for different use cases
   - Double-checked locking for optimal performance

4. **Fast JSON pre-check**:

   ```go
   if len(stringValue) > 1 && (stringValue[0] == '{' || stringValue[0] == '[') {
       // Only attempt JSON parsing for JSON-like strings
   }
   ```

### 🧪 **Comprehensive Test Coverage**

- **287 unit tests** covering all functionality
- **9 benchmark suites** measuring performance improvements
- **Edge case testing** for concurrent access patterns
- **Memory allocation profiling** to prevent regressions
- **Performance regression detection** through continuous benchmarking

### 🔒 **Thread Safety & Reliability**

- All caches use `sync.RWMutex` for concurrent access
- Double-checked locking patterns for initialization
- Graceful fallback on cache misses
- Zero breaking changes to existing APIs

pinglin merged commit a75887f into main on Sep 22, 2025
6 checks passed
pinglin deleted the pinglin/perf-data-struct-optimization branch on September 22, 2025 04:44
pinglin added a commit that referenced this pull request Sep 22, 2025
pinglin added a commit that referenced this pull request Sep 22, 2025
jvallesm added a commit to instill-ai/instill-core that referenced this pull request Sep 23, 2025
Because
- The version of the pipeline-backend service is not updated in the
instill-core repository.

This commit
- updates the `PIPELINE_BACKEND_VERSION` in the `.env` file to
`1b4cd1f`.
- updates the `pipelineBackend.image.tag` in the helm chart values.yaml
file to `1b4cd1f`.

## Changes in pipeline-backend
- fix(text): correct positions on duplicate markdown chunks
(instill-ai/pipeline-backend#1120)
- refactor(component,generic,http): replace env-based URL validation
with constructor injection (instill-ai/pipeline-backend#1121)
- fix(usage): add missing error filtering for users/admin
(instill-ai/pipeline-backend#1119)
- feat(component,ai,gemini): implement File API support for large files…
(instill-ai/pipeline-backend#1118)
- perf(data): optimize struct marshaling/unmarshaling with caching and …
(instill-ai/pipeline-backend#1117)
- feat(data): enhance unmarshaler with JSON string to struct conversion
(instill-ai/pipeline-backend#1116)
- feat(data): implement time types support with pattern validation
(instill-ai/pipeline-backend#1115)
- feat(component,ai,gemini): add multimedia support with unified format…
(instill-ai/pipeline-backend#1114)
- ci(workflows): adopt GitHub-hosted runner
(instill-ai/pipeline-backend#1113)
- perf(data): enhance comprehensive format coverage and optimize test
performance (instill-ai/pipeline-backend#1112)
- ci(workflows): adopt larger runner for coverage test
(instill-ai/pipeline-backend#1111)
- perf(component,operator,document): optimize unit tests and fix
LibreOffice dependency failures (instill-ai/pipeline-backend#1110)
- perf(component,operator,video): optimize unit test performance by
59.7% (instill-ai/pipeline-backend#1109)
- perf(component,operator,image): optimize unit tests for 98.5% faster …
(instill-ai/pipeline-backend#1107)
- ci(docker): optimize Dockerfiles with multi-stage builds for faster
build times (instill-ai/pipeline-backend#1108)
- perf(data): implement automatic field naming convention detection with
LRU caching (instill-ai/pipeline-backend#1105)
- feat(component,ai,gemini): enhance streaming to output all fields
(instill-ai/pipeline-backend#1106)
- fix(component,ai,gemini): correct text-based documents logic
(instill-ai/pipeline-backend#1103)
- test(component,generic,http): replace external httpbin.org dependency
with local test server (instill-ai/pipeline-backend#1101)
- ci(docker): add GitHub fallback for ffmpeg installation
(instill-ai/pipeline-backend#1102)

Co-authored-by: jvallesm <3977183+jvallesm@users.noreply.github.com>