Conversation
Centril
left a comment
There was a problem hiding this comment.
Approved modulo the 2 requested changes below.
|
/update-llm-benchmark |
LLM Benchmark Results (ci-quickfix)
Compared against master branch baseline Generated at: 2026-01-21T19:26:17.891Z Failure Analysis (click to expand)Benchmark Failure AnalysisGenerated from: Summary
Analysis of SpacetimeDB Benchmark FailuresIn this analysis, I've categorized the SpacetimeDB benchmark test failures by language and mode, detailing the differences between the generated code and expected outputs, and providing actionable insights for documentation changes. Rust / rustdoc_json FailuresCommon Errors: Schema Parity and Publishing IssuesFailures: t_002_scheduled_table, t_017_scheduled_columns
Specific Reducer and Table DefinitionsFailures: t_003_struct_in_table
CRUD and User Insertion ErrorsFailures: t_004_insert, t_006_delete, t_007_crud
C# / docs FailuresIndex Lookup IssuesFailures: t_008_index_lookup
This analysis provides insights into the recurring issues across the benchmark failures and identifies actionable steps for documentation improvements to prevent similar failures in the future. |
Co-authored-by: Mazdak Farrokhzad <twingoow@gmail.com> Signed-off-by: Noa <coolreader18@gmail.com>
Description of Changes
This release has a couple of patches I've been waiting on for a bit:
StackFrame::get_script_source[_mapping_url]denoland/rusty_v8#1886, which allows us to lazily parse sourcemaps.sourcemapto get Support charset indecode_data_urlgetsentry/rust-sourcemap#137, so that we no longer have to have our hack to work around that.InterruptCallbacksignature denoland/rusty_v8#1892, which allows us to re-enablerun_timeout_and_cb_every(though I haven't done that in this PR)Expected complexity level and risk
2: v8 is a very big and important dependency, but also very stable.
Testing