syntax issue in migration by icecrasher321 · Pull Request #1007 · simstudioai/sim

icecrasher321 · 2025-08-18T07:59:19Z

Summary

Attempting to fix memory issue with github runner for large migration.

vercel · 2025-08-18T07:59:25Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Preview	Comments	Updated (UTC)
sim (staging)	Building			Aug 18, 2025 7:59am

1 Skipped Deployment

Project	Deployment	Preview	Comments	Updated (UTC)
docs	Skipped			Aug 18, 2025 7:59am

greptile-apps

Greptile Summary

This PR fixes memory issues with GitHub runners during a large data migration by making the migration more resilient to varying database schema states. The primary changes focus on migration file 0076_damp_vector.sql, which appears to be a complex data migration involving workflow logs and user statistics.

The key modifications include:

Added RECURSIVE to CTE: The Common Table Expression (CTE) now uses WITH RECURSIVE candidate AS to properly handle hierarchical traversal of trace spans and their children, which is essential for processing nested execution data.
Switched to JSON-based column access: Instead of direct column references like l.total_cost, the migration now uses (to_jsonb(l)->>'total_cost')::numeric to safely extract values. This prevents failures when columns might not exist in certain schema states.
Implemented JSON existence checks: Rather than direct column checks, the migration uses JSON operators like (to_jsonb(l) ? 'total_tokens') to verify field existence before accessing values.

These changes make the migration more robust across different deployment environments and schema states, particularly important for what appears to be a "one-shot data migration" that needs to be safe on reruns. The migration likely processes large amounts of workflow execution data to populate user statistics, which explains the memory pressure on GitHub runners.

Confidence score: 3/5

This PR addresses a specific technical issue but introduces complexity that could impact performance
Score reflects the high-risk nature of modifying complex data migrations and potential performance implications
Pay close attention to the migration file and thoroughly test with production-sized datasets

_{1 file reviewed, no comments}

_{Edit Code Review Bot Settings | Greptile}

syntax issue in migration

syntax issue in migration

09b95f4

icecrasher321 merged commit 3873f44 into main Aug 18, 2025
4 of 6 checks passed

vercel bot deployed to staging – sim August 18, 2025 08:02 View deployment

greptile-apps bot reviewed Aug 18, 2025

View reviewed changes

arenadeveloper02 pushed a commit to arenadeveloper02/p2-sim that referenced this pull request Sep 19, 2025

Merge pull request simstudioai#1007 from simstudioai/staging

7d84f45

syntax issue in migration

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

syntax issue in migration#1007

syntax issue in migration#1007
icecrasher321 merged 1 commit intomainfrom
staging

icecrasher321 commented Aug 18, 2025

Uh oh!

vercel bot commented Aug 18, 2025

Uh oh!

Uh oh!

greptile-apps bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

icecrasher321 commented Aug 18, 2025

Summary

Uh oh!

vercel bot commented Aug 18, 2025

Uh oh!

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Greptile Summary

Confidence score: 3/5

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant