H-3572, H-4105: Track Node migrations run, skip already-run migrations #8187

CiaranMn · 2025-12-16T12:18:58Z

🌟 What is the purpose of this PR?

Start storing the Node API migrations run and the latest migration state (version number by type) as part of the HASH instance entity (they create system types and update entities to updated types), and skip those already run.

Import Note / Call for ideas
The latest migration state is persisted because the migration logic relies on checking whether the 'next version' (version in state + 1) of a type exists before creating it, where the 'current version' was previously a state object that was updated as each migration ran (i.e. all types start at 1 and get incremented as migrations update them).

The first time this PR is deployed, no migrations will have been skipped, and so we need to maintain this 'start from 1' behaviour, which means an empty migration state. After migrations have run, both (1) the migrations already run and (2) latest versions state will have been persisted, and the next time the API starts up the migrations will be skipped and the next new migration will have the correct versions to increment from.

Apart from the first run, it will be possible to hydrate the 'current type version' from the database. So we can actually get rid of it in favour of just fetching all the versions of all types at the start of the migrations at some future point (assuming there is no other HASH instance which has run migrations PRIOR to the introduction of the skipping logic, which still needs the approach in this PR for the first deployment).

Note also that migrations which are skipped, if they somehow are not skipped in future (e.g. numbering changes), will not be idempotent, because they will have the wrong migration state (e.g. we rename 001 to 001b, it has the latest version of types in migration state, it increments to check if next exists, it doesn't, it creates User v8 but with the properties of V1).

This is a bit suboptimal. The other alternatives are:

Don't 'skip' migrations, but instead have some kind of 'dry run' handling that still runs all migrations to populate the migration state, but if a migration is marked has already run, don't actually do any db writes. This is maybe the second best or even equal to the approach currently taken. It would involve amending the functions that do the db operations (update types, entities) to check if the migration has already run, and simply return if it has.
Some other way of checking if the operation has already happened, e.g. diffing types. This is a bit messy and complicated and involves checking lots of things.

One consequence of this (not rebuilding state by running each migration each time, whether or not it makes changes) is that we can no longer have 'dev' migrations (don't run on prod yet) sitting around between migrations that have already run, and being 'un-deved' later, because the migration state they receive will reflect the latest versions in the db, which might not be what the version numbers should be at the point they fall in the files. I've therefore moved all dev migrations to later numbers and closed a few gaps in the existing migration numbering. All new migrations will now have to be numbered after existing ones (which should be the case anyway).

This PR is designed to speed up start-up time by not bothering to go through the process of checking entities that need upgrading (of which there should be none once a migration has run once).

There are also a couple of changes to handle the fact that the HASH instance might not be the latest version when it's fetched as part of migrations (change exact id to base URL for filtering).

Drive-bys:

Update database reset instructions in the README
Ensure Block Protocol 'query' and 'has-query' types are seeded as part of migrations.

Pre-Merge Checklist 🚀

🚢 Has this modified a publishable library?

This PR:

does not modify any publishable blocks or libraries, or modifications do not need publishing

📜 Does this require a change to the docs?

The changes in this PR:

are internal and do not require a docs change

🕸️ Does this require a change to the Turbo Graph?

The changes in this PR:

do not affect the execution graph

🛡 What tests cover this?

Migrations are run as part of integration tests.

cursor · 2025-12-16T12:19:06Z

PR Summary

Introduces persistent migration tracking to speed startup and ensure safe resumes.

Adds migrationsCompleted and migrationState to the HASH Instance and saves/loads them in migrateOntologyTypes; skips processed files and persists after each run
New migration 022 updates HASH Instance schema and upgrades existing entities (temporary instantiate policies); bumps hashInstance to v2 and updates IDs/baseUrls
Updates createHashInstance to accept an explicit entity type ID; adjusts migration 005 to pass it
Switches HASH Instance queries and validation to use baseUrl (not exact version) in backend utils; relaxes entity-type check accordingly
Seeds external BP query and has-query entity types during initial system types migration
Frontend return-types-as-json now returns JSON GraphQL errors on fetch failures
README: simplifies local DB reset steps

^{Written by Cursor Bugbot for commit b0c44a4. This will update automatically on new commits. Configure here.}

codecov · 2025-12-16T12:22:40Z

Codecov Report

❌ Patch coverage is 0% with 69 lines in your changes missing coverage. Please review.
✅ Project coverage is 58.97%. Comparing base (94ee310) to head (b0c44a4).
⚠️ Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
...tem-graph-is-initialized/migrate-ontology-types.ts	0.00%	40 Missing ⚠️
...migrations-completed-to-hash-instance.migration.ts	0.00%	21 Missing ⚠️
...grations/001-create-hash-system-types.migration.ts	0.00%	6 Missing ⚠️
...ate-hash-system-entities-and-web-bots.migration.ts	0.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #8187      +/-   ##
==========================================
- Coverage   59.71%   58.97%   -0.74%     
==========================================
  Files        1214     1188      -26     
  Lines      115203   112471    -2732     
  Branches     5062     4942     -120     
==========================================
- Hits        68793    66333    -2460     
+ Misses      45608    45380     -228     
+ Partials      802      758      -44

Flag	Coverage Δ
apps.hash-ai-worker-ts	`1.41% <ø> (ø)`
apps.hash-api	`0.00% <0.00%> (ø)`
local.hash-isomorphic-utils	`0.00% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

apps/hash-api/src/graph/ensure-system-graph-is-initialized/migrate-ontology-types.ts

This ensures migrations are idempotent by populating the cache with existing ontology types before applying migrations. Co-authored-by: c <c@hash.ai>

cursor · 2025-12-16T14:38:16Z

Cursor Agent can help with this pull request. Just @cursor in comments and I'll start working on changes in this branch.
_{Learn more about Cursor Agents}

This reverts commit d6a3636.

…isting numbering

augmentcode · 2026-01-12T19:39:20Z

🤖 Augment PR Summary

Summary: This PR makes Node API ontology migrations resumable and faster by persisting which migrations have run (and the accumulated type-version state) onto the HASH Instance entity.

Changes:

Adds a new migration to extend the HASH Instance entity type (bumping to hash-instance/v/2) with migrationsCompleted and migrationState properties, and upgrades existing instance entities accordingly.
Updates migrateOntologyTypes to load the persisted state on startup, skip migrations whose numbers are already recorded, and save state after each successful migration.
Adjusts HASH Instance creation and lookup to work across entity type versions (match by baseUrl rather than a single versioned URL), and threads the current hash instance entityTypeId into creation.
Moves/renumbers existing migration files (including dev migrations) so that new migrations are always appended after already-run ones.
Ensures Block Protocol query / has-query types are seeded during bootstrap, and updates relevant Block Protocol IDs from @h to @hash.
Simplifies local database reset instructions in the repo README.
Adds a defensive GraphQL fetch/JSON parsing error fallback in the frontend middleware.

Technical Notes: Migration progress is stored on the instance entity itself, enabling fast startups by avoiding repeated idempotency checks once migrations have completed.

_{🤖 Was this summary useful? React with 👍 or 👎}

augmentcode

Review completed. 2 suggestions posted.

Comment augment review to trigger a new review at any time.

...migrate-ontology-types/migrations/022-add-migrations-completed-to-hash-instance.migration.ts

apps/hash-api/src/graph/ensure-system-graph-is-initialized/migrate-ontology-types.ts

TimDiekmann

A few minor things, but nothing blocking. Looks good!

I think the approach is sufficient, it's a similar behavior as we have in the Graph migrations, but we also don't distinguish between dev and prod migrations and enforce continuously incremented migration numbers. The alternative you discussed with the dry-run seems more stable, but I don't know if we require it. I guess we can always change it later and eventually we want to move it to the Graph at some point anyway?

We, however, could consider adding the constraints to the README.

TimDiekmann · 2026-01-16T11:21:53Z

apps/hash-api/src/graph/ensure-system-graph-is-initialized/migrate-ontology-types.ts

+  await saveMigrationState({
+    context: params.context,
+    hashInstance,
+    migrationsCompleted,
+    migrationState,
+  });


Do we still want to call this when all migrations being skipped?

Fair, 54900d6

vercel · 2026-01-20T12:35:33Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Review	Updated (UTC)
ds-theme	Error		Jan 20, 2026 0:39am
hashdotdesign	Ready	Preview, Comment	Jan 20, 2026 0:39am

Your organization requires reapproval when changes are made, so Graphite has dismissed approvals. See the output of git range-diff at https://github.com/hashintel/hash/actions/runs/21171694942

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{Bugbot Autofix is OFF. To automatically fix reported issues with Cloud Agents, enable Autofix in the Cursor dashboard.}

...migrate-ontology-types/migrations/022-add-migrations-completed-to-hash-instance.migration.ts

CiaranMn added 3 commits December 16, 2025 12:11

update README database reset instructions

dfd4874

seed BP query types as part of migrations

890f309

track migrations run, skip already run migrations

21b2437

CiaranMn requested a review from TimDiekmann December 16, 2025 12:18

graphite-app bot assigned CiaranMn Dec 16, 2025

vercel bot deployed to Preview – petrinaut December 16, 2025 12:19 View deployment

vercel bot deployed to Preview – hash December 16, 2025 12:23 View deployment

cursor bot reviewed Dec 16, 2025

View reviewed changes

apps/hash-api/src/graph/ensure-system-graph-is-initialized/migrate-ontology-types.ts Show resolved Hide resolved

graphite-app bot requested review from a team and removed request for a team December 16, 2025 13:01

CiaranMn marked this pull request as draft December 16, 2025 13:02

CiaranMn removed the request for review from TimDiekmann December 16, 2025 13:02

feat: Hydrate migration state from graph on init

d6a3636

This ensures migrations are idempotent by populating the cache with existing ontology types before applying migrations. Co-authored-by: c <c@hash.ai>

vercel bot temporarily deployed to Preview – petrinaut December 16, 2025 14:38 Inactive

CiaranMn added 2 commits December 16, 2025 15:27

Revert "feat: Hydrate migration state from graph on init"

95f5e2d

This reverts commit d6a3636.

Merge branch 'main' into cm/track-and-dont-rerun-migrations

dfb6727

vercel bot temporarily deployed to Preview – petrinaut January 12, 2026 18:10 Inactive

Merge branch 'main' into cm/track-and-dont-rerun-migrations

201ed56

vercel bot deployed to Preview – hashdotdesign January 12, 2026 18:13 View deployment

vercel bot deployed to Preview – hash January 12, 2026 19:22 View deployment

move dev migrations to arbitrary larger number, close some gaps in ex…

01701b7

…isting numbering

vercel bot temporarily deployed to Preview – petrinaut January 12, 2026 19:27 Inactive

add ticket number to todo

a04cae5

vercel bot temporarily deployed to Preview – petrinaut January 12, 2026 19:28 Inactive

CiaranMn marked this pull request as ready for review January 12, 2026 19:32

vercel bot deployed to Preview – hash January 12, 2026 19:35 View deployment

augmentcode bot reviewed Jan 12, 2026

View reviewed changes

...migrate-ontology-types/migrations/022-add-migrations-completed-to-hash-instance.migration.ts Outdated Show resolved Hide resolved

apps/hash-api/src/graph/ensure-system-graph-is-initialized/migrate-ontology-types.ts Outdated Show resolved Hide resolved

respond to PR feedback

6f2383e

vercel bot temporarily deployed to Preview – petrinaut January 12, 2026 20:00 Inactive

another error improvement

1019329

vercel bot temporarily deployed to Preview – petrinaut January 12, 2026 20:02 Inactive

remove pointless comments

9a73d8b

vercel bot temporarily deployed to Preview – petrinaut January 12, 2026 20:03 Inactive

vilkinsons requested a review from TimDiekmann January 12, 2026 20:06

fix import

73ad9c2

vercel bot temporarily deployed to Preview – petrinaut January 13, 2026 10:52 Inactive

TimDiekmann previously approved these changes Jan 16, 2026

View reviewed changes

only do final update of migrations completed if count differs from prev

54900d6

vercel bot temporarily deployed to Preview – petrinaut January 20, 2026 12:35 Inactive

CiaranMn requested a review from TimDiekmann January 20, 2026 12:35

Merge branch 'main' into cm/track-and-dont-rerun-migrations

b0c44a4

vercel bot deployed to Preview – hashdotdesign January 20, 2026 12:38 View deployment

vercel bot had a problem deploying to Preview – ds-theme January 20, 2026 12:39 Failure

vercel bot deployed to Preview – petrinaut January 20, 2026 12:40 View deployment

vercel bot deployed to Preview – hash January 20, 2026 12:44 View deployment

cursor bot reviewed Jan 20, 2026

View reviewed changes

...migrate-ontology-types/migrations/022-add-migrations-completed-to-hash-instance.migration.ts Show resolved Hide resolved

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

H-3572, H-4105: Track Node migrations run, skip already-run migrations #8187

H-3572, H-4105: Track Node migrations run, skip already-run migrations #8187

Uh oh!

CiaranMn commented Dec 16, 2025 •

edited

Loading

Uh oh!

cursor bot commented Dec 16, 2025 •

edited

Loading

Uh oh!

codecov bot commented Dec 16, 2025 •

edited

Loading

Uh oh!

Uh oh!

cursor bot commented Dec 16, 2025

Uh oh!

augmentcode bot commented Jan 12, 2026

Uh oh!

augmentcode bot left a comment

Uh oh!

Uh oh!

Uh oh!

TimDiekmann left a comment

Uh oh!

TimDiekmann Jan 16, 2026

Uh oh!

CiaranMn Jan 20, 2026

Uh oh!

vercel bot commented Jan 20, 2026 •

edited

Loading

Uh oh!

cursor bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

4 participants

H-3572, H-4105: Track Node migrations run, skip already-run migrations #8187

Are you sure you want to change the base?

H-3572, H-4105: Track Node migrations run, skip already-run migrations #8187

Uh oh!

Conversation

CiaranMn commented Dec 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🌟 What is the purpose of this PR?

Pre-Merge Checklist 🚀

🚢 Has this modified a publishable library?

📜 Does this require a change to the docs?

🕸️ Does this require a change to the Turbo Graph?

🛡 What tests cover this?

Uh oh!

cursor bot commented Dec 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Summary

Uh oh!

codecov bot commented Dec 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

cursor bot commented Dec 16, 2025

Uh oh!

augmentcode bot commented Jan 12, 2026

Uh oh!

augmentcode bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

TimDiekmann left a comment

Choose a reason for hiding this comment

Uh oh!

TimDiekmann Jan 16, 2026

Choose a reason for hiding this comment

Uh oh!

CiaranMn Jan 20, 2026

Choose a reason for hiding this comment

Uh oh!

vercel bot commented Jan 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Milestone

Development

Uh oh!

4 participants

CiaranMn commented Dec 16, 2025 •

edited

Loading

cursor bot commented Dec 16, 2025 •

edited

Loading

codecov bot commented Dec 16, 2025 •

edited

Loading

vercel bot commented Jan 20, 2026 •

edited

Loading