
Conversation

@NathanFlurry
Member

No description provided.

@railway-app

railway-app bot commented Jan 21, 2026

🚅 Deployed to the rivet-pr-3989 environment in rivet-frontend

Service            | Status                      | Web | Updated (UTC)
website            | 😴 Sleeping (View Logs)     | Web | Jan 22, 2026 at 5:56 pm
frontend-cloud     | ❌ Build Failed (View Logs) | Web | Jan 22, 2026 at 5:46 pm
frontend-inspector | ❌ Build Failed (View Logs) | Web | Jan 22, 2026 at 5:46 pm

Member Author

NathanFlurry commented Jan 21, 2026


How to use the Graphite Merge Queue

Add the label merge-queue to this PR to add it to the merge queue.

You must have a Graphite account in order to use the merge queue. Sign up using this link.

An organization admin has enabled the Graphite Merge Queue in this repository.

Please do not merge from GitHub as this will restart CI on PRs being processed by the merge queue.

This stack of pull requests is managed by Graphite. Learn more about stacking.

@claude

claude bot commented Jan 21, 2026

PR Review: Actor Messages & Queues

This PR implements a message queue system for Rivet Actors, adding the ability to send and receive named messages with persistence and timeout support. Overall, the implementation is solid with good test coverage and proper error handling. Here are my findings:

✅ Strengths

  1. Well-structured queue management: The QueueManager class properly separates concerns with clear initialization, message persistence, and waiter resolution logic
  2. Comprehensive test coverage: The test suite covers key scenarios including timeout behavior, abort handling, size limits, and concurrent operations
  3. Proper error handling: Good use of custom error classes (QueueFull, QueueMessageTooLarge, QueueMessageInvalid, ActorAborted) with appropriate metadata
  4. Clean API design: The proxy-based queue sender API (handle.queue.messageName.send()) provides excellent developer ergonomics (see the usage sketch after this list)
  5. Versioned serialization: Proper use of BARE schema versioning for forward/backward compatibility
  6. Abort signal handling: Correctly propagates actor abort signals to waiting receivers
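
For reference, a minimal usage sketch inferred from the API surface mentioned in this review (handle.queue.<name>.send() and c.queue.next()); the declarations for handle and c below are illustrative only, so the example is self-contained, and are not the PR's actual exports:

// Hypothetical usage sketch; `handle` stands in for a client handle and `c`
// for the actor context.
declare const handle: {
    queue: Record<string, { send(body: unknown): Promise<void> }>;
};
declare const c: {
    queue: {
        next(
            name: string,
            opts?: { timeout?: number },
        ): Promise<{ name: string; body: unknown } | undefined>;
    };
};

async function example(): Promise<void> {
    // Client side: send a named message through the proxy-based sender.
    await handle.queue.orderCreated.send({ orderId: 123 });

    // Actor side: wait up to 10s for the next matching message.
    const msg = await c.queue.next("orderCreated", { timeout: 10_000 });
    if (msg !== undefined) {
        // ...process msg.body...
    }
}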

🐛 Potential Issues

1. Race Condition in Metadata Updates (Critical)

Location: queue-manager.ts:129-138

const id = this.#metadata.nextId;
const messageKey = makeQueueMessageKey(id);

// Update metadata before writing so we can batch both writes
this.#metadata.nextId = id + 1n;
this.#metadata.size += 1;
const encodedMetadata = this.#serializeMetadata();

// Batch write message and metadata together
await this.#driver.kvBatchPut(this.#actor.id, [
    [messageKey, encodedMessage],
    [KEYS.QUEUE_METADATA, encodedMetadata],
]);

Issue: If two enqueue() calls run concurrently, they could both read the same nextId, leading to message key collisions. The in-memory metadata is updated before the write completes, but there's no locking mechanism to prevent concurrent access.

Recommendation: Either:

  • Add a mutex/lock around the enqueue operation (see the sketch below)
  • Move to an atomic increment pattern if the KV driver supports it
  • Document that the actor runtime ensures single-threaded execution (if that's the case)
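
A minimal sketch of the mutex option, assuming the runtime does not already guarantee single-writer semantics; QueueManagerSketch, #enqueueLock, and #writeMessage are illustrative names, not the PR's:

// Serialize enqueue() calls with a promise-chain lock so no two callers read
// the same nextId before the batched write lands.
class QueueManagerSketch {
    #enqueueLock: Promise<void> = Promise.resolve();

    async enqueue(name: string, body: Uint8Array): Promise<void> {
        const run = this.#enqueueLock.then(() => this.#writeMessage(name, body));
        // Keep the chain alive even if this particular enqueue rejects.
        this.#enqueueLock = run.then(
            () => undefined,
            () => undefined,
        );
        return run;
    }

    async #writeMessage(name: string, body: Uint8Array): Promise<void> {
        // ...read nextId, bump metadata, and kvBatchPut as in the snippet above...
    }
}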

2. Inefficient Message Loading

Location: queue-manager.ts:228-243

async #drainMessages(nameSet: Set<string>, count: number): Promise<QueueMessage[]> {
    if (this.#metadata.size === 0) {
        return [];
    }
    const entries = await this.#loadQueueMessages(); // Loads ALL messages
    const matched = entries.filter((entry) => nameSet.has(entry.name));
    // ...
}

Issue: #drainMessages loads ALL queue messages from storage on every receive call, even when only requesting 1 message. This is O(n) for every receive operation and could be expensive for large queues.

Recommendation: Consider:

  • Adding an in-memory index of message names → IDs (sketched below)
  • Using a prefix scan with early termination
  • Only loading full messages after finding matching keys
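
A rough sketch of the in-memory index idea; the shape and names are hypothetical, and the index would need to be rebuilt from storage when the actor starts:

// Track message IDs per name so a receive only fetches the keys it needs
// instead of loading every message from storage.
class QueueNameIndexSketch {
    #idsByName = new Map<string, bigint[]>();

    recordEnqueue(name: string, id: bigint): void {
        const ids = this.#idsByName.get(name) ?? [];
        ids.push(id);
        this.#idsByName.set(name, ids);
    }

    // Return up to `count` matching IDs, oldest first.
    peekIds(nameSet: Set<string>, count: number): bigint[] {
        const candidates: bigint[] = [];
        for (const [name, ids] of this.#idsByName) {
            if (nameSet.has(name)) candidates.push(...ids);
        }
        candidates.sort((a, b) => (a < b ? -1 : a > b ? 1 : 0));
        return candidates.slice(0, count);
    }
}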

3. Potential Memory Leak with Waiters

Location: queue-manager.ts:190-220

The abort event listeners are registered but only cleaned up when the waiter resolves/rejects. If the actor stops before a waiter times out, the timeout callback on lines 184-187 will still fire and try to resolve.

Recommendation: Clear the timeout handle in the onStop handler:

const onStop = () => {
    this.#waiters.delete(waiterId);
    if (waiter.timeoutHandle) {
        clearTimeout(waiter.timeoutHandle);  // Add this
    }
    reject(new errors.ActorAborted());
};

4. Sequential Waiter Resolution

Location: queue-manager.ts:300-325

async #maybeResolveWaiters() {
    if (this.#waiters.size === 0) { return; }
    const pending = [...this.#waiters.values()];
    for (const waiter of pending) {
        // ...
        const messages = await this.#drainMessages(waiter.nameSet, waiter.count);
        // ...
    }
}

Issue: Waiters are resolved sequentially, each potentially loading all messages from storage. If there are many waiters, this could be slow.

Recommendation:

  • Load messages once and distribute to waiters (sketched below)
  • Or process waiters in parallel when they don't conflict
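
A rough sketch of the single-load approach; the waiter shape and names are illustrative, not the PR's:

interface WaiterSketch {
    nameSet: Set<string>;
    count: number;
    resolve: (messages: { name: string; body: unknown }[]) => void;
}

// Load the queue once, then hand matching messages out to each waiter in
// order, instead of re-reading storage per waiter.
function distributeMessages(
    messages: { name: string; body: unknown }[],
    waiters: WaiterSketch[],
): void {
    const remaining = [...messages];
    for (const waiter of waiters) {
        const matched: { name: string; body: unknown }[] = [];
        for (let i = 0; i < remaining.length && matched.length < waiter.count; ) {
            if (waiter.nameSet.has(remaining[i].name)) {
                matched.push(remaining.splice(i, 1)[0]);
            } else {
                i++;
            }
        }
        if (matched.length > 0) {
            waiter.resolve(matched);
        }
    }
}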

⚠️ Minor Issues

5. Inconsistent Return Types

Location: queue.ts:31-73

The overloaded next() function returns QueueMessage | undefined for single names but QueueMessage[] | undefined for arrays. The single-message case returns undefined when no messages are available, but it could also return an empty array on timeout (line 68).

Recommendation: Consider making the return type more consistent or adding clear JSDoc comments explaining the behavior.
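
One way to make the two cases explicit, sketched with JSDoc'd overloads (hypothetical; the actual signatures and timeout behavior in queue.ts may differ):

interface QueueMessage {
    name: string;
    body: unknown;
}

/** Wait for one message with the given name; resolves `undefined` on timeout. */
function next(name: string, opts?: { timeout?: number }): Promise<QueueMessage | undefined>;
/** Wait for messages matching any of `names`; resolves `undefined` on timeout, never `[]`. */
function next(names: string[], opts?: { timeout?: number }): Promise<QueueMessage[] | undefined>;
function next(
    nameOrNames: string | string[],
    opts?: { timeout?: number },
): Promise<QueueMessage | QueueMessage[] | undefined> {
    // Implementation elided; only the documented contract matters here.
    void nameOrNames;
    void opts;
    return Promise.resolve(undefined);
}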

6. Magic Number for Timeout

Location: fixtures/driver-test-suite/queue.ts:57

await c.queue.next("abort", { timeout: 10_000 });

The test uses a 10-second timeout, which seems arbitrary. Consider extracting it into a named constant or explaining why this specific value was chosen.

7. Missing Validation

Location: queue-manager.ts:156-161

The receive() function accepts names: string[] but doesn't validate that it's non-empty. An empty array would create a waiter that never resolves.

Recommendation: Add validation:

if (names.length === 0) {
    throw new Error("At least one queue name must be specified");
}

8. Metadata Size Mismatch Silent Recovery

Location: queue-manager.ts:274-277

if (this.#metadata.size !== decoded.length) {
    this.#metadata.size = decoded.length;
    this.#actor.inspector.updateQueueSize(this.#metadata.size);
}

Size mismatches are silently corrected without logging. This could hide persistent corruption issues.

Recommendation: Add a warning log when size mismatches are detected.
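
A possible shape for that warning, applied to the snippet above (assuming rLog exposes a warn level alongside the error calls used elsewhere in the PR):

if (this.#metadata.size !== decoded.length) {
    // Log before correcting so persistent corruption shows up in traces.
    this.#actor.rLog.warn({
        msg: "queue metadata size mismatch, correcting from stored messages",
        expected: this.#metadata.size,
        actual: decoded.length,
    });
    this.#metadata.size = decoded.length;
    this.#actor.inspector.updateQueueSize(this.#metadata.size);
}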

🎯 Performance Considerations

  1. Batch Operations: Good use of kvBatchPut and kvBatchDelete to minimize round trips
  2. CBOR Encoding: Appropriate for binary efficiency, though it adds CPU overhead
  3. Inspector Updates: Queue size is properly updated on every change for observability

🔒 Security Considerations

  1. Size Limits: Properly enforced at both queue level and message level
  2. Serialization Safety: isCborSerializable check prevents unsafe types
  3. No Injection Risks: Message names are used as-is without string interpolation

📝 Code Style & Conventions

  1. ✅ Uses tracing with structured logging (e.g., this.#actor.rLog.error({ msg: "...", error }))
  2. ✅ Private fields use # prefix
  3. ✅ Error messages are lowercase as per CLAUDE.md
  4. ✅ Good TypeScript typing throughout
  5. ⚠️ Some comments could be more sentence-like (e.g., line 129: "Update metadata before writing so we can batch both writes" vs "Updates metadata before writing to batch both writes")

🧪 Test Coverage

The test suite covers:

  • ✅ Client-to-actor messaging
  • ✅ Actor-to-self messaging
  • ✅ Array name filtering
  • ✅ Timeout behavior
  • ✅ Abort handling
  • ✅ Queue size limits
  • ✅ Message size limits

Missing test scenarios:

  • Multiple concurrent waiters for the same message name
  • Queue persistence across actor restarts
  • Metadata corruption recovery via #rebuildMetadata()
  • Very large queue sizes (stress testing)

📖 Documentation

Consider adding:

  1. JSDoc comments explaining the queue ordering guarantees (FIFO per message name?)
  2. Examples of common patterns (request/response via queues, fan-out, etc.)
  3. Performance characteristics and recommended queue size limits

Summary

This is a well-implemented feature with good fundamentals. The main concerns are:

  1. Critical: Potential race condition in concurrent enqueue operations needs clarification or fixing
  2. Important: Performance of loading all messages on every receive could be optimized
  3. Minor: Several small improvements to error handling and validation

The test coverage is solid, error handling is comprehensive, and the API design is intuitive. Great work overall! 🎉
