fix(maxun-core): robot browser recording crashes #811
Conversation
Walkthrough

Adds a public abort-state getter in the Interpreter, introduces batched persistence with timers/retries in WorkflowInterpreter, bounds Airtable/GSheet processing loops, hardens browser init/cleanup error handling, adds per-call .catch handlers for integration updates, and extends server pool and global process error handlers.
Sequence Diagram(s)

sequenceDiagram
autonumber
actor Runner
participant WI as WorkflowInterpreter
participant DB as Database
Runner->>WI: interpret(items...)
WI->>WI: addToPersistenceBatch(item)
WI->>WI: scheduleBatchFlush(timer/size)
opt Batch trigger (timeout or size)
WI->>DB: flushPersistenceBuffer() (transaction)
alt success
DB-->>WI: committed
else failure
WI->>WI: retry with backoff
WI->>DB: re-attempt
end
end
Runner->>WI: clearState() / stop()
WI->>DB: flush remaining buffer
WI-->>Runner: stopped
sequenceDiagram
autonumber
actor Orchestrator
participant Ctrl as BrowserController
participant Session as BrowserSession
Orchestrator->>Ctrl: createRemoteBrowserForRun()
Ctrl->>Session: initializeBrowserAsync()
alt init OK
Session-->>Ctrl: initialized
else init fails
Ctrl->>Session: switchOff() (attempt cleanup)
note right of Ctrl: log cleanup result
Ctrl-->>Orchestrator: rethrow original error
end
Orchestrator->>Ctrl: destroyRemoteBrowser()
Ctrl->>Ctrl: try { cleanup namespace } catch (err) { log error }
sequenceDiagram
autonumber
actor Trigger
participant Scheduler as Scheduler/API/Worker
participant Airtable as processAirtableUpdates()
participant GSheet as processGoogleSheetUpdates()
Trigger->>Scheduler: triggerIntegrationUpdates()
Scheduler->>Airtable: processAirtableUpdates().catch(log)
Scheduler->>GSheet: processGoogleSheetUpdates().catch(log)
note right of Airtable: runs bounded window (60s)
note right of GSheet: runs bounded window (60s)
Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60–75 minutes
Pre-merge checks and finishing touches

❌ Failed checks (2 warnings)
✅ Passed checks (3 passed)
Actionable comments posted: 3
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (9)
maxun-core/src/interpret.ts (1 hunks)
server/src/api/record.ts (1 hunks)
server/src/browser-management/controller.ts (3 hunks)
server/src/pgboss-worker.ts (1 hunks)
server/src/server.ts (2 hunks)
server/src/workflow-management/classes/Interpreter.ts (5 hunks)
server/src/workflow-management/integrations/airtable.ts (1 hunks)
server/src/workflow-management/integrations/gsheet.ts (3 hunks)
server/src/workflow-management/scheduler/index.ts (1 hunks)
🧰 Additional context used
🧬 Code graph analysis (5)
server/src/workflow-management/scheduler/index.ts (2)
  server/src/workflow-management/integrations/airtable.ts (1): processAirtableUpdates (456-500)
  server/src/workflow-management/integrations/gsheet.ts (1): processGoogleSheetUpdates (288-330)
server/src/api/record.ts (2)
  server/src/workflow-management/integrations/airtable.ts (1): processAirtableUpdates (456-500)
  server/src/workflow-management/integrations/gsheet.ts (1): processGoogleSheetUpdates (288-330)
server/src/browser-management/controller.ts (1)
  server/src/server.ts (2): browserPool (91-91), io (86-86)
server/src/workflow-management/classes/Interpreter.ts (1)
  server/src/db/models/index.js (1): sequelize (16-16)
server/src/pgboss-worker.ts (2)
  server/src/workflow-management/integrations/airtable.ts (1): processAirtableUpdates (456-500)
  server/src/workflow-management/integrations/gsheet.ts (1): processGoogleSheetUpdates (288-330)
🔇 Additional comments (4)

maxun-core/src/interpret.ts (1)

126-131: Getter addition looks good.
Thanks for exposing the abort flag cleanly; this keeps callers from poking at private state while preserving existing semantics.
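For readers skimming the diff, the shape of such a getter is roughly the sketch below; the actual field and getter names in interpret.ts may differ.

```typescript
class Interpreter {
  // Private abort flag flipped when a stop is requested (names assumed for illustration).
  private aborted = false;

  // Public read-only view so callers can check abort state without reaching into private fields.
  public get isAborted(): boolean {
    return this.aborted;
  }

  public stop(): void {
    this.aborted = true;
  }
}
```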
server/src/server.ts (2)

40-45: Stronger pool configuration.
Nice to see the pool tuned with sensible caps and timeouts; this should curb connection leaks during spikes.
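For context, a Sequelize pool tuned this way typically looks like the sketch below; the exact values at server.ts lines 40-45 may differ.

```typescript
import { Sequelize } from 'sequelize';

// Illustrative pool settings only, not the actual values in server/src/server.ts.
const sequelize = new Sequelize(process.env.DATABASE_URL ?? '', {
  pool: {
    max: 10,        // cap concurrent connections so spikes can't exhaust Postgres
    min: 0,         // let the pool drain completely when idle
    acquire: 30000, // ms to wait for a free connection before throwing
    idle: 10000,    // ms before an idle connection is released
  },
});
```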
224-238: Appreciate the global fail-safes.
Catching unhandled rejections/exceptions here gives us at least one clean log line before restarting, and the delayed exit in production is a good compromise.
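The general pattern (handler bodies here are a sketch, not the exact code at lines 224-238) is:

```typescript
// Log once, then exit after a short delay in production so the process manager can restart cleanly.
process.on('unhandledRejection', (reason) => {
  console.error('Unhandled rejection:', reason);
});

process.on('uncaughtException', (error) => {
  console.error('Uncaught exception:', error);
  if (process.env.NODE_ENV === 'production') {
    setTimeout(() => process.exit(1), 1000);
  }
});
```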
server/src/workflow-management/integrations/gsheet.ts (1)

289-330: Bounded loop plus cleanup FTW.
The 60‑second window, retry pruning, and auto-purging of terminal states should keep this worker from pinning the event loop forever while still giving pending tasks a few chances.
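In spirit, the bounded window behaves like this sketch; the task shape, helper name, and constants are assumptions rather than the actual gsheet.ts code.

```typescript
type TaskStatus = 'pending' | 'completed' | 'failed';
interface UpdateTask { status: TaskStatus; retries: number; }

const MAX_RETRIES = 3;
const WINDOW_MS = 60_000;

// Stand-in for the real Google Sheets push; only here so the sketch compiles.
async function pushUpdate(runId: string): Promise<void> {
  console.log(`would push update for ${runId}`);
}

async function processUpdatesBounded(tasks: Record<string, UpdateTask>): Promise<void> {
  const deadline = Date.now() + WINDOW_MS;

  while (Date.now() < deadline) {
    const pending = Object.entries(tasks).filter(([, t]) => t.status === 'pending');
    if (pending.length === 0) break; // nothing left to do inside this window

    for (const [runId, task] of pending) {
      try {
        await pushUpdate(runId);
        task.status = 'completed';
      } catch {
        task.retries += 1;
        if (task.retries >= MAX_RETRIES) task.status = 'failed';
      }
    }

    // Purge terminal states so the task map cannot grow without bound.
    for (const [runId, task] of Object.entries(tasks)) {
      if (task.status !== 'pending') delete tasks[runId];
    }
  }
}
```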
try {
  const namespace = io.of(id);
  namespace.removeAllListeners();
  namespace.disconnectSockets(true);
  logger.log('debug', `Cleaned up socket namespace for browser ${id}`);
} catch (namespaceCleanupError: any) {
  logger.log('warn', `Error cleaning up socket namespace for browser ${id}: ${namespaceCleanupError.message}`);
}
Clean up namespace registry as well.

removeAllListeners() plus disconnectSockets(true) stops activity, but the namespace entry stays in io._nsps, so every run leaves a UUID‑named namespace hanging around. Given we create fresh IDs per run, that map grows without bound and retains adapters/listeners, undermining the resource-leak fix this PR is aiming for. Please drop the namespace from the server registry once cleanup succeeds (Socket.IO 4 exposes the map at io._nsps). One straightforward approach:

  const namespace = io.of(id);
  namespace.removeAllListeners();
  namespace.disconnectSockets(true);
  logger.log('debug', `Cleaned up socket namespace for browser ${id}`);
+ const namespaces = (io as any)._nsps as Map<string, unknown> | undefined;
+ namespaces?.delete(namespace.name);

That keeps us aligned with the recommended disconnect‑then‑delete flow for dynamic namespaces and prevents the process from accumulating stale entries over long uptimes. Based on learnings.
📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
try {
  const namespace = io.of(id);
  namespace.removeAllListeners();
  namespace.disconnectSockets(true);
  logger.log('debug', `Cleaned up socket namespace for browser ${id}`);
  const namespaces = (io as any)._nsps as Map<string, unknown> | undefined;
  namespaces?.delete(namespace.name);
} catch (namespaceCleanupError: any) {
  logger.log('warn', `Error cleaning up socket namespace for browser ${id}: ${namespaceCleanupError.message}`);
}
🤖 Prompt for AI Agents
In server/src/browser-management/controller.ts around lines 118 to 125, after
successfully removing listeners and disconnecting sockets for the namespace,
also remove the namespace entry from the server registry (io._nsps) to avoid
accumulating stale UUID namespaces: after namespace.disconnectSockets(true)
check that the registry entry exists and call the appropriate delete method
(io._nsps.delete(id) for a Map or delete io._nsps[id] for a plain object) so the
namespace is removed only on successful cleanup, and keep this inside the try
block before logging the cleanup complete message.
private async flushPersistenceBuffer(): Promise<void> {
  if (this.persistenceBuffer.length === 0 || this.persistenceInProgress || !this.currentRunId) {
    return;
  }

  if (this.persistenceTimer) {
    clearTimeout(this.persistenceTimer);
    this.persistenceTimer = null;
  }

  this.persistenceInProgress = true;
  const batchToProcess = [...this.persistenceBuffer];
  this.persistenceBuffer = [];

  try {
    const sequelize = require('../../storage/db').default;
    await sequelize.transaction(async (transaction: any) => {
      const { Run } = require('../../models');
      const run = await Run.findOne({
        where: { runId: this.currentRunId! },
        transaction
      });

      if (!run) {
        logger.log('warn', `Run not found for batched persistence: ${this.currentRunId}`);
        return;
      }

      const currentSerializableOutput = run.serializableOutput ?
        JSON.parse(JSON.stringify(run.serializableOutput)) :
        { scrapeSchema: [], scrapeList: [] };

      let hasUpdates = false;

      for (const item of batchToProcess) {
        if (item.actionType === 'scrapeSchema') {
          const newSchemaData = Array.isArray(item.data) ? item.data : [item.data];
          currentSerializableOutput.scrapeSchema = newSchemaData;
          hasUpdates = true;
        } else if (item.actionType === 'scrapeList' && typeof item.listIndex === 'number') {
          if (!Array.isArray(currentSerializableOutput.scrapeList)) {
            currentSerializableOutput.scrapeList = [];
          }
          currentSerializableOutput.scrapeList[item.listIndex] = item.data;
          hasUpdates = true;
        }
      }

      if (hasUpdates) {
        await run.update({
          serializableOutput: currentSerializableOutput
        }, { transaction });

        logger.log('debug', `Batched persistence: Updated run ${this.currentRunId} with ${batchToProcess.length} items`);
      }
    });

    this.persistenceRetryCount = 0;

  } catch (error: any) {
    logger.log('error', `Failed to flush persistence buffer for run ${this.currentRunId}: ${error.message}`);

    if (!this.persistenceRetryCount) {
      this.persistenceRetryCount = 0;
    }

    if (this.persistenceRetryCount < 3) {
      this.persistenceBuffer.unshift(...batchToProcess);
      this.persistenceRetryCount++;

      const backoffDelay = Math.min(5000 * Math.pow(2, this.persistenceRetryCount), 30000);
      setTimeout(async () => {
        await this.flushPersistenceBuffer();
      }, backoffDelay);

      logger.log('warn', `Scheduling persistence retry ${this.persistenceRetryCount}/3 in ${backoffDelay}ms`);
    } else {
      logger.log('error', `Max persistence retries exceeded for run ${this.currentRunId}, dropping ${batchToProcess.length} items`);
      this.persistenceRetryCount = 0;
    }
  } finally {
    this.persistenceInProgress = false;
  }
};
Fix batching deadlock when a flush overlaps new writes.

If new data lands during an in-flight flushPersistenceBuffer, persistDataToDatabase skips scheduleBatchFlush() because persistenceInProgress is still true. Once the current flush completes, nothing wakes up the remaining buffer, so those items sit in memory indefinitely and never reach the DB. Please queue another flush after the current transaction ends.
Apply this diff to reschedule once the flush finishes:

  } finally {
    this.persistenceInProgress = false;
+
+   if (this.persistenceBuffer.length > 0 && !this.persistenceTimer) {
+     this.scheduleBatchFlush();
+   }
  }
📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
private async flushPersistenceBuffer(): Promise<void> {
  if (this.persistenceBuffer.length === 0 || this.persistenceInProgress || !this.currentRunId) {
    return;
  }
  if (this.persistenceTimer) {
    clearTimeout(this.persistenceTimer);
    this.persistenceTimer = null;
  }
  this.persistenceInProgress = true;
  const batchToProcess = [...this.persistenceBuffer];
  this.persistenceBuffer = [];
  try {
    const sequelize = require('../../storage/db').default;
    await sequelize.transaction(async (transaction: any) => {
      const { Run } = require('../../models');
      const run = await Run.findOne({
        where: { runId: this.currentRunId! },
        transaction
      });
      if (!run) {
        logger.log('warn', `Run not found for batched persistence: ${this.currentRunId}`);
        return;
      }
      const currentSerializableOutput = run.serializableOutput
        ? JSON.parse(JSON.stringify(run.serializableOutput))
        : { scrapeSchema: [], scrapeList: [] };
      let hasUpdates = false;
      for (const item of batchToProcess) {
        if (item.actionType === 'scrapeSchema') {
          const newSchemaData = Array.isArray(item.data) ? item.data : [item.data];
          currentSerializableOutput.scrapeSchema = newSchemaData;
          hasUpdates = true;
        } else if (item.actionType === 'scrapeList' && typeof item.listIndex === 'number') {
          if (!Array.isArray(currentSerializableOutput.scrapeList)) {
            currentSerializableOutput.scrapeList = [];
          }
          currentSerializableOutput.scrapeList[item.listIndex] = item.data;
          hasUpdates = true;
        }
      }
      if (hasUpdates) {
        await run.update(
          { serializableOutput: currentSerializableOutput },
          { transaction }
        );
        logger.log(
          'debug',
          `Batched persistence: Updated run ${this.currentRunId} with ${batchToProcess.length} items`
        );
      }
    });
    this.persistenceRetryCount = 0;
  } catch (error: any) {
    logger.log(
      'error',
      `Failed to flush persistence buffer for run ${this.currentRunId}: ${error.message}`
    );
    if (!this.persistenceRetryCount) {
      this.persistenceRetryCount = 0;
    }
    if (this.persistenceRetryCount < 3) {
      this.persistenceBuffer.unshift(...batchToProcess);
      this.persistenceRetryCount++;
      const backoffDelay = Math.min(
        5000 * Math.pow(2, this.persistenceRetryCount),
        30000
      );
      setTimeout(async () => {
        await this.flushPersistenceBuffer();
      }, backoffDelay);
      logger.log(
        'warn',
        `Scheduling persistence retry ${this.persistenceRetryCount}/3 in ${backoffDelay}ms`
      );
    } else {
      logger.log(
        'error',
        `Max persistence retries exceeded for run ${this.currentRunId}, dropping ${batchToProcess.length} items`
      );
      this.persistenceRetryCount = 0;
    }
  } finally {
    this.persistenceInProgress = false;
    // If new items arrived during the flush, ensure we schedule another flush
    if (this.persistenceBuffer.length > 0 && !this.persistenceTimer) {
      this.scheduleBatchFlush();
    }
  }
};
🤖 Prompt for AI Agents
In server/src/workflow-management/classes/Interpreter.ts around lines 611-694,
the finally block of flushPersistenceBuffer clears persistenceInProgress but
doesn't reschedule a flush if new items were added while the flush ran, causing
items to be left in memory; after setting this.persistenceInProgress = false add
logic to check if this.persistenceBuffer.length > 0 and there is no existing
persistenceTimer, and if so call this.scheduleBatchFlush() (or otherwise
schedule a timeout) to queue another flush; ensure you don't double-schedule by
checking persistenceTimer or a similar flag before scheduling.
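For context on the interplay the comment above relies on, the producer side is assumed to behave roughly like this sketch; field and method names mirror the review thread, not the actual WorkflowInterpreter implementation.

```typescript
interface PersistenceItem {
  actionType: 'scrapeSchema' | 'scrapeList';
  data: unknown;
  listIndex?: number;
}

class BatchingSketch {
  private persistenceBuffer: PersistenceItem[] = [];
  private persistenceInProgress = false;
  private persistenceTimer: NodeJS.Timeout | null = null;

  // Producer: buffers an item and schedules a flush unless one is already in flight,
  // which is exactly why the finally block must reschedule when items arrive mid-flush.
  persistDataToDatabase(item: PersistenceItem): void {
    this.persistenceBuffer.push(item);
    if (!this.persistenceInProgress) {
      this.scheduleBatchFlush();
    }
  }

  private scheduleBatchFlush(): void {
    if (this.persistenceTimer) return; // avoid double-scheduling
    this.persistenceTimer = setTimeout(() => {
      this.persistenceTimer = null;
      void this.flushPersistenceBuffer();
    }, 1000); // batching delay chosen for illustration
  }

  private async flushPersistenceBuffer(): Promise<void> {
    // See the method under review above for the real flush logic.
  }
}
```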
if (task.retries < MAX_RETRIES) {
  airtableUpdateTasks[runId].retries += 1;
  console.log(`Retrying task for runId: ${runId}, attempt: ${task.retries + 1}`);
} else {
  airtableUpdateTasks[runId].status = 'failed';
  console.log(`Max retries reached for runId: ${runId}. Marking task as failed.`);
  logger.log('error', `Permanent failure for run ${runId}: ${error.message}`);
  console.log(`Max retries reached for runId: ${runId}. Removing task.`);
  delete airtableUpdateTasks[runId];
}
Fix retry counter logic to match new MAX_RETRIES guard.

Each scheduled task is still seeded with retries: 5 in triggerIntegrationUpdates (see server/src/pgboss-worker.ts, Lines 91-103). Because this block now checks task.retries < MAX_RETRIES and increments upward, freshly scheduled tasks immediately fail the guard (5 < 3 is false), so the first transient Airtable error is treated as “max retries reached.” That’s a regression from prior behavior and drops resiliency for all Airtable runs. Please align the counter semantics: the easiest fix is to initialize both Airtable and Google Sheets tasks with retries: 0 so the counter reflects attempts used, or otherwise adjust the conditional to work with the existing seed value. Example patch for the initializer:

- retries: 5,
+ retries: 0,

Apply the same change for the Google Sheets task to keep behavior consistent. After that, this guard will actually allow the intended three retry attempts before capping out.
Committable suggestion skipped: line range outside the PR's diff.
🤖 Prompt for AI Agents
In server/src/workflow-management/integrations/airtable.ts around lines 477-483
the retry guard treats freshly scheduled tasks as already maxed out because
tasks are seeded with retries: 5 elsewhere; fix by updating the task seeding in
server/src/pgboss-worker.ts (around lines 91-103) to initialize retries: 0 for
both the Airtable and Google Sheets tasks so the existing check task.retries <
MAX_RETRIES works as intended; apply the same initialization change for the
Google Sheets task to keep behavior consistent.
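A possible shape for the corrected initializer is sketched below; apart from status and retries, which the guard above actually reads, the field names are assumptions.

```typescript
interface IntegrationUpdateTask {
  status: 'pending' | 'completed' | 'failed';
  retries: number; // counts attempts used, so it starts at 0
}

const airtableUpdateTasks: Record<string, IntegrationUpdateTask> = {};
const googleSheetUpdateTasks: Record<string, IntegrationUpdateTask> = {};

function triggerIntegrationUpdates(runId: string): void {
  // Seed both tasks with retries: 0 so `task.retries < MAX_RETRIES` permits
  // the intended three retry attempts before capping out.
  airtableUpdateTasks[runId] = { status: 'pending', retries: 0 };
  googleSheetUpdateTasks[runId] = { status: 'pending', retries: 0 };
}
```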
Actionable comments posted: 1
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
maxun-core/src/interpret.ts (5 hunks)
🧰 Additional context used
🧬 Code graph analysis (1)
maxun-core/src/interpret.ts (1)
  maxun-core/src/browserSide/scraper.js (1): selector (28-28)
let args = step.args;

if (Array.isArray(args) && args.length === 1) {
  args = [args[0], { timeout: 30000 }];
} else if (!Array.isArray(args)) {
  args = [args, { timeout: 30000 }];
}
await executeAction(invokee, methodName, step.args);
Restore the 30 s timeout for waitForLoadState.

We build the normalized args array but still call executeAction with the original step.args, so the timeout never actually takes effect. The interpreter can still hang indefinitely, defeating the purpose of this change. Please pass the normalized args through.

- let args = step.args;
+ let args = step.args;
  if (Array.isArray(args) && args.length === 1) {
    args = [args[0], { timeout: 30000 }];
  } else if (!Array.isArray(args)) {
    args = [args, { timeout: 30000 }];
  }
- await executeAction(invokee, methodName, step.args);
+ await executeAction(invokee, methodName, args);
📝 Committable suggestion
‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
let args = step.args;
if (Array.isArray(args) && args.length === 1) {
  args = [args[0], { timeout: 30000 }];
} else if (!Array.isArray(args)) {
  args = [args, { timeout: 30000 }];
}
await executeAction(invokee, methodName, args);
🤖 Prompt for AI Agents
In maxun-core/src/interpret.ts around lines 620 to 627, the code builds a
normalized args array with a 30s timeout but still calls executeAction with the
original step.args, so the timeout is never applied; update the executeAction
call to pass the normalized args variable (e.g., args) instead of step.args so
the timeout is used, preserving the existing normalization logic and behavior.
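Once the normalized args are passed through, the dispatched call is effectively equivalent to a bounded Playwright wait like the one below (URL and load state chosen purely for illustration):

```typescript
import { chromium } from 'playwright';

async function demo(): Promise<void> {
  const browser = await chromium.launch();
  const page = await browser.newPage();
  await page.goto('https://example.com');

  // Equivalent effect of executeAction(page, 'waitForLoadState', args):
  // the wait is capped at 30 s instead of hanging indefinitely.
  await page.waitForLoadState('networkidle', { timeout: 30000 });

  await browser.close();
}

demo().catch(console.error);
```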
What does this PR do?
Fixes #798
Fixes #799
Fixes #800
Fixes #801
Fixes #802
Summary by CodeRabbit
New Features
Bug Fixes
Chores