feat: eval-evolution work in progress - recommendations and workers UI #11444

mp-roocode · 2026-02-12T20:10:22Z

Work in progress on eval-evolution features including:

Updated methodology content
Workers UI improvements
Comparison chart enhancements
New workers-v2 pages
Eval outcomes utilities

Co-authored-by: roomote[bot] <219738659+roomote[bot]@users.noreply.github.com>

- Add 5 engineer roles: Junior, Senior, Staff, Architecture Reviewer, Autonomous Agent - Build role selection landing page with hiring metaphor at /evals/workers - Build candidate rankings page with tiered recommendations at /evals/workers/[roleId] - Build candidate comparison page with Recharts charts at /evals/workers/[roleId]/compare - Build "How We Interview" methodology page at /evals/methodology - Add mock data with real eval scores from 27 model runs - Implement "Hire This Engineer" CTA linking to Roo Code Cloud - Implement "Configure Extension" CTA with clipboard copy - Per-language score breakdowns (Go, Java, JS, Python, Rust) - Daily salary pricing (80 tasks/agent/day estimate) - framer-motion animations, glass-morphism design, role color themes - Tone-of-voice compliance (no em dashes, no hype, workflow-first copy) - vscode:// deep link design doc at plans/vscode-deep-link-design.md

…e themes - Atmospheric header with role-colored blur gradients - Glass-morphism containers for chart, filters, and export - Styled language toggle pills with role color accents - Themed provider checkboxes and success rate slider - Custom chart tooltip with backdrop blur - Export buttons with press feedback - framer-motion scroll-triggered animations - Bottom navigation with pill-style links - Role themes: reviewer (violet) and autonomous (cyan) added to candidates page

…line) - Add "Value Map: Salary vs Interview Score" scatter to comparison page - Dots colored by tier, sized by success rate - Sweet Spot quadrant highlight (upper-left) - Respects existing provider/success-rate filters - Add "AI Coding Capability Over Time" scatter to landing page - 10 models from Jun 2025 to Feb 2026 - Dots colored by provider, sized by cost efficiency - Dashed trend line showing upward trajectory - Add MODEL_TIMELINE data to mock-recommendations.ts

…lity

roomote · 2026-02-12T20:10:51Z

Rooviewer See task

Reviewed 268b183 (objective deep-dive pages, route restructuring under roles/, font layout). Both --font-display fixes hold via shared layout. Three open items remain.

--font-display CSS variable is undefined on the baseline /evals/workers path -- fixed by lifting font setup to shared evals/layout.tsx
Aggregate stat totalEvalRuns is still inflated -- totalExercises/totalModels are now unused but totalEvalRuns still sums identical per-role counts across all 5 roles (135 instead of 27)
--font-display CSS variable is undefined on the /evals/methodology path -- fixed by lifting font setup to shared evals/layout.tsx
recommendations/layout.tsx redundantly imports Fraunces/IBM Plex Sans and wraps content with font variables already provided by the shared evals/layout.tsx
copyPrompt in objective-content.tsx lacks visual feedback -- the existing clipboard pattern (copy-settings-button.tsx, comparison-chart.tsx) tracks copied state and shows a confirmation; the new callback fires without any UI response

Previous reviews

1d03382: Review #1

132f7af: Review #2

8cf93a7: Review #3

b3fb878: Review #4

310b10c: Review #5

_{Mention @roomote in a comment to request specific changes to this pull request or fix all unresolved issues.}

roomote · 2026-02-12T20:15:56Z

apps/web-roo-code/src/app/evals/workers/workers-content.tsx

+
+						{/* Heading */}
+						<motion.h1
+							className="mt-6 text-5xl font-semibold tracking-tight md:text-6xl lg:text-7xl [font-family:var(--font-display)]"


[font-family:var(--font-display)] is used here and at line 1024, but --font-display is only defined by the V2 page wrapper (workers-v2/page.tsx sets it via Fraunces font). When this component is rendered through the baseline path (/evals/workers), --font-display is undefined and these headings silently fall back to the inherited font instead of the intended display typeface. Either gate this class behind enableOutcomeLayer, or have the baseline workers/page.tsx also provide the font variable.

_{Fix it with Roo Code or mention @roomote and request a fix.}

roomote · 2026-02-12T20:16:07Z

apps/web-roo-code/src/app/evals/workers/page.tsx

+	const totalEvalRuns = recommendations.reduce((sum, r) => sum + r.totalEvalRuns, 0)
+	const totalExercises = recommendations.reduce((sum, r) => sum + r.totalExercises, 0)


totalEvalRuns and totalExercises are computed by summing across all 5 roles, but each role reports the same underlying pool of 27 eval runs and 120 exercises. This produces 135 runs / 600 exercises on the landing page instead of the actual 27 / 120. The hero stats bar and footer will show inflated numbers. Consider using the values from a single role (they're identical) or deduplicating before summing. Same issue exists in workers-v2/page.tsx at lines 64-65.

Suggested change

const totalEvalRuns = recommendations.reduce((sum, r) => sum + r.totalEvalRuns, 0)

const totalExercises = recommendations.reduce((sum, r) => sum + r.totalExercises, 0)

const totalEvalRuns = recommendations[0]?.totalEvalRuns ?? 0

const totalExercises = recommendations[0]?.totalExercises ?? 0

_{Fix it with Roo Code or mention @roomote and request a fix.}

roomote · 2026-02-13T03:26:12Z

apps/web-roo-code/src/app/evals/methodology/methodology-content.tsx

+
+								<motion.h1
+									variants={fadeUpVariants}
+									className="mt-4 text-4xl font-semibold tracking-tight md:text-5xl [font-family:var(--font-display)]">


[font-family:var(--font-display)] is used in 6 headings throughout this file, but methodology/page.tsx does not import Fraunces or provide the --font-display CSS variable, and no parent layout sets it either. This is the same issue that was fixed for workers/page.tsx in this PR (which now wraps content with the Fraunces font variable). The methodology page needs the same treatment -- either wrap <MethodologyContent /> in a div with the font variables, or lift the font setup to a shared evals layout.

_{Fix it with Roo Code or mention @roomote and request a fix.}

@roomote fix with shared layout

Fixaroo See task

Lifted Fraunces and IBM Plex Sans font setup to a shared evals/layout.tsx so --font-display and --font-body CSS variables are available to all evals sub-pages (methodology, workers, workers/[roleId], etc.) without duplication.

_{View commit | Revert commit}

@roomote pull latest changes from main into this branch

Fixaroo See task

Pulled latest changes from main into this branch. Merge completed cleanly with no conflicts.

_{View commit | Revert commit}

…s layout Move --font-display and --font-body CSS variable declarations from workers/page.tsx into a new evals/layout.tsx so all evals sub-pages (methodology, workers, workers/[roleId], etc.) inherit the font variables without duplicating the setup.

…ions

roomote · 2026-02-13T04:58:43Z

apps/web-roo-code/src/app/evals/recommendations/page.tsx

+const display = Fraunces({ subsets: ["latin"], variable: "--font-display" })
+const body = IBM_Plex_Sans({ subsets: ["latin"], weight: ["400", "500", "600"], variable: "--font-body" })


The shared evals/layout.tsx already imports Fraunces and IBM_Plex_Sans and wraps all evals children with the --font-display / --font-body CSS variables. These two declarations (and the font-variable wrapper div at line 82) duplicate that setup, producing two separate Next.js font optimization entries for the same fonts. workers/page.tsx doesn't do this -- it relies on the shared layout. Removing the imports, these two lines, and the wrapper div would align with the approach used by the other evals pages.

_{Fix it with Roo Code or mention @roomote and request a fix.}

roomote · 2026-02-13T05:27:17Z

apps/web-roo-code/src/app/evals/recommendations/[objectiveSlug]/objective-content.tsx

+	const copyPrompt = useCallback(async () => {
+		if (!examplePrompt) return
+		await navigator.clipboard.writeText(examplePrompt)
+	}, [examplePrompt])


copyPrompt fires navigator.clipboard.writeText without updating any state or providing visual feedback. The existing pattern in this codebase (copy-settings-button.tsx, comparison-chart.tsx) tracks a copied state, shows a checkmark icon and "Copied!" text, then resets after 2 seconds. Without similar feedback here, users clicking "Copy example prompt" have no confirmation that the copy succeeded (or failed).

_{Fix it with Roo Code or mention @roomote and request a fix.}

roomote and others added 10 commits February 11, 2026 15:46

feat(web): add llms.txt and llms-full.txt for Answer Engine Optimization

f458e5e

Update apps/web-roo-code/public/llms-full.txt

417abeb

Co-authored-by: roomote[bot] <219738659+roomote[bot]@users.noreply.github.com>

fix(web-evals): improve comparison chart bar spacing and height

336a672

refactor(web-evals): generalize methodology roles section for scalabi…

6c849fb

…lity

fix(web-evals): pluralize team members text and increase card row gap

514dadf

style(web-evals): remove tilde prefix from dollar amounts

3a6eaf8

feat: eval-evolution work in progress - recommendations and workers UI

310b10c

github-project-automation bot added this to Roo Code Roadmap Feb 12, 2026

github-project-automation bot moved this to New in Roo Code Roadmap Feb 12, 2026

mp-roocode self-assigned this Feb 12, 2026

roomote bot reviewed Feb 12, 2026

View reviewed changes

mp-roocode added 2 commits February 12, 2026 19:20

Make workers outcomes-first canonical; redirect v2; refresh methodology

d91c035

Fix workers-v2 redirect util imports

b3fb878

roomote bot reviewed Feb 13, 2026

View reviewed changes

roomote and others added 3 commits February 13, 2026 03:39

Merge remote-tracking branch 'origin/main' into feat/eval-recommendat…

132f7af

…ions

Refine evals objective selection and recommendation URL defaults

1d03382

roomote bot reviewed Feb 13, 2026

View reviewed changes

Add objective deep-dive pages under eval recommendations

268b183

roomote bot reviewed Feb 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: eval-evolution work in progress - recommendations and workers UI #11444

feat: eval-evolution work in progress - recommendations and workers UI #11444

mp-roocode commented Feb 12, 2026

Uh oh!

roomote bot commented Feb 12, 2026 •

edited

Loading

Uh oh!

roomote bot Feb 12, 2026

Uh oh!

roomote bot Feb 12, 2026

Uh oh!

roomote bot Feb 13, 2026

Uh oh!

schneidergithub Feb 13, 2026

Uh oh!

roomote bot Feb 13, 2026 •

edited

Loading

Uh oh!

schneidergithub Feb 13, 2026

Uh oh!

roomote bot Feb 13, 2026 •

edited

Loading

Uh oh!

roomote bot Feb 13, 2026

Uh oh!

roomote bot Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		const totalEvalRuns = recommendations.reduce((sum, r) => sum + r.totalEvalRuns, 0)
		const totalExercises = recommendations.reduce((sum, r) => sum + r.totalExercises, 0)

		const display = Fraunces({ subsets: ["latin"], variable: "--font-display" })
		const body = IBM_Plex_Sans({ subsets: ["latin"], weight: ["400", "500", "600"], variable: "--font-body" })

feat: eval-evolution work in progress - recommendations and workers UI #11444

Are you sure you want to change the base?

feat: eval-evolution work in progress - recommendations and workers UI #11444

Conversation

mp-roocode commented Feb 12, 2026

Uh oh!

roomote bot commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

roomote bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

roomote bot Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

roomote bot Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

schneidergithub Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

roomote bot Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

schneidergithub Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

roomote bot Feb 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

roomote bot Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

roomote bot Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

roomote bot commented Feb 12, 2026 •

edited

Loading

roomote bot Feb 13, 2026 •

edited

Loading

roomote bot Feb 13, 2026 •

edited

Loading