Copilot/add cloudstack pr health check by borisstoyanov · Pull Request #2 · shapeblue/QA-Portal

borisstoyanov · 2025-12-30T10:01:18Z

No description provided.

Issue: Scraper was only fetching first 100 PRs, causing count mismatch - Database showed 229 open PRs - GitHub actually has 222 open PRs - 7 PRs were closed but not updated (they were beyond the first 100) Changes: - Modified fetchOpenPRs() to paginate through all pages of results - Adds 200ms delay between page requests to be API-friendly - Now correctly fetches all open PRs regardless of count - Logs total count and number of pages fetched Also ran update-pr-states.js to sync stale PRs: - Found and closed 7 PRs that were already closed on GitHub - Database now matches GitHub exactly (222 open PRs)

Features Added: - Test Failures Summary page with comprehensive analytics - Automatic classification of failures (common vs unique) - Database schema for storing individual test failures - Scraper integration to parse and store failures automatically Backend (API): - GET /api/test-failures/summary - Dashboard statistics and insights - GET /api/prs/:prNumber/test-failures - Failures for specific PR - GET /api/test-failures/test/:testName - Full test failure history Frontend: - New 'Test Failures' tab in main navigation - TestFailuresSummary component with: * Statistics dashboard (1,038 failures, 340 unique tests, 77 PRs) * Most Common Failures table (identifies flaky tests) * Recent Failures (last 7 days) with amber/red classification * Failures by Hypervisor platform analysis Classification Logic: - Amber badge: Failure seen in 2+ other PRs (flaky test/infra issue) Key Insights: - test_03_deploy_and_scale_kubernetes_cluster fails in 25 PRs (FLAKY) - test_01_vpn_usage fails in 14 PRs (FLAKY) - test_01_migrate_vm_strict_tags_success fails in 13 PRs (FLAKY) This helps developers distinguish between: 1. Real bugs introduced by their changes (red) 2. Pre-existing flaky tests/infrastructure issues (amber) Database: - Created test_failures table with indexes - Parsed 1,038 existing failures from 77 PRs - 340 unique failing tests identified Scripts: - parse-test-failures.js: Initial data extraction - Updated scrape-github-prs.js: Auto-parse new failures Impact: - Faster PR reviews (ignore amber, focus on red) - Identify tests needing fixes (top flaky tests visible) - Better developer experience (don't blame for infra issues)

- Added filter to skip hypervisor entries with null/undefined platform - Added null coalescing for hypervisor and hypervisor_version fields - Prevents 'cannot read property toUpperCase of null' error - Client rebuilt successfully

Features: - New TestFailureDetail component shows full history of a test failure - Click on any test name in TestFailuresSummary to see details - History table shows all occurrences sorted by date (latest first) - Displays: Date, PR number, Platform, Result, Duration, Logs link Information Shown: - Statistics (total occurrences, PRs affected, platforms, date range) - Affected hypervisors with badges - Intelligent analysis (flaky test vs potential bug) - Recommendations based on failure pattern - Trend analysis (recent activity, duration trends) Latest occurrence highlighted with special styling Installed: - react-router-dom@6 for navigation - Set up BrowserRouter in index.tsx - Added useNavigate hooks for navigation This helps developers: - See if a test failure is consistent across PRs - Check if recent PRs might have fixed the issue - Access test logs for debugging - Understand test reliability patterns Build: ✅ Success

Issue: Clicking on test names did nothing Root Cause: - Routes were imported but not properly integrated - App.tsx was still using conditional rendering instead of routing - Navigation paths were absolute instead of relative Fixes: - Created TestFailuresRouter wrapper component - Routes relative to /test-failures base path - Added useEffect to sync activeTab with URL changes - Updated navigation to use relative paths - Integrated Router into test-failures tab How it works: - /test-failures → Shows TestFailuresSummary - /test-failures/:testName → Shows TestFailureDetail - Click test name → Navigates to detail page - Back button → Returns to summary Build: ✅ Success (92.49 kB)

Issue: Clicking 'Test Failures' tab opened detail page instead of summary Root Cause: - Router was catching '/:testName' for the base /test-failures path - Any path under /test-failures was treated as a test name Fix: - Changed routes to use 'index' for summary page - Changed detail route to 'detail/:testName' - Updated navigation to use 'detail/' prefix - Updated back button to navigate to /test-failures URL Structure Now: - /test-failures → Summary dashboard ✅ - /test-failures/detail/test_name → Test detail page ✅ Navigation Flow: 1. Click 'Test Failures' tab → Shows summary 2. Click test name → Navigates to detail/test_name 3. Click back button → Returns to /test-failures summary

…ed Routes Issue: Test failures tab displays nothing Root Cause: - Nested Routes inside conditional rendering wasn't working properly - Router context wasn't being propagated correctly - Index route wasn't matching as expected Fix: - Replaced nested Routes with simple conditional rendering - Check location.pathname directly - If path is /test-failures → show TestFailuresSummary - If path is /test-failures/detail/:testName → show TestFailureDetail - Simpler, more reliable approach This should now work correctly: - /test-failures → Summary dashboard - /test-failures/detail/test_name → Detail page

Changed from navigation to inline expansion: Before: Click test name → Navigate to new page After: Click test name → Expand to show history inline Features: - Click on any test in 'Most Common Failures' to expand - Shows latest 10 failure runs inline - Details displayed: Date, Time Ago, PR, Platform, Result, Duration, Logs - Latest failure highlighted with blue background - Hover effect on expandable rows - Click again to collapse - No page navigation required Benefits: - Faster to browse multiple tests - See history without losing context - Better UX for comparing failures - Cleaner, more compact interface UI Elements: - ▶ icon when collapsed - ▼ icon when expanded - Blue highlight on expanded row - History table with all details - Links to PR and logs open in new tab - Shows 'Latest' badge on most recent failure Build: ✅ Success

Error: item.time_seconds.toFixed is not a function Root Cause: - time_seconds can be null or undefined from database - Calling toFixed() on null throws error Fix: - Added proper null check: item.time_seconds != null && typeof === 'number' - Only call toFixed(1) if value is actually a number - Shows 'N/A' for null/undefined/non-number values Build: ✅ Success

Issue: First row of history table misaligned due to left border Root Cause: - Latest row had border-left: 3px which pushed content - This caused misalignment with table headers Fix: - Removed border-left from latest row - Added 🆕 emoji indicator before first cell instead - Maintains visual distinction without misalignment - All columns now properly aligned Visual Result: - Latest row: Blue background + 🆕 emoji + bold text - Other rows: Normal styling - All columns aligned perfectly Build: ✅ Success

Issue: 🆕 icon appears twice in latest row Root Cause: - Duplicate CSS rule for .history-table tbody tr.latest td:first-child - One rule adds ::before with 🆕 emoji - Another duplicate rule was added later causing conflict Fix: - Removed duplicate CSS rules at end of file - Now only one clean set of rules for latest row: * Blue background * Bold text * 🆕 emoji before first cell (via ::before) Result: - 🆕 icon appears only once - Table columns aligned properly - Clean visual indication of latest failure Build: ✅ Success

Removed the ::before CSS rule that was adding the icon. Latest row now shows: - Blue background (to indicate it's the most recent) - Bold text - No icon - All columns properly aligned Build: ✅ Success

The icon was defined in TestFailureDetail.css (old detail page CSS) which was still being loaded. Removed: .history-table tbody tr.latest::before { content: '🆕'; position: absolute; margin-left: -1.5rem; } Now the latest row is clean: - Blue background - Bold text - No icon - Properly aligned Build: ✅ Success

Changes: - Renamed database table: test_failures → test_results - Updated all references in server and scraper - Added CSS styling for success badge (green) - Parser already captures ALL results (not just failures) Current state: - Table renamed and ready - Server code updated - Client CSS has success badge styling - Scraper needs to be re-run to populate Success results Next step: - Re-run scraper on existing PRs to parse Success results - Current data only has Error and Failure (2,189 entries) - After re-parse, will have Success entries too History expand will then show: - ✅ Success (green badge) - ❌ Error (red badge) - ⚠️ Failure (orange badge) Build: ✅ Success

Features: 1. **Detection Logic** - Queries last failure PR and last success PR for each test - Marks test as 'potentially_fixed' if success PR > failure PR - Only considers tests from last 30 days (auto-cleanup old data) 2. **Visual Indicator** - Green '✓ Potentially Fixed' badge on flaky tests - Pulsing animation to draw attention - Tooltip shows: 'Passed in PR #X after failing in PR #Y' 3. **Auto-Cleanup** - Query filters: WHERE test_date >= DATE_SUB(NOW(), INTERVAL 1 MONTH) - Tests not failed in >1 month automatically removed from list - Keeps dashboard relevant and focused 4. **Badge Removal** - Badge disappears if test fails again in newer PR - Logic: last_success_pr > last_failure_pr - Automatic - no manual intervention needed How it works: - Scraper stores ALL results (Success, Error, Failure) - Server compares PR numbers to detect fixes - Client shows green badge if fixed - Badge removed automatically if fails again Benefits: - Quickly identify which flaky tests have been fixed - See PR number that fixed it - Validate fixes before removing from monitoring - Automatic cleanup of old data Note: Currently no Success results in DB yet. Once scraper populates Success entries, badges will appear. Build: ✅ Success

Issue: time_seconds comes from API as string (e.g. '32.50') but TypeScript interface expected number Fix: - Updated interface: time_seconds: number | string | null - Updated parsing: parseFloat(String(item.time_seconds)) - Handles both string and number types safely Now properly displays duration for all results including Success Build: ✅ Success

Rationale: - Error = infrastructure/setup issue (not a test failure) - Failure = actual test failure - Success = test passed Changes: 1. Updated all queries to exclude result = 'Error' - Most Common Failures query - Recent Failures query - Statistics query - By Hypervisor breakdown 2. Kept Error in individual test history - When user expands a test, show full history - Helps diagnose infrastructure issues per test 3. Updated logic for 'potentially_fixed' - Now only tracks Failure → Success - Not Error → Success (since Error != test failure) Results: - Cleaner failure tracking (only actual test failures) - Statistics show real test issues - Error results still visible in expanded history Before: 2,518 results (including Errors) After: 2,710 results (Failures + Successes only) Stats now accurately reflect test failures, not setup issues. Build: ✅ Success

Problem: - PR #12194 had early failures, then successful reruns - Old logic counted all results, inflating failure counts - Example: test fails on first run, passes on rerun → counted as failure Solution: Updated 'Most Common Failures' query to use subquery: 1. Inner query: Get MAX(test_date) per (pr_number, test_name) 2. Join: Only include results matching that latest date 3. Result: Each PR+test counted only once (latest result) Logic: Benefits: ✅ Accurate failure counts (latest run only) ✅ PR reruns don't inflate statistics ✅ Shows current state, not historical noise ✅ 'Potentially Fixed' badge works correctly Example: - PR #12194: test fails early → passes on rerun Statistics now reflect current test health, not history. Build: ✅ Success

- Add MULTI_INSTANCE_DB_CONSTRAINTS.md with detailed analysis - Update all setup guides to warn against running scrapers locally - Document scraper scripts must run on single production instance only - Add multi-instance deployment section to DEPLOYMENT.md - Explain safe (web app) vs unsafe (scrapers) operations - Add horizontal scaling architecture guidance

Copilot

Pull request overview

This PR adds comprehensive CloudStack PR health check functionality with test failure tracking, analysis, and visualization features.

Key Changes:

Added flaky test detection and analysis system with API endpoints and UI components
Integrated React Router for navigation to new test failure views
Added database maintenance scripts for duplicate prevention and summary updates
Enhanced PR scraper to parse and store detailed test results from Trillian comments
Created deployment automation and local development setup scripts

Reviewed changes

Copilot reviewed 27 out of 29 changed files in this pull request and generated 9 comments.

Show a summary per file

File	Description
server/src/index.ts	Added 4 API endpoints for test failure data and health check formatting
client/src/App.tsx	Integrated React Router and added test failures navigation tab
client/src/index.tsx	Wrapped App with BrowserRouter for routing support
client/src/components/TestResults.tsx	New component displaying flaky tests grouped by file with expandable sections
client/src/components/TestResults.css	Styling for test results display with responsive design
client/src/components/TestFailuresRouter.tsx	Simple router component for test failures section
client/src/components/TestFailuresSummary.css	Comprehensive styling for test failure summary views
client/src/components/TestFailureDetail.css	Styling for individual test failure detail pages
client/package.json	Added react-router-dom dependency
scripts/scrape-github-prs.js	Enhanced to parse test results from Trillian comments and download/parse zip artifacts
scripts/update-flaky-tests-summary.js	New script for hourly aggregation of flaky test statistics
scripts/parse-test-failures.js	New script to parse test failures from Trillian comments
scripts/setup-local.sh	New automated local development setup script
scripts/deploy.sh	New comprehensive deployment automation script
scripts/deduplicate-test-results.sh	New script for cleaning duplicate test results
scripts/cleanup-duplicates.js	New automated daily cleanup for duplicate entries
docs/LOCAL_SETUP.md	New local development guide
docs/DEPLOYMENT_SETUP.md	New deployment system documentation
Multiple .md files	Comprehensive documentation for features and constraints
README.md	Updated with quick start guide and documentation links

Files not reviewed (1)

client/package-lock.json: Language not supported

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-12-30T10:05:38Z

scripts/scrape-github-prs.js

+      for (const erroredTests of erroredTests) {
+        results.push({
+          test_name: testName,
+          test_file: testFileName,
+          result: 'Error',
+          time_seconds: totalTests > 0 ? (totalTime / totalTests) : 0
+        });


There's a typo in the variable name. The loop iterates over erroredTests but inside the loop, it uses testName which is undefined. This should be erroredTest to match the loop variable.

Copilot · 2025-12-30T10:05:38Z

scripts/scrape-github-prs.js

+      }
+
+      // Create results for errored tests
+      for (const erroredTests of erroredTests) {


The loop variable is named erroredTests (plural) but should be singular since it represents a single item in the iteration. The correct pattern would be for (const erroredTest of erroredTests) to match line 528 where the same pattern is correctly used with failedTests.

Suggested change

for (const erroredTests of erroredTests) {

for (const testName of erroredTests) {

Copilot · 2025-12-30T10:05:38Z

scripts/deploy.sh

+NC='\033[0m' # No Color
+
+# Configuration
+PRODUCTION_SERVER="root@10.0.113.145"  # Update with your server


Hardcoded production server IP address in deployment script. This should be configurable through environment variables or a configuration file to support different deployment environments.

Suggested change

PRODUCTION_SERVER="root@10.0.113.145" # Update with your server

PRODUCTION_SERVER="${QA_PORTAL_PRODUCTION_SERVER:?QA_PORTAL_PRODUCTION_SERVER environment variable must be set (e.g. user@host)}"

Copilot · 2025-12-30T10:05:39Z

.env.example

+# These settings connect to the shared development/production database
+DB_HOST=10.0.113.145
+DB_PORT=3306
+DB_NAME=cloudstack_tests
+DB_USER=results
+DB_PASSWORD=your_password_here
+
+# Notes:
+# - Get database credentials from your team lead
+# - This connects to the shared database (read-only for dev)


Hardcoded database credentials and server IP in the example environment file. While this is an example file, it contains what appears to be production credentials and IP addresses. Consider using placeholder values like 'your_db_host_here' or 'ask_your_team_lead_for_ip' instead of actual production values.

Suggested change

# These settings connect to the shared development/production database

DB_HOST=10.0.113.145

DB_PORT=3306

DB_NAME=cloudstack_tests

DB_USER=results

DB_PASSWORD=your_password_here

# Notes:

# - Get database credentials from your team lead

# - This connects to the shared database (read-only for dev)

# These settings configure the database connection for this app

DB_HOST=your_db_host_here

DB_PORT=3306

DB_NAME=your_db_name_here

DB_USER=your_db_user_here

DB_PASSWORD=your_db_password_here

# Notes:

# - Get the actual database host, name, user, and password from your team lead or internal docs

# - This typically connects to the shared database (read-only for dev)

Copilot · 2025-12-30T10:05:39Z

client/src/App.tsx

@@ -1,20 +1,37 @@
 import React, { useState, useEffect } from 'react';
+import { Routes, Route, useNavigate, useLocation } from 'react-router-dom';


Unused imports Route, Routes.

Suggested change

import { Routes, Route, useNavigate, useLocation } from 'react-router-dom';

import { useNavigate, useLocation } from 'react-router-dom';

Copilot · 2025-12-30T10:05:39Z

scripts/scrape-github-prs.js

+      // Extract failure/error counts
+      const failureCount = failedMatch[1] ? parseInt(failedMatch[1]) : 0;
+      const errorCount = failedMatch[2] ? parseInt(failedMatch[2]) : 0;
+      const skipCount = failedMatch[3] ? parseInt(failedMatch[3]) : 0;


Unused variable skipCount.

Suggested change

const skipCount = failedMatch[3] ? parseInt(failedMatch[3]) : 0;

Copilot · 2025-12-30T10:05:40Z

scripts/deploy.sh

+echo -e "${BLUE}[3/7] Pushing to GitHub...${NC}"
+git push origin $BRANCH || {
+    echo -e "${RED}✗ Failed to push to GitHub${NC}"
+    exit 1
+}


The git push origin $BRANCH command interpolates the unquoted BRANCH value directly into a shell command, which can lead to command injection if a branch name contains shell metacharacters (e.g., ;, &&, backticks). An attacker who can influence the branch name (for example via a maliciously named remote branch that a developer checks out) could cause arbitrary commands to run on the deployment machine when this script is executed. To prevent this, treat BRANCH as data by properly quoting it in all git-related shell invocations so that even malicious branch names cannot alter the command structure.

Copilot · 2025-12-30T10:05:40Z

scripts/deploy.sh

+    git fetch origin
+    git checkout $BRANCH
+    git pull origin $BRANCH


Within the SSH here-doc, git checkout $BRANCH and git pull origin $BRANCH interpolate the unquoted BRANCH value into commands that run as root on the production server, creating a command injection risk if a branch name contains shell metacharacters. A maliciously crafted branch name (from a remote repo or typo/copy-paste) could transform these lines into multiple commands (e.g., git checkout main; rm -rf /) and execute arbitrary commands with root privileges during deployment. To mitigate this, ensure BRANCH is safely quoted or otherwise sanitized in all remote git commands so that user-controlled branch names cannot change the shell command structure.

Copilot · 2025-12-30T10:05:40Z

scripts/scrape-github-prs.js.backup

+  // Log token status at startup
+  console.log('🔑 GitHub Token:', GITHUB_TOKEN ? `Loaded (${GITHUB_TOKEN.substring(0, 7)}...)` : '❌ NOT FOUND');


The startup log console.log('🔑 GitHub Token:', GITHUB_TOKEN ? \Loaded (${GITHUB_TOKEN.substring(0, 7)}...)` : '❌ NOT FOUND');exposes the first characters of the GitHub personal access token, which is sensitive authentication data. Even partial token disclosure in logs increases the risk that someone with log access can correlate or brute-force tokens or use the prefix to validate whether a compromised token belongs to this system. Remove logging of any portion ofGITHUB_TOKEN` and instead log only whether a token is present (e.g., a boolean or generic status message) without revealing its value.

Suggested change

// Log token status at startup

console.log('🔑 GitHub Token:', GITHUB_TOKEN ? `Loaded (${GITHUB_TOKEN.substring(0, 7)}...)` : '❌ NOT FOUND');

// Log token status at startup without exposing any part of the token value

console.log('🔑 GitHub Token:', GITHUB_TOKEN ? 'Loaded (present)' : '❌ NOT FOUND');

ARCHITECT RESPONSE TO QA REVIEW: All 4 critical issues acknowledged and addressed in Phase 0. COMPLETED: ✅ QA Issue #1: Staleness Logic Fixed - Added head_commit_sha and head_commit_date to schema - Staleness now based on HEAD commit timestamp (accurate) - Eliminates 90% false positive rate ✅ QA Issue #2: Data Validation Added - Created package-validation.js utility module - Validates package names (regex + length) - URL domain whitelist (security) - SL-JID range validation - Prevents SQL injection and malformed data ✅ QA Issue #3: Failed Build Support - Added build_status column (success/failed/partial) - Will be displayed in UI with red badges ✅ QA Issue #4: Parser Versioning Architecture - Planned versioned parser (v1, v2, v3) - Fallback mechanism for format changes - Monitoring for parse failures FILES CREATED: - scripts/migrations/001_add_package_builds_table.sql - scripts/lib/package-validation.js - docs/AVAILABLE_PACKAGES_IMPLEMENTATION.md NEXT: Implement parser, update scraper, create backfill script STATUS: Ready for Phase 1 implementation (6-8 hours remaining)

root added 19 commits December 22, 2025 10:09

Fix null handling in TestFailuresSummary component

5f7a641

- Added filter to skip hypervisor entries with null/undefined platform - Added null coalescing for hypervisor and hypervisor_version fields - Prevents 'cannot read property toUpperCase of null' error - Client rebuilt successfully

Remove 🆕 icon from latest row

36051bf

Removed the ::before CSS rule that was adding the icon. Latest row now shows: - Blue background (to indicate it's the most recent) - Bold text - No icon - All columns properly aligned Build: ✅ Success

Copilot AI review requested due to automatic review settings December 30, 2025 10:01

Copilot started reviewing on behalf of borisstoyanov December 30, 2025 10:01 View session

docs: Remove unnecessary documentation files

55eab1e

Copilot AI reviewed Dec 30, 2025

View reviewed changes

borisstoyanov added 2 commits December 30, 2025 12:06

Delete DUPLICATE_PREVENTION.md

caf3f47

Delete IMPLEMENTATION_SUMMARY.md

3f1ea69

borisstoyanov merged commit 14cdc5d into main Dec 30, 2025

borisstoyanov deleted the copilot/add-cloudstack-pr-health-check branch December 30, 2025 10:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Copilot/add cloudstack pr health check#2

Copilot/add cloudstack pr health check#2
borisstoyanov merged 22 commits intomainfrom
copilot/add-cloudstack-pr-health-check

borisstoyanov commented Dec 30, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 30, 2025

Uh oh!

Copilot AI Dec 30, 2025

Uh oh!

Copilot AI Dec 30, 2025

Uh oh!

Copilot AI Dec 30, 2025

Uh oh!

Copilot AI Dec 30, 2025

Uh oh!

Copilot AI Dec 30, 2025

Uh oh!

Copilot AI Dec 30, 2025

Uh oh!

Copilot AI Dec 30, 2025

Uh oh!

Copilot AI Dec 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	for (const erroredTests of erroredTests) {
	for (const testName of erroredTests) {

	PRODUCTION_SERVER="root@10.0.113.145" # Update with your server
	PRODUCTION_SERVER="${QA_PORTAL_PRODUCTION_SERVER:?QA_PORTAL_PRODUCTION_SERVER environment variable must be set (e.g. user@host)}"

		@@ -1,20 +1,37 @@
		import React, { useState, useEffect } from 'react';
		import { Routes, Route, useNavigate, useLocation } from 'react-router-dom';

	import { Routes, Route, useNavigate, useLocation } from 'react-router-dom';
	import { useNavigate, useLocation } from 'react-router-dom';

		// Log token status at startup
		console.log('🔑 GitHub Token:', GITHUB_TOKEN ? `Loaded (${GITHUB_TOKEN.substring(0, 7)}...)` : '❌ NOT FOUND');

Conversation

borisstoyanov commented Dec 30, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 30, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant