scawful/yaze

Fork 0

Files

scawful 5c4cd57ff8 backend-infra-engineer: Post v0.3.9-hotfix7 snapshot (build cleanup)

2025-12-22 00:20:49 +00:00

12 KiB

Raw Blame History

Testing Infrastructure - Master Documentation

Owner: CLAUDE_TEST_COORD Status: Active Last Updated: 2025-11-20

Overview

This document serves as the central hub for all testing infrastructure in the yaze project. It coordinates testing strategies across local development, CI/CD, and release validation workflows.

Quick Links

Developer Quick Start: Testing Quick Start Guide
Build & Test Commands: Quick Reference
Existing Testing Guide: Testing Guide
Release Checklist: Release Checklist
CI/CD Pipeline: .github/workflows/ci.yml

Testing Levels

1. Unit Tests (`test/unit/`)

Purpose: Fast, isolated component tests with no external dependencies.

Characteristics:

Run in <10 seconds total
No ROM files required
No GUI initialization
Primary CI validation layer
Can run on any platform without setup

Run Locally:

# Build tests
cmake --build build --target yaze_test

# Run only unit tests
./build/bin/yaze_test --unit

# Run specific unit test
./build/bin/yaze_test --gtest_filter="*AsarWrapper*"

Coverage Areas:

Core utilities (hex conversion, compression)
Graphics primitives (tiles, palettes, colors)
ROM data structures (without actual ROM)
CLI resource catalog
GUI widget logic (non-interactive)
Zelda3 parsers and builders

2. Integration Tests (`test/integration/`)

Purpose: Test interactions between multiple components.

Characteristics:

Run in <30 seconds total
May require ROM file (subset marked as ROM-dependent)
Test cross-module boundaries
Secondary CI validation layer

Run Locally:

# Run all integration tests
./build/bin/yaze_test --integration

# Run with ROM-dependent tests
./build/bin/yaze_test --integration --rom-dependent --rom-path zelda3.sfc

Coverage Areas:

Asar wrapper + ROM class integration
Editor system interactions
AI service integration
Dungeon/Overworld data loading
Multi-component rendering pipelines

3. End-to-End (E2E) Tests (`test/e2e/`)

Purpose: Full user workflows driven by ImGui Test Engine.

Characteristics:

Run in 1-5 minutes
Require GUI initialization (can run headless in CI)
Most comprehensive validation
Simulate real user interactions

Run Locally:

# Run E2E tests (headless)
./build/bin/yaze_test --e2e

# Run E2E tests with visible GUI (for debugging)
./build/bin/yaze_test --e2e --show-gui

# Run specific E2E workflow
./build/bin/yaze_test --e2e --gtest_filter="*DungeonEditorSmokeTest*"

Coverage Areas:

Editor smoke tests (basic functionality)
Canvas interaction workflows
ROM loading and saving
ZSCustomOverworld upgrades
Complex multi-step user workflows

4. Benchmarks (`test/benchmarks/`)

Purpose: Performance measurement and regression tracking.

Characteristics:

Not run in standard CI (optional job)
Focus on speed, not correctness
Track performance trends over time

Run Locally:

./build/bin/yaze_test --benchmark

Test Organization Matrix

Category	ROM Required	GUI Required	Typical Duration	CI Frequency
Unit	No	No	<10s	Every commit
Integration	Sometimes	No	<30s	Every commit
E2E	Often	Yes (headless OK)	1-5min	Every commit
Benchmarks	No	No	Variable	Weekly/on-demand

Test Suites and Labels

Tests are organized into CMake test suites with labels for filtering:

stable: Fast tests with no ROM dependency (unit + some integration)
unit: Only unit tests
integration: Only integration tests
e2e: End-to-end GUI tests
rom_dependent: Tests requiring a real Zelda3 ROM file

See test/CMakeLists.txt for suite definitions.

Local Testing Workflows

Pre-Commit: Quick Validation (<30s)

# Build and run stable tests only
cmake --build build --target yaze_test
./build/bin/yaze_test --unit

# Alternative: use helper script
scripts/agents/run-tests.sh mac-dbg --output-on-failure

Pre-Push: Comprehensive Validation (<5min)

# Run all tests except ROM-dependent
./build/bin/yaze_test

# Run all tests including ROM-dependent
./build/bin/yaze_test --rom-dependent --rom-path zelda3.sfc

# Alternative: use ctest with preset
ctest --preset dev

Pre-Release: Full Platform Matrix

See Release Checklist for complete validation requirements.

CI/CD Testing Strategy

PR Validation Pipeline

Workflow: .github/workflows/ci.yml

Jobs:

Build (3 platforms: Linux, macOS, Windows)
- Compile all targets with warnings-as-errors
- Verify no build regressions
Test (3 platforms)
- Run stable test suite (fast, no ROM)
- Run unit test suite
- Run integration test suite (non-ROM-dependent)
- Upload test results and artifacts
Code Quality
- clang-format verification
- cppcheck static analysis
- clang-tidy linting
z3ed Agent (optional, scheduled)
- Full AI-enabled build with gRPC
- HTTP API testing (when enabled)

Preset Usage:

Linux: ci-linux
macOS: ci-macos
Windows: ci-windows

Remote Workflow Triggers

Agents and developers can trigger workflows remotely:

# Trigger CI with HTTP API tests enabled
scripts/agents/run-gh-workflow.sh ci.yml -f enable_http_api_tests=true

# Trigger CI with artifact uploads
scripts/agents/run-gh-workflow.sh ci.yml -f upload_artifacts=true

See GH Actions Remote Guide for details.

Test Result Artifacts

Test XML reports uploaded on failure
Build logs available in job output
Windows binaries uploaded for debugging

Platform-Specific Test Considerations

macOS

Stable: All tests pass reliably
Known Issues: None active
Recommended Preset: mac-dbg (debug), mac-ai (with gRPC)
Smoke Build: scripts/agents/smoke-build.sh mac-dbg

Linux

Stable: All tests pass reliably
Known Issues: Previous FLAGS symbol conflicts resolved (commit 43a0e5e314)
Recommended Preset: lin-dbg (debug), lin-ai (with gRPC)
Smoke Build: scripts/agents/smoke-build.sh lin-dbg

Windows

Stable: Build fixes applied (commit 43118254e6)
Known Issues: Previous std::filesystem errors resolved
Recommended Preset: win-dbg (debug), win-ai (with gRPC)
Smoke Build: pwsh -File scripts/agents/windows-smoke-build.ps1 -Preset win-dbg

Test Writing Guidelines

Where to Add New Tests

New class MyClass: Add test/unit/my_class_test.cc
Testing with ROM: Add test/integration/my_class_rom_test.cc
Testing UI workflow: Add test/e2e/my_class_workflow_test.cc

Test Structure

All test files should follow this pattern:

#include <gtest/gtest.h>
#include "path/to/my_class.h"

namespace yaze {
namespace test {

TEST(MyClassTest, BasicFunctionality) {
  MyClass obj;
  EXPECT_TRUE(obj.DoSomething());
}

TEST(MyClassTest, EdgeCases) {
  MyClass obj;
  EXPECT_FALSE(obj.HandleEmpty());
}

}  // namespace test
}  // namespace yaze

Mocking

Use test/mocks/ for mock objects:

mock_rom.h: Mock ROM class for testing without actual ROM files
Add new mocks as needed for isolating components

Test Utilities

Common helpers in test/test_utils.h:

LoadRomInTest(): Load a ROM file in GUI test context
OpenEditorInTest(): Open an editor for E2E testing
CreateTestCanvas(): Initialize a canvas for testing

Troubleshooting Test Failures

Common Issues

1. ROM-Dependent Test Failures

Symptom: Tests fail with "ROM file not found" or data mismatches

Solution:

# Set ROM path environment variable
export YAZE_TEST_ROM_PATH=/path/to/zelda3.sfc

# Or pass directly
./build/bin/yaze_test --rom-path /path/to/zelda3.sfc

2. GUI Test Failures in CI

Symptom: E2E tests fail in headless CI environment

Solution: Tests should work headless by default. If failing, check:

ImGui Test Engine initialization
SDL video driver (uses "dummy" in headless mode)
Test marked with proper YAZE_GUI_TEST_TARGET definition

3. Platform-Specific Failures

Symptom: Tests pass locally but fail in CI on specific platform

Solution:

Check CI logs for platform-specific errors
Run locally with same preset (ci-linux, ci-macos, ci-windows)
Use remote workflow trigger to reproduce in CI environment

4. Flaky Tests

Symptom: Tests pass sometimes, fail other times

Solution:

Check for race conditions in multi-threaded code
Verify test isolation (no shared state between tests)
Add test to .github/workflows/ci.yml exclusion list temporarily
File issue with flaky-test label

Getting Help

Check existing issues: https://github.com/scawful/yaze/issues
Review test logs in CI job output
Ask in coordination board: docs/internal/agents/coordination-board.md
Tag CLAUDE_TEST_COORD for testing infrastructure issues

Test Infrastructure Roadmap

Completed

✅ Unit, integration, and E2E test organization
✅ ImGui Test Engine integration for GUI testing
✅ Platform-specific CI matrix (Linux, macOS, Windows)
✅ Smoke build helpers for agents
✅ Remote workflow triggers
✅ Test result artifact uploads

In Progress

🔄 Pre-push testing hooks
🔄 Symbol conflict detection tools
🔄 CMake configuration validation
🔄 Platform matrix testing tools

Planned

📋 Automated test coverage reporting
📋 Performance regression tracking
📋 Fuzz testing integration
📋 ROM compatibility test matrix (different ROM versions)
📋 GPU/graphics driver test matrix

Helper Scripts

All helper scripts are in scripts/agents/:

Script	Purpose	Usage
`run-tests.sh`	Build and run tests for a preset	`scripts/agents/run-tests.sh mac-dbg`
`smoke-build.sh`	Quick build verification	`scripts/agents/smoke-build.sh mac-dbg yaze`
`run-gh-workflow.sh`	Trigger remote CI workflow	`scripts/agents/run-gh-workflow.sh ci.yml`
`test-http-api.sh`	Test HTTP API endpoints	`scripts/agents/test-http-api.sh`
`windows-smoke-build.ps1`	Windows smoke build (PowerShell)	`pwsh -File scripts/agents/windows-smoke-build.ps1`

See scripts/agents/README.md for details.

Coordination Protocol

IMPORTANT: AI agents working on testing infrastructure must follow the coordination protocol:

Before starting work: Check docs/internal/agents/coordination-board.md for active tasks
Update board: Add entry with scope, status, and expected changes
Avoid conflicts: Request coordination if touching same files as another agent
Log results: Update board with completion status and any issues found

See Coordination Board for current status.

Contact & Ownership

Testing Infrastructure Lead: CLAUDE_TEST_COORD
Platform Specialists:
- Windows: CLAUDE_AIINF
- Linux: CLAUDE_AIINF
- macOS: CLAUDE_MAC_BUILD
Release Coordination: CLAUDE_RELEASE_COORD

References

Testing Guide - User-facing testing documentation
Testing Quick Start - Developer quick reference
Build Quick Reference - Build commands and presets
Release Checklist - Pre-release testing requirements
CI/CD Pipeline - Automated testing configuration

Next Steps: See Integration Plan for rolling out new testing infrastructure improvements.

12 KiB Raw Blame History

Testing Infrastructure - Master Documentation

Overview

Quick Links

Testing Levels

1. Unit Tests (test/unit/)

2. Integration Tests (test/integration/)

3. End-to-End (E2E) Tests (test/e2e/)

4. Benchmarks (test/benchmarks/)

Test Organization Matrix

Test Suites and Labels

Local Testing Workflows

Pre-Commit: Quick Validation (<30s)

Pre-Push: Comprehensive Validation (<5min)

Pre-Release: Full Platform Matrix

CI/CD Testing Strategy

PR Validation Pipeline

Remote Workflow Triggers

Test Result Artifacts

Platform-Specific Test Considerations

macOS

Linux

Windows

Test Writing Guidelines

Where to Add New Tests

Test Structure

Mocking

Test Utilities

Troubleshooting Test Failures

Common Issues

1. ROM-Dependent Test Failures

2. GUI Test Failures in CI

3. Platform-Specific Failures

4. Flaky Tests

Getting Help

Test Infrastructure Roadmap

Completed

In Progress

Planned

Helper Scripts

Coordination Protocol

Contact & Ownership

References

12 KiB

Raw Blame History

1. Unit Tests (`test/unit/`)

2. Integration Tests (`test/integration/`)

3. End-to-End (E2E) Tests (`test/e2e/`)

4. Benchmarks (`test/benchmarks/`)