Files

scawful 508c5402ed Enhance documentation for E2E GUI testing framework and Tile16 editor palette system

- Added comprehensive sections on the E2E GUI testing framework in `A1-testing-guide.md`, detailing architecture, test writing, and execution.
- Introduced a new document `E7-tile16-editor-palette-system.md` outlining the redesign and implementation of the Tile16 editor palette system, including problem analysis, solution architecture, and UI/UX refactoring.
- Updated `E6-z3ed-cli-design.md` with recent refactoring improvements and code quality enhancements for the z3ed CLI tool.
- Expanded the YAZE development tracker in `yaze.org` to reflect ongoing issues and features related to the Tile16 editor and E2E testing.

2025-10-01 09:23:59 -04:00

8.1 KiB

Raw Blame History

Testing Guide

Comprehensive testing framework with efficient CI/CD integration and ROM-dependent test separation.

Test Categories

Stable Tests (STABLE)

Always run in CI/CD - Required for releases

AsarWrapperTest: Core Asar functionality tests
SnesTileTest: SNES tile format handling
CompressionTest: Data compression/decompression
SnesPaletteTest: SNES palette operations
HexTest: Hexadecimal utilities
AsarIntegrationTest: Asar integration without ROM dependencies

Characteristics:

Fast execution (< 30 seconds total)
No external dependencies (ROMs, complex setup)
High reliability and deterministic results

ROM-Dependent Tests (ROM_DEPENDENT)

Only run in development with available ROM files

AsarRomIntegrationTest: Real ROM patching and symbol extraction
ROM-based integration tests: Tests requiring actual game ROM files

Characteristics:

Require specific ROM files to be present
Test real-world functionality
Automatically skipped in CI if ROM files unavailable

Experimental Tests (EXPERIMENTAL)

Run separately, allowed to fail

CpuTest: 65816 CPU emulation tests
Spc700Test: SPC700 audio processor tests
ApuTest: Audio Processing Unit tests
PpuTest: Picture Processing Unit tests

Characteristics:

May be unstable due to emulation complexity
Test advanced/experimental features
Allowed to fail without blocking releases

Command Line Usage

# Run only stable tests (release-ready)
ctest --test-dir build --label-regex "STABLE"

# Run experimental tests (allowed to fail)
ctest --test-dir build --label-regex "EXPERIMENTAL"

# Run Asar-specific tests
ctest --test-dir build -R "*Asar*"

# Run tests excluding ROM-dependent ones
ctest --test-dir build --label-exclude "ROM_DEPENDENT"

# Run with specific preset
ctest --preset stable
ctest --preset experimental

CMake Presets

# Development workflow
cmake --preset dev
cmake --build --preset dev
ctest --preset dev

# CI workflow  
cmake --preset ci
cmake --build --preset ci
ctest --preset ci

# Release workflow
cmake --preset release
cmake --build --preset release
ctest --preset stable

Writing Tests

Stable Tests

TEST(SnesTileTest, UnpackBppTile) {
    std::vector<uint8_t> tile_data = {0xAA, 0x55, 0xAA, 0x55};
    std::vector<uint8_t> result = UnpackBppTile(tile_data, 2);
    EXPECT_EQ(result.size(), 64);
    // Test specific pixel values...
}

ROM-Dependent Tests

YAZE_ROM_TEST(AsarIntegration, RealRomPatching) {
    auto rom_data = TestRomManager::LoadTestRom();
    if (!rom_data.has_value()) {
        GTEST_SKIP() << "ROM file not available";
    }
    
    AsarWrapper wrapper;
    wrapper.Initialize();
    
    auto result = wrapper.ApplyPatch("test.asm", *rom_data);
    EXPECT_TRUE(result.ok());
}

Experimental Tests

TEST(CpuTest, InstructionExecution) {
    // Complex emulation tests
    // May be timing-sensitive or platform-dependent
}

CI/CD Integration

GitHub Actions

# Main CI pipeline
- name: Run Stable Tests
  run: ctest --label-regex "STABLE"

# Experimental tests (allowed to fail)
- name: Run Experimental Tests
  run: ctest --label-regex "EXPERIMENTAL"
  continue-on-error: true

Test Execution Strategy

Stable tests run first - Quick feedback for developers
Experimental tests run in parallel - Don't block on unstable tests
ROM tests skipped - No dependency on external files
Selective test execution - Only run relevant tests for changes

Test Development Guidelines

Writing Stable Tests

Fast execution: Aim for < 1 second per test
No external dependencies: Self-contained test data
Deterministic: Same results every run
Core functionality: Test essential features only

Writing ROM-Dependent Tests

Use TestRomManager: Proper ROM file handling
Graceful skipping: Skip if ROM not available
Real-world scenarios: Test with actual game data
Label appropriately: Always include ROM_DEPENDENT label

Writing Experimental Tests

Complex scenarios: Multi-component integration
Advanced features: Emulation, complex algorithms
Performance tests: May vary by system
GUI components: May require display context

E2E GUI Testing Framework

Overview

An agent-friendly, end-to-end testing framework built on ImGuiTestEngine to automate UI interaction testing for the YAZE editor.

Architecture

Test Execution Flow:

z3ed test-gui command invokes the modified yaze_test executable
yaze_test initializes an application window and ImGuiTestEngine
Tests are registered and executed against the live GUI
Results are reported back with detailed logs and assertions

Key Components:

test/yaze_test.cc - Main test executable with GUI initialization
test/e2e/framework_smoke_test.cc - Basic infrastructure verification
test/e2e/canvas_selection_test.cc - Canvas interaction tests
test/test_utils.h - High-level action wrappers (LoadRomInTest, OpenEditorInTest, etc.)

Writing E2E Tests

// Example: Canvas selection test
#include "test/test_utils.h"

void RegisterCanvasSelectionTest() {
  ImGuiTestEngine* engine = ImGuiTestEngine_GetCurrentContext();
  
  ImGuiTest* t = IM_REGISTER_TEST(engine, "e2e", "canvas_selection");
  t->GuiFunc = [](ImGuiTestContext* ctx) {
    // Load ROM and open editor
    LoadRomInTest(ctx, "zelda3.sfc");
    OpenEditorInTest(ctx, "Overworld Editor");
    
    // Perform actions
    ctx->MouseMove("##OverworldCanvas");
    ctx->MouseClick(ImGuiMouseButton_Left);
    ctx->KeyPress(ImGuiMod_Ctrl | ImGuiKey_C);  // Copy
    
    // Assertions
    IM_CHECK(VerifyTileData(ctx, expected_data));
  };
}

Helper Functions

test/test_utils.h provides high-level wrappers:

LoadRomInTest(ctx, rom_path) - Load ROM file in test context
OpenEditorInTest(ctx, editor_name) - Open specific editor
VerifyTileData(ctx, expected) - Assert tile data correctness
WaitForRender(ctx) - Wait for rendering to complete

Running GUI Tests

# Run all E2E tests
z3ed test-gui --rom zelda3.sfc --test all

# Run specific test suite
z3ed test-gui --rom zelda3.sfc --test canvas_selection

# Run in headless mode (CI)
z3ed test-gui --rom zelda3.sfc --test all --headless

Test Categories for E2E

Smoke Tests: Basic functionality verification (window opens, ROM loads)
Canvas Tests: Drawing, selection, copy/paste operations
Editor Tests: Specific editor workflows (Overworld, Dungeon, Graphics)
Integration Tests: Multi-editor workflows and data persistence

Development Status

✅ Phase 1: Core infrastructure & test execution flow
✅ Phase 2: Agent-friendly test definition & interaction
✅ Phase 3: Initial test case - Canvas rectangle selection
✅ Phase 4: Build and verify infrastructure
⏳ Phase 5: Expand test coverage and fix identified bugs
⏳ Phase 6: CI/CD integration with headless mode

Best Practices

Keep tests deterministic - Use fixed delays and wait for specific states
Use high-level helpers - Abstract ImGuiTestEngine complexity
Test user workflows - Focus on real user interactions, not internal state
Verify visually - Check rendered output, not just data structures
Clean up state - Reset between tests to prevent interference

Performance and Maintenance

Regular Review

Monthly review of experimental test failures
Promote stable experimental tests to stable category
Deprecate obsolete tests that no longer provide value
Update test categorization as features mature

Performance Monitoring

Track test execution times for CI efficiency
Identify slow tests for optimization or recategorization
Monitor CI resource usage and adjust parallelism
Benchmark critical path tests for performance regression

E2E Test Maintenance

Update test helpers as UI components change
Maintain test ROM files for consistent test data
Review failed tests for UI changes vs. actual bugs
Expand coverage for new features and bug fixes

8.1 KiB Raw Blame History

Testing Guide

Test Categories

Stable Tests (STABLE)

ROM-Dependent Tests (ROM_DEPENDENT)

Experimental Tests (EXPERIMENTAL)

Command Line Usage

CMake Presets

Writing Tests

Stable Tests

ROM-Dependent Tests

Experimental Tests

CI/CD Integration

GitHub Actions

Test Execution Strategy

Test Development Guidelines

Writing Stable Tests

Writing ROM-Dependent Tests

Writing Experimental Tests

E2E GUI Testing Framework

Overview

Architecture

Writing E2E Tests

Helper Functions

Running GUI Tests

Test Categories for E2E

Development Status

Best Practices

Performance and Maintenance

Regular Review

Performance Monitoring

E2E Test Maintenance

8.1 KiB

Raw Blame History