scawful 4b61b213c0 feat: Add resource search and dungeon room description commands

- Implemented `resource-search` command to allow fuzzy searching of resource labels.
- Added `dungeon-describe-room` command to summarize metadata for a specified dungeon room.
- Enhanced `agent` command handler to support new commands and updated usage documentation.
- Introduced read-only accessors for room metadata in the Room class.
- Updated AI service to recognize and handle new commands for resource searching and room description.
- Improved metrics tracking for user interactions, including command execution and response times.
- Enhanced TUI to display command metrics and session summaries.

2025-10-04 12:00:51 -04:00

README.md

z3ed: AI-Powered CLI for YAZE

Status: Production Ready (AI Integration)
Latest Update: October 3, 2025

Overview

z3ed is a command-line interface for YAZE enabling AI-driven ROM modifications through a conversational interface. It provides natural language interaction for ROM inspection and editing with a safe proposal-based workflow.

Core Capabilities:

Conversational Agent: Chat with AI to explore ROM contents and plan changes
GUI Test Automation: Widget discovery, recording/replay, introspection
Proposal System: Sandbox editing with review workflow
Multiple AI Backends: Ollama (local), Gemini (cloud)

Quick Start

Build

# Full AI features (RECOMMENDED)
cmake -B build -DZ3ED_AI=ON
cmake --build build --target z3ed

# With GUI automation
cmake -B build -DZ3ED_AI=ON -DYAZE_WITH_GRPC=ON
cmake --build build --target z3ed

AI Setup

Ollama (Recommended for Development):

brew install ollama              # macOS
ollama pull qwen2.5-coder:7b    # Pull model
ollama serve                     # Start server

Gemini (Cloud API):

export GEMINI_API_KEY="your-key-here"
# Get key from https://aistudio.google.com/apikey

Example Commands

Conversational Agent:

# Interactive chat (FTXUI)
z3ed agent chat --rom zelda3.sfc

# Simple text mode (better for AI/automation)
z3ed agent simple-chat --rom zelda3.sfc

# Batch mode
z3ed agent simple-chat --file queries.txt --rom zelda3.sfc

Direct Tool Usage:

# List dungeons
z3ed agent resource-list --type dungeon --format json

# Find tiles
z3ed agent overworld-find-tile --tile 0x02E --map 0x05

# Inspect sprites
z3ed agent dungeon-list-sprites --room 0x012

Proposal Workflow:

# Generate from prompt
z3ed agent run --prompt "Place tree at 10,10" --rom zelda3.sfc --sandbox

# List proposals
z3ed agent list

# Review
z3ed agent diff --proposal-id <id>

# Accept
z3ed agent accept --proposal-id <id>

Chat Modes

1. FTXUI Chat (`agent chat`)

Full-screen interactive terminal with:

Table rendering for JSON results
Syntax highlighting
Scrollable history
Best for manual exploration

2. Simple Chat (`agent simple-chat`)

Text-based REPL without FTXUI:

Lightweight, no dependencies
Scriptable and automatable
Batch mode support
Better for AI agent testing
Commands: quit, exit, reset
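Batch mode lends itself to scripted checks. The sketch below writes a query file that could then be passed to `simple-chat --file`. It assumes the file holds one prompt per line, which this README does not specify, and `make_queries_file` is a hypothetical helper, not part of z3ed.

```shell
# Hypothetical helper: write each argument as one line of a query file.
# ASSUMPTION: the --file input is one prompt per line; the real format may differ.
make_queries_file() {
  out="$1"
  shift
  : > "$out"                        # create or truncate the file
  for prompt in "$@"; do
    printf '%s\n' "$prompt" >> "$out"
  done
}

# Usage (not executed here):
#   make_queries_file queries.txt "What dungeons exist?" "Describe room 0x012"
#   z3ed agent simple-chat --file queries.txt --rom zelda3.sfc
```

Keeping the prompts in a file makes the same set of questions easy to rerun after each build.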

3. GUI Chat Widget (In Progress)

ImGui widget in YAZE editor:

Same backend as CLI
Dockable interface
History persistence
Visual proposal review

Available Tools

The agent can call these tools autonomously:

| Tool | Purpose | Example |
|------|---------|---------|
| `resource-list` | List labeled resources | "What dungeons exist?" |
| `resource-search` | Fuzzy search across labels | "Search for soldier labels" |
| `dungeon-list-sprites` | Sprites in room | "Show soldiers in room 0x12" |
| `dungeon-describe-room` | Room metadata summary | "Describe room 0x012" |
| `overworld-find-tile` | Find tile locations | "Where is tile 0x2E used?" |
| `overworld-describe-map` | Map metadata | "Describe map 0x05" |
| `overworld-list-warps` | List entrances/exits | "Show all cave entrances" |

🎯 Next Steps

GUI Integration (4-6h): Wire chat widget into main app
Proposal Integration (6-8h): Connect chat to ROM modification

Troubleshooting

Chat mode freezes

Solution: Use `agent simple-chat` instead of `agent chat`.

Example Workflows

Explore ROM

$ z3ed agent simple-chat --rom zelda3.sfc
You: What dungeons are defined?
Agent: <calls resource-list --type dungeon>
  ID    Label                   
  ----  ------------------------
  0x00  eastern_palace          
  0x01  desert_palace           
  ...

You: Show me sprites in the first dungeon room 0x012
Agent: <calls dungeon-list-sprites --room 0x012>
  ...

Make Changes

$ z3ed agent run --prompt "Add a tree at position 10,10 on map 0" --sandbox
Proposal created: abc123

$ z3ed agent diff --proposal-id abc123
Commands:
  overworld set-tile --map 0 --x 10 --y 10 --tile 0x02E

$ z3ed agent accept --proposal-id abc123
✅ Proposal accepted
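The run, diff, accept sequence above can be wrapped in a small review gate. This is a minimal sketch, assuming `agent run` prints a line of the form `Proposal created: <id>` as in the example output; `extract_proposal_id` and `review_and_accept` are hypothetical helpers, and the exact output format of a real build may differ.

```shell
# Pull the ID out of a line such as "Proposal created: abc123" read from stdin.
extract_proposal_id() {
  sed -n 's/^Proposal created: *\([^[:space:]]*\).*$/\1/p' | head -n 1
}

# Run a prompt in the sandbox, show the diff, and accept only after confirmation.
review_and_accept() {
  prompt="$1"
  rom="$2"
  id=$(z3ed agent run --prompt "$prompt" --rom "$rom" --sandbox | extract_proposal_id)
  if [ -z "$id" ]; then
    echo "no proposal id found in output" >&2
    return 1
  fi
  z3ed agent diff --proposal-id "$id"
  printf 'Accept proposal %s? [y/N] ' "$id"
  read -r answer
  if [ "$answer" = "y" ]; then
    z3ed agent accept --proposal-id "$id"
  else
    echo "Proposal $id left pending"
  fi
}

# Usage (not executed here):
#   review_and_accept "Add a tree at position 10,10 on map 0" zelda3.sfc
```

The explicit confirmation step keeps a human in the loop, in line with the proposal-based workflow.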

Overview

z3ed is a command-line interface for YAZE that enables AI-driven ROM modifications through a proposal-based workflow. It provides both human-accessible commands for developers and machine-readable APIs for LLM integration.

Core Capabilities:

AI-Driven Editing: Natural language prompts → ROM modifications (overworld tile16, dungeon objects, sprites, palettes)
GUI Test Automation: Widget discovery, test recording/replay, introspection for debugging
Proposal System: Safe sandbox editing with accept/reject workflow
Multiple AI Backends: Ollama (local), Gemini (cloud), Claude (planned)

Quick Start

Build Options

# Basic z3ed (CLI only, no AI/testing features)
cmake -B build
cmake --build build --target z3ed

# Full build with AI agent (RECOMMENDED - uses consolidated flag)
cmake -B build -DZ3ED_AI=ON
cmake --build build --target z3ed

# Full build with AI agent AND testing suite
cmake -B build -DZ3ED_AI=ON -DYAZE_WITH_GRPC=ON
cmake --build build --target z3ed

Build Flags Explained:

Z3ED_AI=ON - Master flag for AI features (enables JSON, YAML, httplib for Ollama + Gemini)
YAZE_WITH_GRPC=ON - Optional GUI automation and test harness (also enables JSON)
YAZE_WITH_JSON=ON - Lower-level flag (auto-enabled by Z3ED_AI or GRPC)

Dependencies for AI Features (auto-managed by Z3ED_AI):

nlohmann/json (JSON parsing for AI responses)
yaml-cpp (Config file loading)
httplib (HTTP/HTTPS API calls)
OpenSSL (optional, for Gemini HTTPS - auto-detected on macOS/Linux)
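The three build variants above differ only in their configure flags, so a wrapper can select them by name. The sketch below is illustrative: the profile names `basic`, `ai`, and `ai-gui` and the function `z3ed_cmake_flags` are made up for this example, while the flags themselves are the ones listed above.

```shell
# Hypothetical helper: map a profile name to the configure flags described above.
z3ed_cmake_flags() {
  case "$1" in
    basic)  echo "" ;;                                   # CLI only
    ai)     echo "-DZ3ED_AI=ON" ;;                       # AI agent
    ai-gui) echo "-DZ3ED_AI=ON -DYAZE_WITH_GRPC=ON" ;;   # AI agent + GUI automation
    *)      echo "unknown profile: $1" >&2; return 1 ;;
  esac
}

# Usage (not executed here; the substitution is left unquoted on purpose
# so that each flag becomes a separate argument):
#   cmake -B build $(z3ed_cmake_flags ai-gui)
#   cmake --build build --target z3ed
```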

AI Agent Commands

# Generate commands from natural language prompt
z3ed agent plan --prompt "Place a tree at position 10, 10 on map 0"

# Execute in sandbox with auto-approval
z3ed agent run --prompt "Create a 3x3 water pond at 15, 20" --rom zelda3.sfc --sandbox

# Chat with the agent in the terminal (FTXUI prototype)
z3ed agent chat

# List all proposals
z3ed agent list

# View proposal details
z3ed agent diff --proposal-id <id>

# Inspect project metadata for the LLM toolchain
z3ed agent resource-list --type dungeon --format json

# Dump sprite placements for a dungeon room
z3ed agent dungeon-list-sprites --room 0x012

# Search overworld maps for a tile ID using shared agent tooling
z3ed agent overworld-find-tile --tile 0x02E --map 0x05

GUI Testing Commands

# Run automated test
z3ed agent test --prompt "Open Overworld editor and verify it loads"

# Query test status
z3ed agent test status --test-id <id> --follow

# Record manual workflow
z3ed agent test record start --output tests/my_test.json
# ... perform actions in GUI ...
z3ed agent test record stop

# Replay recorded test
z3ed agent test replay tests/my_test.json

# Test conversational agent (batch mode, no TUI required)
z3ed agent test-conversation

# Test with custom conversation file
z3ed agent test-conversation --file my_tests.json
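Recorded tests can be replayed as a group. The following is a rough sketch that loops over a directory of recordings with `agent test replay`. It assumes that a failed replay returns a non-zero exit status, which is not confirmed here; `replay_all` and the `Z3ED` override variable are hypothetical names.

```shell
# Hypothetical helper: replay every recorded test in a directory.
# Z3ED can be overridden to point at a specific binary (defaults to "z3ed").
# ASSUMPTION: a failed replay exits with a non-zero status.
replay_all() {
  dir="$1"
  failed=0
  for f in "$dir"/*.json; do
    [ -e "$f" ] || continue         # directory had no recordings
    if ! ${Z3ED:-z3ed} agent test replay "$f"; then
      echo "FAILED: $f" >&2
      failed=$((failed + 1))
    fi
  done
  [ "$failed" -eq 0 ]               # succeed only if nothing failed
}

# Usage (not executed here):
#   replay_all tests
```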

AI Service Setup

Ollama (Local LLM - Recommended for Development)

# Install Ollama
brew install ollama  # macOS
# or download from https://ollama.com

# Pull recommended model
ollama pull qwen2.5-coder:7b

# Start server
ollama serve

# z3ed will auto-detect Ollama at localhost:11434
z3ed agent plan --prompt "test"

Gemini (Google Cloud API)

# Get API key from https://aistudio.google.com/apikey
export GEMINI_API_KEY="your-key-here"

# z3ed will auto-select Gemini when key is set
z3ed agent plan --prompt "test"

Note: Gemini requires OpenSSL (HTTPS). Build with -DZ3ED_AI=ON (the older -DYAZE_WITH_GRPC=ON -DYAZE_WITH_JSON=ON combination still works) to enable SSL support. OpenSSL is auto-detected on macOS/Linux. Windows users can use Ollama instead.
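Before starting a session it can help to check which backend is likely to be picked up. The sketch below mirrors the behavior described above (Gemini when `GEMINI_API_KEY` is set, Ollama at localhost:11434), but the precedence between the two is an assumption and z3ed's actual selection logic may differ. `choose_backend` and `ollama_status` are hypothetical helpers; `/api/tags` is Ollama's endpoint for listing local models.

```shell
# Report "up" if a local Ollama server answers on its default port.
ollama_status() {
  if curl -fsS --max-time 2 http://localhost:11434/api/tags >/dev/null 2>&1; then
    echo up
  else
    echo down
  fi
}

# Decide which backend is expected to be used.
#   $1: value of GEMINI_API_KEY (may be empty)
#   $2: "up" or "down", as reported by ollama_status
# ASSUMPTION: a configured Gemini key takes precedence over Ollama.
choose_backend() {
  if [ -n "$1" ]; then
    echo gemini
  elif [ "$2" = "up" ]; then
    echo ollama
  else
    echo none
  fi
}

# Usage (not executed here):
#   choose_backend "${GEMINI_API_KEY:-}" "$(ollama_status)"
```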

Example Prompts

Here are some example prompts you can try with either Ollama or Gemini:

Overworld Tile16 Editing:

"Place a tree at position 10, 20 on map 0"
"Create a 3x3 water pond at coordinates 15, 10"
"Add a dirt path from position 5,5 to 5,15"
"Plant a row of trees horizontally at y=8 from x=20 to x=25"

Dungeon Editing (Label-Aware):

"Add 3 soldiers to the Eastern Palace entrance room"
"Place a chest in Hyrule Castle treasure room"

Core Documentation

Essential Reads

BUILD_QUICK_REFERENCE.md - NEW! Fast build guide with Z3ED_AI flag examples
AGENT-ROADMAP.md - The primary source of truth for the AI agent's strategic vision, architecture, and next steps
Z3ED_AI_FLAG_MIGRATION.md - NEW! Complete guide to Z3ED_AI flag and crash fixes
E6-z3ed-cli-design.md - Detailed architecture and design philosophy
E6-z3ed-reference.md - Complete command reference and API documentation

Current Status (October 3, 2025)

✅ Production Ready

Build System: ✅ Z3ED_AI flag consolidation complete
- Single flag for all AI features
- Graceful degradation when dependencies missing
- Clear error messages and build status
- Backward compatible with old flags
AI Backends: ✅ Both Ollama and Gemini operational
- Auto-detection based on environment
- Health checks and error handling
- Tested with real API calls
Conversational Agent: ✅ Multi-step tool execution loop
- Chat history management
- Tool result replay without recursion
- JSON/table rendering in TUI
Tool Dispatcher: ✅ 5 read-only tools operational
- Resource listing, sprite inspection, tile search
- Map descriptions, warp enumeration
- Machine-readable JSON output

In Progress (Priority Order)

Live LLM Testing (1-2h): Verify function calling with real models
GUI Chat Widget (6-8h): ImGui integration (TUI exists as reference)
Tool Coverage Expansion (8-10h): Dialogue, sprites, regions

📋 Next Steps

See AGENT-ROADMAP.md for detailed technical roadmap.

AI Editing Focus Areas

z3ed is optimized for practical ROM editing workflows:

Overworld Tile16 Editing ⭐ PRIMARY FOCUS

Why: Simple data model (uint16 IDs), visual feedback, reversible, safe

Single tile placement (trees, rocks, bushes)
Area creation (water ponds, dirt patches)
Path creation (connecting points with tiles)
Pattern generation (tree rows, forests, boundaries)

Dungeon Editing

Sprite placement with label awareness ("eastern palace entrance")
Object placement (chests, doors, switches)
Entrance configuration
Room property editing

Palette Editing

Color modification by index
Sprite palette adjustments
Export/import workflows

Additional Capabilities

Sprite data editing
Compression/decompression
ROM validation
Patch application

Example Workflows

Basic Tile16 Edit

# AI generates command
z3ed agent plan --prompt "Place a tree at 10, 10"
# Output: overworld set-tile --map 0 --x 10 --y 10 --tile 0x02E

# Execute manually
z3ed overworld set-tile --map 0 --x 10 --y 10 --tile 0x02E

# Or auto-execute with sandbox
z3ed agent run --prompt "Place a tree at 10, 10" --rom zelda3.sfc --sandbox

Complex Multi-Step Edit

# AI generates multiple commands
z3ed agent plan --prompt "Create a 3x3 water pond at 15, 20"

# Review proposal
z3ed agent diff --latest

# Accept and apply
z3ed agent accept --latest
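To give a feel for what a multi-command proposal might contain, the sketch below expands a rectangle into one `overworld set-tile` command per cell. It is an illustration only: the commands the agent actually generates may be ordered or grouped differently, `fill_rect_commands` is a hypothetical helper, and the tile ID is left as a parameter because the water tile ID is not given here.

```shell
# Hypothetical helper: print one set-tile command per cell of a w x h rectangle.
#   $1 map, $2 x origin, $3 y origin, $4 width, $5 height, $6 tile ID
fill_rect_commands() {
  map="$1"; x0="$2"; y0="$3"; w="$4"; h="$5"; tile="$6"
  y="$y0"
  while [ "$y" -lt $((y0 + h)) ]; do
    x="$x0"
    while [ "$x" -lt $((x0 + w)) ]; do
      echo "z3ed overworld set-tile --map $map --x $x --y $y --tile $tile"
      x=$((x + 1))
    done
    y=$((y + 1))
  done
}

# Usage (not executed here; 0x000 is a placeholder, not a real water tile ID):
#   fill_rect_commands 0 15 20 3 3 0x000
```

A 3x3 area yields nine commands, which is why reviewing the proposal diff before accepting is worthwhile.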

Locate Existing Tiles

# Find every instance of tile 0x02E across the overworld
z3ed overworld find-tile --tile 0x02E --format json

# Narrow search to Light World map 0x05
z3ed overworld find-tile --tile 0x02E --map 0x05

# Ask the agent to perform the same lookup (returns JSON by default)
z3ed agent overworld-find-tile --tile 0x02E --map 0x05

Label-Aware Dungeon Edit

# AI uses ResourceLabels from your project
z3ed agent plan --prompt "Add 3 soldiers to my custom fortress entrance"
# AI explains: "Using label 'custom_fortress' for dungeon 0x04"

Dependencies Guard

AI agent features depend on these build options:

YAZE_WITH_JSON=ON - AI service communication (auto-enabled by Z3ED_AI=ON)
YAZE_WITH_GRPC=ON - GUI automation and test harness (optional)
OpenSSL (optional) - Gemini HTTPS support (auto-detected)

Windows Compatibility: Build without gRPC/JSON for basic z3ed functionality. Use Ollama (localhost) instead of Gemini for AI features without SSL dependency.

Recent Changes (Oct 3, 2025)

Z3ED_AI Build Flag (Major Improvement)

✅ Consolidated Build Flags: New -DZ3ED_AI=ON replaces multiple flags
- Old: -DYAZE_WITH_GRPC=ON -DYAZE_WITH_JSON=ON
- New: -DZ3ED_AI=ON (simpler, clearer intent)
✅ Fixed Gemini Crash: Graceful degradation when dependencies missing
✅ Better Error Messages: Clear guidance on missing dependencies
✅ Production Ready: Both backends tested and operational

Build System

✅ Auto-manages dependencies (JSON, YAML, httplib, OpenSSL)
✅ Backward compatible with old flags
✅ Ready for build modularization (optional libyaze_agent.a)

Documentation

✅ Updated build instructions with Z3ED_AI flag
✅ Added migration guide: Z3ED_AI_FLAG_MIGRATION.md
✅ Clear troubleshooting section with common issues

Troubleshooting

"Build with -DZ3ED_AI=ON" warning

Impact: AI agent features disabled (no Ollama or Gemini)
Solution: Rebuild with AI support:

cmake -B build -DZ3ED_AI=ON
cmake --build build --target z3ed

"gRPC not available" error

Impact: GUI testing and automation disabled
Solution: Rebuild with -DYAZE_WITH_GRPC=ON (also requires Z3ED_AI)

AI generates invalid commands

Causes: Vague prompt, unfamiliar tile IDs, missing context
Solutions:

Use specific coordinates and tile types
Reference tile16 IDs from documentation
Provide map context ("Light World", "map 0")
Check ResourceLabels are loaded for your project
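Generated commands can also be sanity-checked before they are executed. The sketch below accepts only lines shaped like the `overworld set-tile` example shown earlier in this README. The pattern is an assumption based on that single example, so the real command parser may accept other flag orders or number formats; `is_valid_set_tile` is a hypothetical helper.

```shell
# Hypothetical check: succeed only if the line looks like
#   overworld set-tile --map <n> --x <n> --y <n> --tile 0x<hex>
# ASSUMPTION: flag order and number formats follow the example in this README.
is_valid_set_tile() {
  printf '%s\n' "$1" | grep -Eq \
    '^overworld set-tile --map (0x[0-9A-Fa-f]+|[0-9]+) --x [0-9]+ --y [0-9]+ --tile 0x[0-9A-Fa-f]+$'
}

# Usage (not executed here):
#   if is_valid_set_tile "$cmd"; then z3ed $cmd; else echo "rejected: $cmd" >&2; fi
```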

Testing the conversational agent

Problem: TUI chat requires interactive input
Solution: Use the new batch testing mode:

# Run with default test cases (no interaction required)
z3ed agent test-conversation --rom zelda3.sfc

# Or use the automated test script
./scripts/test_agent_conversation_live.sh

Verifying ImGui test harness

Problem: Unsure if GUI automation is working
Solution: Run the verification script:

./scripts/test_imgui_harness.sh

Gemini-Specific Issues

"Cannot reach Gemini API": Check your internet connection, API key, and that you've built with SSL support.
"Invalid Gemini API key": Regenerate your key at aistudio.google.com/apikey.
