Skip to content

Commit ea30d4a

Browse files
authored
Merge pull request #15 from bgyss/codex/plan-merging-research-into-pipeline
Fix automation hashing clippy warning
2 parents fccf638 + 46bbdee commit ea30d4a

34 files changed

Lines changed: 1914 additions & 4014 deletions

PLANS.md

Lines changed: 94 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -24,9 +24,103 @@ This file tracks implementation work derived from specs that do not yet have a c
2424
- SPEC-190 Video-Based Validation
2525
- SPEC-200 title-a24b9e807b456252 First-Level Milestone (macOS/aarch64)
2626
- SPEC-210 Automated Recompilation Loop
27+
- SPEC-220 Input Replay and Interaction Scripts
28+
- SPEC-230 Reference Media Normalization
2729
- SPEC-240 Validation Orchestration and Triage
2830
- SPEC-250 Automation Services and Data Flow
2931
- SPEC-260 Agent Pipeline Security and Automation
32+
- SPEC-270 Comprehensive Automated Static Recompilation Solution
33+
34+
## Reopened Active Work (SPEC-210 to SPEC-270)
35+
These specs are now active implementation work. Items remain unchecked until merged and verified.
36+
37+
## SPEC-210: Automated Recompilation Loop
38+
Outcome
39+
- Expose a single automation command and complete deterministic stage-level cache behavior.
40+
41+
Work items
42+
- [ ] Expose `recomp automate --config <path>` in CLI and document usage.
43+
- [ ] Add stage-level cache key behavior based on input fingerprint + stage signature + tool versions.
44+
- [ ] Extend run-manifest with run-level metadata and per-step cache/attempt details.
45+
- [ ] Add regression tests for valid config, invalid config, resume, and downstream-only invalidation.
46+
47+
Exit criteria (from SPEC-210)
48+
- A single command runs intake-through-validation for supported modes.
49+
- Cache behavior preserves unaffected upstream stages when downstream commands change.
50+
51+
## SPEC-220: Input Replay and Interaction Scripts
52+
Outcome
53+
- Keep input replay deterministic and consistently linked to automation + validation outputs.
54+
55+
Work items
56+
- [ ] Add automation docs for wiring `input_script.toml` into loop runs.
57+
- [ ] Add replay log linkage into run-manifest/artifact index metadata.
58+
- [ ] Add replay determinism test coverage for same-script repeated runs.
59+
60+
Exit criteria (from SPEC-220)
61+
- Input replay is deterministic across repeated runs.
62+
- Replay alignment markers are represented in validation artifacts.
63+
64+
## SPEC-230: Reference Media Normalization
65+
Outcome
66+
- Make normalization profile and outputs first-class run inputs.
67+
68+
Work items
69+
- [ ] Record normalization metadata in run-manifest and artifact index docs.
70+
- [ ] Add docs/tests for VFR-to-CFR handling and hash stability.
71+
- [ ] Ensure automation docs enforce external storage and canonical profile usage.
72+
73+
Exit criteria (from SPEC-230)
74+
- Normalized reference media and hashes are reproducible and traceable.
75+
76+
## SPEC-240: Validation Orchestration and Triage
77+
Outcome
78+
- Standardize report + triage outputs across local automation runs.
79+
80+
Work items
81+
- [ ] Align `recomp-validation artifacts` docs/schema with extended run-manifest references.
82+
- [ ] Document triage expectations and follow-up workflow in runbook/docs.
83+
- [ ] Add deterministic report regression tests for artifact-index workflows.
84+
85+
Exit criteria (from SPEC-240)
86+
- Validation runs emit deterministic reports and actionable triage details.
87+
88+
## SPEC-250: Automation Services and Data Flow
89+
Outcome
90+
- Define local-first run/state/event contracts with cloud-compatible schemas.
91+
92+
Work items
93+
- [ ] Publish run submission/status schema docs with local/cloud/hybrid execution modes.
94+
- [ ] Document stage event names and required fields.
95+
- [ ] Add artifact index schema aligned with run-manifest references.
96+
97+
Exit criteria (from SPEC-250)
98+
- Local runs and cloud-ready schemas share the same stage/data contracts.
99+
100+
## SPEC-260: Agent Pipeline Security and Automation
101+
Outcome
102+
- Provide policy and security envelopes now, enforce in cloud phase.
103+
104+
Work items
105+
- [ ] Add versioned JSON schemas for model request envelope and automation policy record.
106+
- [ ] Add local policy metadata fields in automation config and run-manifest documentation.
107+
- [ ] Add security test coverage that prevents secret/key path leakage in emitted logs.
108+
109+
Exit criteria (from SPEC-260)
110+
- Security/policy interfaces are schema-defined and auditable without blocking local runs.
111+
112+
## SPEC-270: Comprehensive Automated Static Recompilation Solution
113+
Outcome
114+
- Keep all planning/spec/docs/code surfaces aligned under dual-track, local-first strategy.
115+
116+
Work items
117+
- [ ] Publish integration meta-spec and link dependent specs/docs.
118+
- [ ] Resolve contradictions between homebrew-first and retail milestone docs.
119+
- [ ] Update roadmap phaseing for Automation + Security Hardening.
120+
- [ ] Keep runbook and automation docs in sync with shipped CLI behavior.
121+
122+
Exit criteria (from SPEC-270)
123+
- Specs, plans, roadmap, and docs are internally consistent and implementation-ready.
30124

31125
## SPEC-000: Project Charter and Ethics
32126
Outcome
@@ -327,24 +421,6 @@ Exit criteria (from SPEC-200)
327421
- First-level gameplay matches the reference video within defined tolerances.
328422
- No proprietary assets or keys are stored in the repo or build outputs.
329423

330-
## Cross-Cutting Intake: LLM-Assisted Long-Tail Decompilation (2026-02-16)
331-
Outcome
332-
- Reduce long-tail churn by ranking harder work using similarity and enforcing stronger unattended automation guardrails.
333-
334-
Work items
335-
- [ ] Add similarity-based candidate ranking for unresolved functions (for example bounded opcode-sequence distance plus optional embedding retrieval) and record top references in run metadata.
336-
- [ ] Add periodic cleanup/documentation passes for already-matched functions so they remain high-quality exemplars for similarity-guided attempts.
337-
- [ ] Add specialist task lanes for graphics macro recovery, math-heavy functions, and mechanical cleanup/refactor work.
338-
- [ ] Add hard guardrails that block integrity-sentinel edits, generated-file edits, and skipped tests in unattended agent loops.
339-
- [ ] Capture long-tail metrics (`attempt_count`, `p85/p95 attempts`, `stall_reason`) in `run-manifest.json` and validation triage output.
340-
- [ ] Define escalation rules for outliers (for example very large functions or repeated failures) to prevent unbounded token burn.
341-
342-
Related specs
343-
- SPEC-170 Function Discovery and Control-Flow Graph
344-
- SPEC-210 Automated Recompilation Loop
345-
- SPEC-250 Automation Services and Data Flow
346-
- SPEC-260 Agent Pipeline Security and Automation
347-
348424
## Future Work Notes: title-a24b9e807b456252 Video Validation (SPEC-190/200)
349425
Requirements to finalize:
350426
- Define the capture spec (resolution, fps, constant frame rate, audio rate/bit depth/channels, container/codec).

README.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -36,9 +36,11 @@ Legal and provenance policy:
3636
- `docs/static-recompilation-flow.md` outlines a hypothetical macOS static recompilation flow and verification pipeline (see the Real XCI intake section for external-tool usage).
3737
- `docs/xci-intake.md` documents the XCI intake workflow and mock extractor format.
3838
- `docs/validation-artifacts.md` defines validation artifact indexing, workflows, and dependencies.
39+
- `docs/run-manifest-schema.md` and `docs/artifact-index-schema.md` document automation/validation metadata schemas.
3940
- `docs/validation-video.md` describes the hash-based video validation workflow.
4041
- `docs/title-hash-ingest.md` documents hash-based title/data ingest and local decoder-ring files.
4142
- `scripts/capture-validation.sh`, `scripts/capture-video-macos.sh`, `scripts/capture_video.sh`, and `scripts/validate_artifacts.sh` provide capture and validation helpers.
43+
- `cargo run -p recomp-cli -- automate --config <automation.toml>` runs the local end-to-end automation loop.
4244
- `scripts/ingest_hashed_title.sh` hashes title/data paths for private local workspaces.
4345
- `scripts/rewrite_title_refs_in_history.sh` rewrites commit-message title references to hash ids.
4446

ROADMAP.md

Lines changed: 12 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -51,7 +51,18 @@ Exit criteria:
5151
- Performance targets met on baseline host.
5252
- Automated validation produces a report with stable metrics.
5353

54-
## Phase 5: Stabilization
54+
## Phase 5: Automation + Security Hardening
55+
- Expose and stabilize the single-command automation loop (`recomp automate`).
56+
- Extend run-manifest and artifact-index schemas for deterministic lifecycle traceability.
57+
- Reconcile homebrew baseline and hashed retail pilot under shared automation contracts.
58+
- Add local-first policy/redaction metadata and cloud-ready security envelope schemas.
59+
60+
Exit criteria:
61+
- One command runs intake-through-validation for supported local modes.
62+
- Stage cache invalidation is deterministic and scoped to affected downstream stages.
63+
- Security/policy schemas are documented and audit-ready for cloud rollout.
64+
65+
## Phase 6: Stabilization
5566
- Harden tooling, improve diagnostics, and expand coverage.
5667
- Establish a compatibility list for tested titles.
5768
- Publish a contributor guide and spec updates.

0 commit comments

Comments
 (0)