docs(readme): submission perf/claim honesty hygiene (claim-tightening only)#19

Merged
RMANOV merged 1 commit into main from feat/a2-perf-honesty on Jun 16, 2026

RMANOV (Owner) commented Jun 15, 2026

Documentation-only claim-tightening for the submission-facing README (Wave-2 A2). No source/capability code changes. ADVOCATE diff-gate PASS (dafc164d1119); CONDUCTOR GO (4504bed10454).

  • C1 provenance/date on every perf/scale number (prior measured; re-run on exact submission commit).
  • C2 neutral "Criterion benchmark profile" (repo has no custom [profile.bench]); removed optimized/unoptimized assertion.
  • C3 ~400-500 drone ceiling = audit-derived estimate/roadmap only, not demonstrated.
  • C4 no 2000+ presented as demonstrated/current (single mention = roadmap target).
  • C5 kinematic Python replay kept distinct from Rust OODA/CBF.

Citations point to in-tree Project_Docs/CAPABILITY_BOUNDARY.md + EVIDENCE_PACKET.md. press-releases/ untouched. Rebased onto current origin/main 140583d (A1 demo artifacts preserved).
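
The C2 premise (no custom `[profile.bench]` in the repo) is something a reviewer could spot-check mechanically. A minimal sketch follows, assuming the manifest text has already been read into a string. The function name `has_custom_bench_profile` is hypothetical and not part of this repository, and the check is a line-based heuristic rather than a full TOML parse.

```rust
/// Returns true if the manifest text declares a `[profile.bench]` table
/// or one of its sub-tables (for example `[profile.bench.package.foo]`).
///
/// Hypothetical helper for illustration only. It scans table headers line
/// by line and does not parse TOML, so unusual formatting may be missed.
pub fn has_custom_bench_profile(manifest: &str) -> bool {
    manifest.lines().any(|line| {
        // Drop any trailing `#` comment, then surrounding whitespace.
        // A fully commented-out header becomes an empty string here.
        let header = line.split('#').next().unwrap_or("").trim();
        header == "[profile.bench]" || header.starts_with("[profile.bench.")
    })
}
```

One way to use it would be to pass the result of `std::fs::read_to_string("Cargo.toml")` for each manifest in the workspace and confirm that every call returns `false` before keeping the neutral "Criterion benchmark profile" wording.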

Copilot AI review requested due to automatic review settings June 15, 2026 20:47

Copilot AI left a comment

Pull request overview

This PR tightens and clarifies the README’s performance/scale claims by explicitly labeling benchmark figures as prior, software-only measurements and by separating demonstrated benchmark scope from estimate/roadmap statements, with pointers to the in-tree claim boundary and evidence packet.

Changes:

  • Rewords the “Performance Snapshot” intro to emphasize point-in-time, re-run-on-submission-commit requirements and the lack of a custom Cargo [profile.bench].
  • Updates the “full tick” interpretation to avoid implying end-to-end, sensor/RF/platform I/O validation.
  • Adds explicit “Scale (estimate / roadmap, not demonstrated)” language and links to the canonical claim boundary and evidence packet.

Comment thread README.md Outdated
Comment thread README.md Outdated
…ive bound

Wave-2 A2 (STRIX perf/claim honesty hygiene) under DIANA D0 claim-freeze.
DOCS-ONLY, claim-TIGHTENING. Submission-facing README only.

- Performance Snapshot: mark numbers as prior-measured software results with
  reproduction command (`cargo bench`); add explicit "re-run on the exact
  submission commit" marker. Use neutral "Criterion benchmark profile" framing
  (no custom [profile.bench] verified in-repo) instead of optimized-vs-unoptimized
  assertion, to avoid reintroducing a prior honesty-round mismatch.
- Tighten the 1.15 ms/20-drone prose: drop "comfortably fits ... significant
  headroom"; state it is a software-only figure with no sensor/RF/platform-I/O
  budget, to be re-confirmed.
- Surface the ~400-500 single-node ceiling as an AUDIT-DERIVED ESTIMATE and
  2000+ as a roadmap target, NEVER demonstrated, where scale is discussed.
  Cited to the in-tree canonical claim map (CAPABILITY_BOUNDARY.md) and
  EVIDENCE_PACKET.md item 7. The audit file (docs/audit-2026-03-29.md) is
  gitignored and not in the submission tree, so it is not linked.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@RMANOV RMANOV force-pushed the feat/a2-perf-honesty branch from 59dc64b to f10b1e2 Compare June 15, 2026 20:58
@RMANOV RMANOV merged commit 0da3ddd into main Jun 16, 2026
10 checks passed