Move semantic hints to user prompt for cross-request caching by neuromechanist · Pull Request #130 · Annotation-Garden/HEDit

neuromechanist · 2026-03-30T08:50:34Z

Summary

Move semantic hints from system prompt to user prompt so the system prompt is static per schema version
Add a pointer in the system prompt directing the LLM to check the user message for hints
Update tests for the new prompt structure

Problem

Prompt caching broke between requests because semantic hints (different per image) were embedded in the system prompt. Since Anthropic's caching uses prefix matching, any change invalidated the cache for the entire ~1000-tag vocabulary and rules section.

Solution

The system prompt now contains only static content (vocabulary, rules, patterns). A short pointer says "Check the user message for SEMANTIC HINTS." The actual hints are in the user prompt, which already changes per request.

For batch processing of 1000 images, the system prompt cost is paid once and cached for all subsequent requests (within the 5-minute TTL).

Test plan

455 tests pass, 0 failures
Comprehensive guide tests updated for new structure
Keyword extraction tests still pass (hints flow through user prompt)

Fixes #129

System prompt is now static per schema version, enabling prompt caching across requests. Semantic hints (which change per image/description) are placed in the user prompt instead. The system prompt includes a pointer instructing the LLM to check the user message for hints. Fixes #129

cloudflare-workers-and-pages · 2026-03-30T08:50:41Z

Deploying hedit with Cloudflare Pages

Latest commit:	`2a33a32`
Status:	✅ Deploy successful!
Preview URL:	https://c4a1b4e7.hedit.pages.dev
Branch Preview URL:	https://feature-issue-129-cache-frie.hedit.pages.dev

View logs

codecov · 2026-03-30T08:52:09Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

- Rename _format_semantic_hints to format_semantic_hints (public API, used cross-module) - Align header: system prompt pointer and actual section both say "SEMANTIC HINTS" - Soften system prompt wording to "may include" (hints are optional) - Skip hints with empty tag keys - Add debug logging when hints are included in user prompt - Add 10 tests: user prompt with/without hints, confidence bucketing, system prompt caching invariant

neuromechanist · 2026-03-30T10:44:29Z

PR Review Summary (3 agents: code-reviewer, silent-failure-hunter, test-analyzer)

Critical Issues (0 found)

None.

Important Issues (4 found, ALL FIXED in `2a33a32`)

[code-reviewer] Header mismatch: system prompt pointer said "SEMANTIC HINTS" but actual output was "## POTENTIALLY RELEVANT TAGS". Aligned both to "SEMANTIC HINTS".
[code-reviewer + silent-failure-hunter] System prompt said "Check the user message for..." (imperative) even when no hints exist. Changed to "The user message may include... If no hints section is present, proceed without them."
[silent-failure-hunter] Deferred import of private _format_semantic_hints function. Made it public (format_semantic_hints) and moved import to module level. Also added continue for hints with empty tag keys.
[test-analyzer] Zero test coverage for hints in user prompt. Added 10 tests:
- test_first_pass_with_semantic_hints
- test_correction_pass_with_semantic_hints
- test_no_hints_no_hints_section
- test_empty_hints_no_hints_section
- TestFormatSemanticHints (4 tests: None, empty, valid, confidence bucketing, empty tags)
- TestSystemPromptCaching (system prompt has pointer but not dynamic content)

Suggestions (noted, not fixed)

[silent-failure-hunter] No debug logging when hints included. Fixed: added logger.debug.
[test-analyzer] Module-level format_semantic_hints could use boundary-score tests (exactly 0.8, 0.5). Low priority.

All tests pass

465 passed, 0 failures

…hing (#135) * Move semantic hints to user prompt for cross-request caching (#130) * Move semantic hints from system prompt to user prompt System prompt is now static per schema version, enabling prompt caching across requests. Semantic hints (which change per image/description) are placed in the user prompt instead. The system prompt includes a pointer instructing the LLM to check the user message for hints. Fixes #129 * Address review findings for cache-friendly prompts - Rename _format_semantic_hints to format_semantic_hints (public API, used cross-module) - Align header: system prompt pointer and actual section both say "SEMANTIC HINTS" - Soften system prompt wording to "may include" (hints are optional) - Skip hints with empty tag keys - Add debug logging when hints are included in user prompt - Add 10 tests: user prompt with/without hints, confidence bucketing, system prompt caching invariant * Bump version to 0.7.6.dev3 --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* Move semantic hints to user prompt for cross-request caching (#130) * Move semantic hints from system prompt to user prompt System prompt is now static per schema version, enabling prompt caching across requests. Semantic hints (which change per image/description) are placed in the user prompt instead. The system prompt includes a pointer instructing the LLM to check the user message for hints. Fixes #129 * Address review findings for cache-friendly prompts - Rename _format_semantic_hints to format_semantic_hints (public API, used cross-module) - Align header: system prompt pointer and actual section both say "SEMANTIC HINTS" - Soften system prompt wording to "may include" (hints are optional) - Skip hints with empty tag keys - Add debug logging when hints are included in user prompt - Add 10 tests: user prompt with/without hints, confidence bucketing, system prompt caching invariant * Bump version to 0.7.6.dev3 * Update default models to latest Qwen and Anthropic - Evaluation: qwen/qwen3-235b-a22b-2507 -> qwen/qwen3.5-397b-a17b (most capable Qwen MoE, $0.39/M prompt) - Vision: qwen/qwen3-vl-30b-a3b-instruct -> qwen/qwen3-vl-32b-instruct (newer VL model, $0.10/M prompt) - Annotation: keep anthropic/claude-haiku-4.5 (unchanged) - Replace all legacy gpt-oss-120b references in defaults and docs - Provider: let OpenRouter auto-route for Qwen models * Bump version to 0.7.6.dev4 --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

neuromechanist merged commit c37650c into develop Mar 30, 2026
14 checks passed

neuromechanist deleted the feature/issue-129-cache-friendly-prompts branch March 30, 2026 10:46

neuromechanist mentioned this pull request Mar 30, 2026

Release v0.7.6a1: keyword extraction, streaming telemetry, prompt caching #135

Merged

5 tasks

neuromechanist mentioned this pull request Apr 1, 2026

Update default models to Qwen 3.5 and fix prompt caching #136

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Move semantic hints to user prompt for cross-request caching#130

Move semantic hints to user prompt for cross-request caching#130
neuromechanist merged 2 commits into
developfrom
feature/issue-129-cache-friendly-prompts

neuromechanist commented Mar 30, 2026

Uh oh!

cloudflare-workers-and-pages Bot commented Mar 30, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Mar 30, 2026 •

edited

Loading

Uh oh!

neuromechanist commented Mar 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

neuromechanist commented Mar 30, 2026

Summary

Problem

Solution

Test plan

Uh oh!

cloudflare-workers-and-pages Bot commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploying hedit with Cloudflare Pages

Uh oh!

codecov Bot commented Mar 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

neuromechanist commented Mar 30, 2026

PR Review Summary (3 agents: code-reviewer, silent-failure-hunter, test-analyzer)

Critical Issues (0 found)

Important Issues (4 found, ALL FIXED in 2a33a32)

Suggestions (noted, not fixed)

All tests pass

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

cloudflare-workers-and-pages Bot commented Mar 30, 2026 •

edited

Loading

codecov Bot commented Mar 30, 2026 •

edited

Loading

Important Issues (4 found, ALL FIXED in `2a33a32`)