feat(claude-code): auto-discover plugin marketplace agents by leih1219 · Pull Request #231 · awslabs/cli-agent-orchestrator

leih1219 · 2026-05-08T01:06:09Z

Summary

Auto-discover agents from enabled Claude Code plugin marketplaces (those
declared in ~/.claude/settings.json → extraKnownMarketplaces and
enabled in enabledPlugins). Today, agents installed as part of a
Claude Code plugin are invisible to CAO unless the user manually points
agent_dirs.claude_code at the plugin's agents/ directory. This change
makes them discoverable by default.

Motivation

Claude Code is the only supported provider without an auto-populated
default agent directory. constants.py defines:

Q_AGENTS_DIR        = ~/.aws/amazonq/cli-agents
KIRO_AGENTS_DIR     = ~/.kiro/agents
COPILOT_AGENTS_DIR  = ~/.copilot/agents
OPENCODE_AGENTS_DIR = ~/.aws/opencode/agents
# no equivalent for Claude Code

Users who install a Claude Code plugin that ships agents (via the plugin
marketplace mechanism) expect those agents to be selectable from CAO
without further manual setup. This PR closes that gap for the
marketplace-plugin case.

What changed

src/cli_agent_orchestrator/utils/agent_profiles.py — adds
_discover_claude_plugin_agent_dirs(). It reads
~/.claude/settings.json, iterates extraKnownMarketplaces, parses
each marketplace's .claude-plugin/marketplace.json, and for each
plugin present in enabledPlugins["<plugin>@<marketplace>"] collects
the plugin's agents/ directory. Integrated into both
list_agent_profiles() and _read_agent_profile_source() between
provider directories and extra user-added directories.
src/cli_agent_orchestrator/services/settings_service.py — changes
the default agent_dirs.claude_code value from
~/.aws/cli-agent-orchestrator/agent-store to ~/.claude/agents/,
aligning with Claude Code's native user-level subagent directory.
Users with a saved agent_dirs.claude_code value in their
settings.json are unaffected because get_agent_dirs() merges saved
values over defaults.
docs/settings.md — new section documenting plugin-marketplace
discovery, precedence order, how to disable via removing
enabledPlugins, and known limitations.
CHANGELOG.md — Unreleased entry.

Security & path safety

Marketplace-directory containment: plugin_dir.resolve().relative_to(marketplace_root.resolve())
prevents traversal across marketplace boundaries.
Per-file containment (added in round 2, commit b43a1d2):
_scan_plugin_directory resolves each candidate file and validates
resolve().relative_to(plugin_root) before adding it. Symlinks inside
agents/ that point outside the plugin root are rejected with a log
warning. Scoped narrowly to the claude_plugin discovery path — other
_scan_directory callers are untouched.
All JSON reads (settings.json, marketplace.json) are wrapped with
JSONDecodeError + OSError handlers. Missing or malformed files
produce a single log warning and return an empty list.
Discovery runs only on the claude_code provider path; other
providers are untouched.

Round 2 review response (2026-05-12)

Three commits address @anilkmr-a2z's review feedback, plus one test-coverage commit:

6a93db3 — fix(agent_profiles): strip multiple leading HTML comment blocks. Regex changed from ^\s* to ^(?:\s*)+ so consecutive leading blocks are stripped in a single pass.
b43a1d2 — fix(agent_profiles): validate per-file containment for plugin agents. Adds _scan_plugin_directory wrapper with resolve() + relative_to() on each file; scope is narrow (claude_plugin only) and a regression guard test pins the scope.
878f315 — perf(agent_profiles): cache claude plugin discovery with mtime-based invalidation. Module-level cache keyed on settings.json mtime plus each marketplace.json mtime. Automatic invalidation on any tracked-file change.
22e3e92 — test(agent_profiles): expand coverage for plugin discovery fixes. +7 tests and +2 mocks, pushing agent_profiles.py line coverage from 85% → 92%.

Tests

test/utils/test_claude_plugin_discovery.py now covers:

Happy path: enabled plugin with agents/ directory
Missing / malformed ~/.claude/settings.json
Malformed marketplace.json
Plugin source pointing at a non-existent path, or at a file instead of
a directory
Plugin present but not enabled
Cross-marketplace name collision (first-wins dedup)
Path traversal attempt blocked at marketplace → plugin hop
Orphan enabled-plugin entry referencing a missing marketplace
Empty enabledPlugins map
list_agent_profiles integration: plugin source label, local-beats-plugin dedup
_read_agent_profile_source integration: found in plugin dir, raises
FileNotFoundError when not found anywhere
Per-file symlink containment (round 2): symlink inside plugin
agents/ pointing outside plugin root is rejected with a warning;
symlinks resolving within the plugin root are accepted
Scope regression guard (round 2): symlinks in non-plugin
directories (e.g., ~/.kiro/agents/) are NOT rejected — the
narrow-scope decision is pinned
Discovery cache (round 2): cache hit avoids recomputation;
mtime changes on settings.json or any marketplace.json force
re-discovery; new-marketplace-added and marketplace-disappeared cases;
explicit _reset_plugin_discovery_cache()

test/utils/test_agent_profiles.py additionally covers multi-block HTML
comment stripping (1, 2, and 3+ leading blocks; multi-line blocks;
non-leading comments not stripped).

Run locally:

pytest test/utils/test_claude_plugin_discovery.py test/utils/test_agent_profiles.py -v

Coverage on agent_profiles.py: 92%.
Full non-infra suite: 1642 passed / 5 skipped.

Backwards compatibility

Users with agent_dirs.claude_code saved in
~/.aws/cli-agent-orchestrator/settings.json keep their existing
behavior (saved-over-default merge).
Users on the old default silently switch from scanning
~/.aws/cli-agent-orchestrator/agent-store to ~/.claude/agents/.
Since agent-store is already scanned as LOCAL_AGENT_STORE_DIR,
this is a no-op for agent visibility.
_discover_claude_plugin_agent_dirs return type changed from
List[Path] to List[Tuple[Path, Path]] (adding the marketplace
root for per-file containment). The function is underscore-prefixed
and private to agent_profiles.py; all in-source callers are
updated.
Other public function signatures (list_agent_profiles,
_read_agent_profile_source, load_agent_profile) are unchanged.

Follow-ups (intentionally out of scope)

Workspace-level .claude/agents/ discovery (project-local agents)

Happy to open a separate issue for the above if the maintainers prefer tracking it.

Agents enabled via Claude Code plugin marketplaces (~/.claude/settings.json → extraKnownMarketplaces) are now discoverable by CAO without requiring a manual agent_dirs.claude_code entry in ~/.aws/cli-agent-orchestrator/settings.json. Changes: - utils/agent_profiles.py: add _discover_claude_plugin_agent_dirs() that walks enabled marketplaces, validates marketplace.json, and collects each plugin's agents/ directory. Integrated into list_agent_profiles() and _read_agent_profile_source() scan order. - services/settings_service.py: change default agent_dirs.claude_code from ~/.aws/cli-agent-orchestrator/agent-store to ~/.claude/agents/. Users with a saved value are unaffected (saved-over-default merge semantics). - docs/settings.md: document the new discovery behavior. - test/utils/test_claude_plugin_discovery.py: unit tests covering happy path, empty enabledPlugins, orphan plugin entries, file-vs-dir source validation, cross-marketplace name collision, symlink escape, and _read_agent_profile_source() plugin integration. - CHANGELOG.md: Unreleased entry noting discovery + default path change.

haofeif · 2026-05-08T01:42:09Z

@patricka3125 would you like to review this if you get a chance ?

codecov-commenter · 2026-05-08T01:43:26Z

Codecov Report

❌ Patch coverage is 85.84071% with 16 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (main@acb78e6). Learn more about missing BASE report.

Files with missing lines	Patch %	Lines
src/cli_agent_orchestrator/utils/agent_profiles.py	85.84%	16 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #231   +/-   ##
=======================================
  Coverage        ?   92.73%           
=======================================
  Files           ?       65           
  Lines           ?     5548           
  Branches        ?        0           
=======================================
  Hits            ?     5145           
  Misses          ?      403           
  Partials        ?        0

Flag	Coverage Δ
unittests	`92.73% <85.84%> (?)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

…arsing Some profile generators (e.g. AIM `plugins install --local`) prepend  HTML comment blocks before the YAML frontmatter delimiter. python-frontmatter requires '---' on the first line, so these profiles silently parse with empty metadata — causing mcpServers, model, and allowedTools to be None at runtime. Add a defensive regex strip at the top of parse_agent_profile_text() so CAO handles these profiles correctly regardless of upstream generator behavior. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

anilkmr-a2z · 2026-05-09T05:48:05Z

+            plugin_dir = (mkt_path / plugin_source).resolve()
+            # Path containment check
+            try:
+                plugin_dir.relative_to(resolved_mkt)


Individual files still could be outside. Probably should validate individual files too.

Addressed in b43a1d2 — added a _scan_plugin_directory wrapper that resolves each candidate file and validates resolve().relative_to(plugin_root) before adding it. Scoped narrowly to the claude_plugin discovery path (not broadening _scan_directory, so other agent sources keep their existing behavior).

Tests covering this:

test_symlink_outside_plugin_root_rejected — symlink inside agents/ pointing outside plugin root is skipped with a warning

test_symlink_within_plugin_root_accepted — symlinks resolving within the plugin root still work

test_read_agent_profile_source_rejects_symlink_escape — covers the second call site end-to-end

TestFixAScopeIsNarrow::test_symlink_escape_in_non_plugin_dir_not_blocked — regression guard so the narrow-scope decision is pinned

anilkmr-a2z · 2026-05-09T05:50:00Z

+    # Strip leading HTML comments before the YAML frontmatter fence.
+    # Some profile generators (e.g. AIM) prepend <!-- ... --> blocks that
+    # prevent python-frontmatter from detecting the opening '---' delimiter.
+    resolved_text = re.sub(r"^<!--.*?-->\s*", "", resolved_text, flags=re.DOTALL)


HTML comment stripping regex only removes a single leading comment block. If a generator produces multiple consecutive comments (e.g., ), only the first is stripped and the second still blocks frontmatter detection.

Addressed in 6a93db3 — changed the regex from ^\s* to ^(?:\s*)+ so all consecutive leading comment blocks are stripped in a single re.sub call. Linear-time (no backtracking risk — the inner .*?--> is bounded by a literal terminator, the outer + advances sequentially).

Tests:

test_multiple_leading_html_comments_stripped — two consecutive blocks

test_three_leading_html_comments_stripped — three blocks (edge case)

Existing test_leading_html_comment_stripped_before_frontmatter / test_multiline_html_comment_stripped / test_no_comment_passthrough / test_non_leading_comment_not_stripped all still pass

anilkmr-a2z

Mostly minor commant except one.

haofeif · 2026-05-10T01:02:02Z

@leih1219 can you please help to address the comments ?

The ^-anchored regex introduced in 03e6d35 only matched the first leading HTML comment block. Profiles with multiple consecutive  blocks (e.g. AIM-generated profiles with a boilerplate header + per-agent header) would have only the first stripped, causing frontmatter detection to fail. Replace the single-match pattern with a non-capturing repeat group `^(?:\s*)+` so all leading comment blocks are stripped in a single regex call.

Individual files inside a plugin's agents/ directory (e.g. symlinks pointing outside the marketplace root) were enumerated without per-file validation. Add _scan_plugin_directory() that wraps _scan_directory with resolve()+is_relative_to() checks against the marketplace root for each file entry. Scope is narrow: only the claude_plugin discovery path uses the new containment check. Other _scan_directory callers (local store, provider dirs, extra dirs) are unchanged. _discover_claude_plugin_agent_dirs() now returns (agents_dir, plugin_root) tuples so callers can pass the root for containment validation. Addresses reviewer comment about individual files escaping containment.

…invalidation _discover_claude_plugin_agent_dirs() is called on every list_agent_profiles() invocation from the long-running CAO server. Add a module-level cache keyed on the mtime of settings.json and each marketplace.json so repeated calls with unchanged filesystem state return instantly. Cache invalidation is automatic: any mtime change on settings.json triggers a full re-discovery, and mtime changes on individual marketplace.json files are also detected. Expose _reset_plugin_discovery_cache() (underscore-prefixed, test-only) so tests can isolate from each other. An autouse fixture in the test file ensures no cross-test cache pollution. Addresses reviewer comment about repeated filesystem reads on every invocation.

Tester-agent pass adding 7 new tests and 2 mock additions, filling scenario-coverage gaps surfaced by the tester task for PR awslabs#231: Fix C (HTML-comment strip): - test_three_leading_html_comments_stripped: edge-case for 3+ blocks Fix A (per-file plugin containment): - test_read_agent_profile_source_rejects_symlink_escape / accepts_regular_plugin_file: cover the second call site (_read_agent_profile_source) end-to-end - test_symlink_escape_in_non_plugin_dir_not_blocked: regression guard asserting the scope stays narrow (claude_plugin only) Fix B (discovery cache): - test_reset_plugin_discovery_cache_clears_state: explicit reset-helper test - test_cache_invalidates_when_new_marketplace_added - test_cache_invalidates_when_marketplace_json_disappears Mocks (home-dir leakage in existing tests): - test_builtin_store_exception_handled: mock _discover_claude_plugin_agent_dirs - test_non_md_builtin_files_skipped: same Coverage on agent_profiles.py: 85%% -> 92%% Impacted test count: 73 -> 80 (with the 2 mock-added tests now stable) Full suite: 1642 / 1642 non-infra tests pass.

patricka3125 · 2026-05-14T02:09:11Z

Hi @leih1219 this is a really interesting PR, great work. I have some overall thoughts on the high level direction and approach of this feature that I would like to request clarification on...

I think the motivation needs a sharper compatibility model before we add automatic discovery.

The PR assumes that “Claude Code plugin ships agents” implies “those agents should be selectable from CAO,” but I don’t think that follows yet. A Claude plugin agent is part of a broader Claude Code plugin runtime: plugins can ship agents alongside skills, hooks, MCP/LSP configs, monitors, bin/ executables, and default settings. CAO currently appears to consume only the agent markdown as a CAO agent profile, so an auto-discovered plugin agent may be listed without the surrounding plugin capabilities it was authored to rely on.

There is also a scoping question. Claude has distinct project, user, and plugin subagent scopes, with defined precedence. CAO has local/provider/custom/built-in profile sources and an orchestration-specific runtime model. Before scanning plugin marketplaces by default, I think we should define the relationship between:

CAO-native agent profiles
Claude user/project agents in .claude/agents and ~/.claude/agents
Claude plugin agents under plugin marketplaces

Those are not obviously interchangeable. Their frontmatter and semantics also differ: Claude subagents use fields like tools, disallowedTools, permissionMode, skills, mcpServers, hooks, isolation, etc., while CAO profiles have CAO/provider-specific concepts like provider, allowedTools, mcpServers, and orchestration assumptions. Some fields may be ignored, renamed, or meaningful only inside Claude Code.

Because of that, I’m not convinced automatic plugin-agent discovery is the right first step. It may make CAO list agents that look available but are degraded or incorrect when run outside the Claude plugin runtime. It also does not solve the stated expectation fully: if a plugin agent depends on plugin skills, hooks, scripts, or settings, simply pointing CAO at agents/ does not make the plugin usable in CAO.

I would prefer one of these narrower approaches:

Treat Claude plugin agents as a separate source type with an explicit “experimental/Claude-only” compatibility contract.
Add an explicit import/convert flow that maps Claude subagent fields into CAO profile fields and warns about unsupported plugin dependencies.
Start with documentation/design for the intended relationship between CAO profiles and Claude agents before enabling automatic discovery.
Avoid changing the default Claude agent directory until we know whether installing CAO profiles into ~/.claude/agents pollutes normal Claude Code workflows.

So I’m supportive of improving interop, but I think this PR currently conflates discoverability with compatibility. I’d like to see the compatibility/scoping model clarified before this becomes default behavior.

gutosantos82

PR Review: #231 — feat: auto-discover agents from Claude Code plugin marketplaces

Summary

The PR makes CAO auto-discover agent profiles from enabled Claude Code plugin
marketplaces (~/.claude/settings.json → extraKnownMarketplaces + enabledPlugins),
and changes the claude_code default agent dir from
~/.aws/cli-agent-orchestrator/agent-store to ~/.claude/agents/. The discovery,
mtime-cache, and path-containment mechanics are well-engineered and heavily tested. The
core concern is the trust posture: discovered profiles are loaded and launchable like
any other, but they originate from third-party marketplace content and carry executable
fields — and the feature is on by default with no CAO-side off switch. There is also a
recurring "plugin root" vs "marketplace root" naming/semantics drift and a dead
defense-in-depth branch that three reviewers independently flagged. Recommend Request
changes.

Correctness review did not return; that angle is not covered here. Tests could not be
executed in-session (no venv) — pass/fail was assessed by reading; the author/human should
run the suite.

Blocking (must fix before merge)

[security] src/cli_agent_orchestrator/utils/agent_profiles.py — auto-discovered claude_plugin profiles are fully trusted.
Discovered profiles parse straight into AgentProfile, which carries executable fields
(mcpServers command/args/env, hooks, command, tools/allowedTools). Launching
such an agent spins up MCP servers / hooks = arbitrary command execution defined by
third-party marketplace content. Worse, load_agent_profile runs resolve_env_vars()
(utils/env.py, Template.safe_substitute over CAO's managed .env) on the raw
profile text, so a hostile profile embedding ${AWS_SECRET_ACCESS_KEY} gets
CAO-managed secrets interpolated into its system_prompt or an mcpServers args/env
→ secret exfiltration + RCE. Amplified by this repo's documented --yolo /
--dangerously-skip-permissions launch convention. The env-substitution mechanism is
pre-existing, but this PR newly makes it reachable from untrusted third-party input,
gated only on enabledPlugins[...] == true. This is core utils/ code, so weighted up.
Fix: treat source == "claude_plugin" profiles as untrusted — drop/deny
mcpServers/hooks/command (or require an explicit per-agent allowlist), do not
apply resolve_env_vars() to plugin-sourced text, and surface claude_plugin
provenance prominently at launch.

Important (should fix)

[security/consistency] agent_profiles.py:96-145, 234-248, 310 — containment is checked against the MARKETPLACE root but named/documented as the PLUGIN root.
_discover_claude_plugin_agent_dirs returns (agents_dir, resolved_mkt) (the
marketplace root), yet it is threaded through as the param plugin_root, and
_scan_plugin_directory's docstring + warning claim it "prevents symlinks … from
exposing files outside the plugin tree." It only prevents escaping the marketplace
tree — a symlink in plugin A's agents/ can point into plugin B's files or the
marketplace's .claude-plugin/marketplace.json and pass. So the docstring overclaims,
and cross-plugin file disclosure / agent impersonation is possible.
docs/settings.md:112 ("within their marketplace root") is actually the accurate
statement; the code's internal naming/comments are what drift.
Fix: pick one — rename plugin_root → marketplace_root throughout (param,
docstring, warning at ~:111, read-path branch ~:389-395) and correct the docstring to
say "marketplace tree"; or tighten containment to the real plugin_dir if
per-plugin isolation was the intent. Make code + comments + docs agree.
[security] agent_profiles.py:309-311 / whole feature — auto-discovery is ON BY DEFAULT with no CAO-side off switch.
The only documented suppression path (toggling in ~/.claude/settings.json) also
disables the plugin in Claude Code; users cannot keep a plugin in Claude Code but hide
it from CAO. Auto-trusting every enabled marketplace plugin as a launchable CAO agent is
not a safe default. Fix: add a settings flag (e.g. enable_plugin_discovery,
default false) to make the surface opt-in, and/or an allowlist of trusted
marketplaces.
[security/tests/consistency] agent_profiles.py:382-396 — the per-file containment branch in _read_agent_profile_source is dead / ineffective code.
_lookup_in_directory reads only via _safe_join, which resolves symlinks and returns
None for anything escaping the agents dir (a strictly tighter root than
plugin_root). So found is non-None only when already contained → the function always
returns at :390 and the logger.warning(...); continue failure path is unreachable.
Bytes are also read before the re-check, so it could not gate content even if it fired.
The PR's own test docstring
(test_claude_plugin_discovery.py:1409-1418) concedes it is "effectively
defense-in-depth behind _safe_join." So the PR body's claimed "per-file containment in
_read_agent_profile_source" is real code that never fires.
Fix: remove the redundant branch (rely on _safe_join) or mark it explicitly as
unreachable defense-in-depth — and drop the misleading test that purports to cover it
(no test can reach it while _safe_join stands in front).
[consistency] docs/settings.md:20, :69 — Default Directories table + settings.json example still show the OLD claude_code default.
They show ~/.aws/cli-agent-orchestrator/agent-store, but settings_service.py:75
changed the default to ~/.claude/agents (and CHANGELOG announces it). Doc↔code drift
introduced by this PR. Fix: update the table row and the JSON example block to
~/.claude/agents.
[tests] settings_service default change has no real test.
test_settings_service.py:112 asserts result["claude_code"] == _DEFAULTS["claude_code"]
— a tautology (both sides reference the same constant), so an accidental revert of the
default would not be caught, and the CHANGELOG's "users with a saved
agent_dirs.claude_code are unaffected" migration guarantee is untested. Fix: assert
against the literal expected path (str(Path.home()/".claude"/"agents")) and add a
regression test that a saved override still wins over the new default.
[consistency] agent_profiles.py:337-349 — _read_agent_profile_source docstring "Search order:" is stale.
It still lists only 4 steps (local → provider → extra → built-in) and omits the
plugin-marketplace step this PR inserts between provider and extra dirs. Fix: add the
plugin step to the numbered search order.
[conventions/consistency] CHANGELOG.md — ## [Unreleased] heading glued to the intro paragraph with no blank line.
Breaks the Keep-a-Changelog / Markdown convention the file itself claims to follow and
can trip changelog automation. Fix: insert a blank line before the ## [Unreleased]
heading.
[conventions] CHANGELOG.md — incomplete entry: HTML-comment-stripping change not recorded.
The PR also changes user-facing frontmatter parsing (parse_agent_profile_text now
strips leading  blocks — agent_profiles.py:287-290), but the CHANGELOG only
mentions auto-discovery + the default-path change. Fix: add a Fixed/Changed line
noting profiles with leading HTML comments now parse correctly.

Nits (optional)

[consistency] docs/settings.md:7-12 — the top "Agent Profile Directories" ordered scan list (local/provider/extra/built-in) wasn't updated to include plugin-marketplace discovery, yet the new "Precedence" section (:92-94) says plugins scan before extra dirs. The two lists in the same doc disagree; insert plugins as the appropriate step.
[security] agent_profiles.py:108-137 — TOCTOU window: _is_path_contained/exists() are checked, then read_text() happens separately; a symlink swapped in between passes the check. Same-user FS, low practical risk; note only.
[conventions/security] agent_profiles.py:121,136 — frontmatter.loads(item.read_text()) uses platform-default encoding, unlike _read_agent_profile_source which uses encoding="utf-8". Prefer explicit utf-8 (pre-existing pattern copied from _scan_directory).
[conventions] agent_profiles.py:124-125, :138-139 — bare except Exception: pass in _scan_plugin_directory silently swallows errors. Mirrors _scan_directory (pre-existing), so don't block; suggest logger.debug(..., exc_info=True) in the new code.
[conventions] agent_profiles.py:148 — module-level mutable global _PLUGIN_DIR_CACHE; testability + staleness are handled (reset helper, autouse fixture, mtime keys), so acceptable. A small cache class would isolate it.
[conventions] agent_profiles.py:157-162 — _get_mtime uses os.stat(path).st_mtime; idiomatic pathlib is path.stat().st_mtime, which would also drop the sole os import.
[conventions] agent_profiles.py:148 — Optional[Dict] bare generic under mypy strict=true; matches existing loose typing in the module so likely passes, but a TypedDict for the cache shape would be clearer/strict-safe.
[conventions] test/utils/test_claude_plugin_discovery.py — third test file for the same module diverges slightly from the test_agent_profiles_* module-mirroring convention; several docstrings reference ephemeral commit labels ("Fix A/Fix C", 03e6d35).
[tests] agent_profiles.py:130-145 — the nested agent.md (directory-style plugin agent) branch of _scan_plugin_directory is untested; every fixture creates flat <name>-agent.md. Add one nested case.
[consistency] out-of-scope — the HTML-comment-stripping fix ("Fix C") and its tests are bundled into this discovery PR. Plausibly related (AIM-generated profiles), but distinct; confirm it belongs or split.

Tests

Coverage of the discovery / cache / containment logic is genuinely strong and mostly
meaningful — mtime invalidation (settings.json touch, marketplace.json touch/disappear,
new-marketplace-added, explicit reset, cache-hit-avoids-recompute), path traversal
(../../etc), non-directory source skipping, dedup precedence (local-beats-plugin,
cross-marketplace first-wins), the list-path per-file symlink containment (genuinely
reachable and verified), and the Fix-A-scope-stays-narrow regression guard are all solid
and not tautological. HTML-comment-strip regression is well covered including a negative
(non-leading) case. Markers are correct; symlink tests are skipif(win32)-guarded.
Two real gaps: (1) the Fix A branch in _read_agent_profile_source is unreachable
dead code with an illusory test, and (2) the settings default change has only a
tautological assertion. Tests were not executed in-session — the human should run
uv run pytest test/utils/test_claude_plugin_discovery.py test/utils/test_agent_profiles.py test/services/test_settings_service.py -q.

Verdict

Request changes — sound engineering, but the default-on trust posture (blocking) plus
the marketplace-vs-plugin-root drift, dead containment branch, and docs/default-path drift
should be resolved before merge. (Correctness angle not covered.)

The single "activity since review" flag was noisy — it keyed off the PR's updatedAt, which bumps on anything (CI, labels, bot comments), so it fired on Codecov-only activity (awslabs#350/awslabs#351) and conflated "code changed" with "someone commented." Now two precise signals in metadata + dashboard pills: - code_changed (🔁 re-review): a review exists at an older sha but not the current head → the author pushed, so it genuinely needs re-review. - human_activity (💬 discussion since review): the newest NON-BOT comment or review is later than our review file. Bots (codecov/dependabot/ github-actions/copilot) are excluded so their automated comments don't masquerade as human engagement. Verified: awslabs#231/awslabs#336/awslabs#115 flag human_activity (real maintainer comments); awslabs#350/awslabs#351 (Codecov-only) no longer flag; none flag code_changed (no pushes).

The human_activity flag was firing on the dashboard user's OWN comments (awslabs#115/awslabs#231/awslabs#336 were all gutosantos82) — no signal, you know what you said. Now exclude $ME (gh api user login) alongside bots, so the flag means 'someone OTHER than you engaged after the review' (author reply, another maintainer) — the actually-actionable case.

haofeif requested a review from patricka3125 May 8, 2026 01:41

haofeif added the enhancement New feature or request label May 8, 2026

Lei Huang and others added 3 commits May 8, 2026 17:43

style: apply black formatting to test_claude_plugin_discovery.py

5d8cee4

test(agent_profiles): cover HTML-comment strip before frontmatter parse

140d6e9

anilkmr-a2z reviewed May 9, 2026

View reviewed changes

Comment thread src/cli_agent_orchestrator/utils/agent_profiles.py Outdated

anilkmr-a2z reviewed May 9, 2026

View reviewed changes

Merge branch 'main' into claude-plugin-discovery

f49c827

Lei Huang and others added 6 commits May 11, 2026 23:15

Merge branch 'main' into claude-plugin-discovery

e186148

Merge branch 'main' into claude-plugin-discovery

f837d38

gutosantos82 requested changes Jul 1, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(claude-code): auto-discover plugin marketplace agents#231

feat(claude-code): auto-discover plugin marketplace agents#231
leih1219 wants to merge 11 commits into
awslabs:mainfrom
leih1219:claude-plugin-discovery

leih1219 commented May 8, 2026 •

edited

Loading

Uh oh!

haofeif commented May 8, 2026

Uh oh!

codecov-commenter commented May 8, 2026 •

edited

Loading

Uh oh!

anilkmr-a2z May 9, 2026

Uh oh!

leih1219 May 12, 2026

Uh oh!

Uh oh!

anilkmr-a2z May 9, 2026

Uh oh!

leih1219 May 12, 2026

Uh oh!

anilkmr-a2z left a comment

Uh oh!

haofeif commented May 10, 2026

Uh oh!

patricka3125 commented May 14, 2026 •

edited

Loading

Uh oh!

gutosantos82 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Uh oh!

Conversation

leih1219 commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

What changed

Security & path safety

Round 2 review response (2026-05-12)

Tests

Backwards compatibility

Follow-ups (intentionally out of scope)

Uh oh!

haofeif commented May 8, 2026

Uh oh!

codecov-commenter commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

anilkmr-a2z May 9, 2026

Choose a reason for hiding this comment

Uh oh!

leih1219 May 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

anilkmr-a2z May 9, 2026

Choose a reason for hiding this comment

Uh oh!

leih1219 May 12, 2026

Choose a reason for hiding this comment

Uh oh!

anilkmr-a2z left a comment

Choose a reason for hiding this comment

Uh oh!

haofeif commented May 10, 2026

Uh oh!

patricka3125 commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gutosantos82 left a comment

Choose a reason for hiding this comment

PR Review: #231 — feat: auto-discover agents from Claude Code plugin marketplaces

Summary

Blocking (must fix before merge)

Important (should fix)

Nits (optional)

Tests

Verdict

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

leih1219 commented May 8, 2026 •

edited

Loading

codecov-commenter commented May 8, 2026 •

edited

Loading

patricka3125 commented May 14, 2026 •

edited

Loading