Handle excluded input column in event expansion by rasmusfaber · Pull Request #6 · METR/inspect_scout

rasmusfaber · 2026-03-25T09:38:54Z

Summary

_expand_events_in_df crashed with KeyError when the input column was excluded from scan results but input_data was present
Added early-return guard for missing input column
Added unit test covering the exact failure scenario

Test plan

New unit test test_expand_events_no_input_column passes
Existing test_event_expansion.py tests still pass

🤖 Generated with Claude Code

…ai#294) * initial work on transcript node detection * transcript nodes for typescript * more work on design * wip * infra events * fixups * utility agents * add support for branches * timeline ui * updates * add section on registration to docs * regenerate docs * timeline panel * multiple timelines * custom outline * remove type suffix * custom outline design notes * design docs * implementation plan * more synthetic nodes * Add swimlane row computation and make node times non-nullable Phase 1 of timeline core logic: implement computeSwimLaneRows() which transforms an AgentNode's children into SwimLaneRow[] for rendering as horizontal swimlane bars. Handles sequential, iterative (multiple spans), and parallel (overlapping) agent patterns with case-insensitive grouping. Also makes start_time/end_time non-nullable across both Python and TypeScript node types, since every Event has a required timestamp field. This eliminates pervasive null checks throughout the codebase. Container nodes use epoch sentinel for the degenerate empty-content case. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * save timeline context * Add content item building for timeline detail panel Phase 2 of timeline core logic: implement buildContentItems() which transforms an AgentNode into a flat list of ContentItems (event, agent_card, branch_card). Branch cards are inserted after the event matching their forkedAt UUID; unmatched branches append at the end. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Add marker computation for timeline UI Phase 3 of timeline core logic: implement collectMarkers() which finds error, compaction, and branch markers in an AgentNode at configurable depth (direct, children, recursive). Includes isErrorEvent() and isCompactionEvent() helpers for event classification. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * update context * commit timeline * ruff * import event tree span * rename to timeline * rename to timelinebranch * rename to timeline node * update docs * add timeline scanning target * add support for timeline=true * improved auto-span detection * timeline scanning doc * don't do fingerprinting across compaction boundaries * rename timeline => timelines * update scanning doc * update with working guidelines * message_numbering function * add commit id * messages_by_compaction * add commit id * message chunking * update heading * message chunking * update scanning doc * phase 5 timeline control * improve compaction handling * add depth parameter * claude code transcript source * extract answer functions * update plan * refactor llm_scanner * update doc * parallel segment scanning * llm scanner reduction * update reference * draft doc enhancements * remove scan_segments * Improve minimap * clear breadcrumbs on reload * Baseline events * fine tuning * collapsible * Drive minimap with selection * tweak minimap * market improvements * Emplace timeline outline as placeholder * Show selection in breadcrumbs * Add marker controller to test rig * Toggleable outline * reorganize code * Refactoring * fitler_timeline rather than include param * scorers section * factor out timeline detection * move function * exclude scorers from transcript_messages by default * claude code import fixes * correct token counting * propagate to ts * reorder imports * fix lint * improve import perf * baselines script * improve code quality * more tests for claude code source * remove calls to getattr * improve event routing * correct total tokens * correctly populate task_repeat * properly group assistant messages * populate description field * more flexible slicing out of compactions * messages_by_compaction function * trajectories example * ensure previous messages are flushed after compaction * Make sure parent of selection is navigable * Improve spacing * populate top level claude code messages using timeline * collapse tool output in messages view * import_cc example * merge sessions * scout import * no gitignore * improve import ux * overwrite flag * apply limit by time for claude code * some tool views for claude code * improve /clear handling * yaml parsing cleanup * improve transcript dir handling * code review feedback * code review feedback * remove spurious user command messages * doc plan * consolidate messages_by_compaction into span_messages * initial restructuring of scanner docs * add docs on multi-label * update transcript_fields * update documentation plan * improve handling of model instances w/ multiprocessing * update scanner_ir with changes in llm_scanner * address feedback * improve custom scanner doc * more doc improvements * improve timeline docs * improve scanner tools * simplify llm reducer * update reference * scout import docs * improve llm reducer * improve default reducers * support for images from claude code logs * fix majority reducer * code review feedback * more liberal text extraction * regen docs * doc fixes * extract_refs * update test_generate tests * remove timelines from docs * Revert "remove timelines from docs" This reverts commit 6e3a887. * clean out some design and examples files * dev version callouts * more dev callouts * update changelog * ruff format * fix date format in python 3.10 * format typescript * reformat * update uv lock --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com> Co-authored-by: Charles Teague <charles@merdianlabs.ai>

* add timeline_dump and timeline_load functions * import timeline functions from inspect_ai * remove cc events

* set assistant message id from underlying jsonl event * update inspect dep * call inspect_swe for cc events * add inspect-swe depencency * tweak changelog

…ianlabs-ai#298) * timelines: Improve agent detection logic in `timeline_build()` * fix tests * update python tests

…nlabs-ai#295)

Implements a new source for importing traces from Weights & Biases Weave, following the existing pattern established by langsmith, logfire, and phoenix. Features: - Async generator that yields Transcript objects from Weave traces - Provider format detection for OpenAI, Anthropic, and Google - Tree building for hierarchical call/span structures - Event conversion (ModelEvent, ToolEvent, SpanBeginEvent, SpanEndEvent) - Message extraction using inspect_ai converters - Retry logic with tenacity for API resilience - Support for time-based filtering and custom filters Includes comprehensive test suite with: - 92 unit tests covering all modules - Integration test framework with bootstrap.py for real trace creation - Mock objects simulating real Weave call structures Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

meridianlabs-ai#299) Addresses meridianlabs-ai#270

Co-authored-by: Charles Teague <charles@meridianlabs.ai> Co-authored-by: jjallaire <jj.allaire@gmail.com>

Summary._report() was counting all items in a resultset (including value=0 items) as positive results. This caused the sidebar to show inflated numbers (e.g., 965) while the results list correctly filtered to only positive matches (e.g., 12). Now uses is_positive_value() to check each item's value field. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

RecorderBuffer loaded the existing _summary.json on both init and resume, causing summary counters (scans, results) to accumulate across scan reruns. Now init() passes reset=True to start fresh, while resume() preserves the existing summary as intended. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Make error file truncation conditional on reset (matching the summary logic). Previously errors were always truncated, even on resume. Also: parametrize counting tests, use monkeypatch for env vars, use public scan_summary() API in assertions. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Co-authored-by: Charles Teague <charles@meridianlabs.ai>

* Bump to main * correct types? * point to latest main --------- Co-authored-by: Charles Teague <charles@meridianlabs.ai>

…#350)

… events to client and expand client-side (meridianlabs-ai#351)

…labs-ai#352) Co-authored-by: Rasmus Faber-Espensen <rfaber@gmail.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

Co-authored-by: Charles Teague <charles@meridianlabs.ai>

…s-ai#358)

* Update to take marker fix * bump to latest --------- Co-authored-by: Charles Teague <charles@meridianlabs.ai>

When input is excluded from scan results, _expand_events_in_df would crash with a KeyError because it checked for input_data but not input. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

jjallaire and others added 30 commits February 23, 2026 11:09

remove timelines from docs

af95dc4

protect against null transcript_metadata

55dc7a7

correct string for docs

4d73081

fix timestamp parsing in tests

bc44e8c

remove timelines from navbar

8873e33

improved null guard

daed6a9

imrprove reducer docs

99d2473

clarify llm_scanner docs

62828f5

use explicit init span

e6a6db8

improve typing on tests

cd95894

rename timeline functions

2b0e3ec

inspect ai timelines (meridianlabs-ai#296)

4ebb63a

* add timeline_dump and timeline_load functions * import timeline functions from inspect_ai * remove cc events

don't re-export inspect timeline functions

8678e6f

Refactor Claude Code events into Inspect SWE (meridianlabs-ai#297)

d5f9046

* set assistant message id from underlying jsonl event * update inspect dep * call inspect_swe for cc events * add inspect-swe depencency * tweak changelog

timelines: Improve agent detection logic in timeline_build() (merid…

dab0453

…ianlabs-ai#298) * timelines: Improve agent detection logic in `timeline_build()` * fix tests * update python tests

Allow route level code splitting. (meridianlabs-ai#289)

f7b84fc

Add --root-path flag to scout view for reverse proxy support (meridia…

ab3c5d2

…nlabs-ai#295)

improved handling of empty timeline spans

91f6838

Regenerate schema and dist (meridianlabs-ai#302)

ec865ca

Fix early-exit bug; unthin target and and add scores from sample JSON. (

bc0fedf

meridianlabs-ai#299) Addresses meridianlabs-ai#270

Pass scores and target through in as_scorer path (meridianlabs-ai#303)

ad829a4

Properly support value types in scanner results (meridianlabs-ai#301)

69d76f8

Co-authored-by: Charles Teague <charles@meridianlabs.ai> Co-authored-by: jjallaire <jj.allaire@gmail.com>

Merge branch 'main' into feature/weave-source

da5ad6f

run tests + refinements

81a2440

docs

7aaf83d

Merge branch 'scottire-feature/weave-source'

fc42d61

regen docs

cd331a8

ruff

fa78667

jjallaire and others added 27 commits March 16, 2026 16:51

update multi-agent docs

e7f73ad

Enable LFS support in publish workflow

5be079b

update changelog for release

b63fe5a

style: fix import sorting in test files

aea1f0f

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

TypeScript monorepo migration (meridianlabs-ai#290)

3aa2860

Bump ts-mono: build to dist/ for turbo caching (meridianlabs-ai#345)

aeb7e70

Bump to latest main (meridianlabs-ai#346)

be1997d

Co-authored-by: Charles Teague <charles@meridianlabs.ai>

gitignore

496653e

Bump to main (meridianlabs-ai#348)

6be5e73

* Bump to main * correct types? * point to latest main --------- Co-authored-by: Charles Teague <charles@meridianlabs.ai>

Bump to pick up doc change

7d1f7e0

CI: block merge if submodule not on ts-mono main (meridianlabs-ai…

11f63e5

…#350)

Avoid reintroducing n² Transcript payload size by sending condensed…

ece60ba

… events to client and expand client-side (meridianlabs-ai#351)

Expand message/call pool refs in load_filtered_transcript (meridian…

dab93b8

…labs-ai#352) Co-authored-by: Rasmus Faber-Espensen <rfaber@gmail.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

Bump to improved markdown parsing (meridianlabs-ai#353)

6bff837

Co-authored-by: Charles Teague <charles@meridianlabs.ai>

Merge branch 'main' into fix/summary-counting-bugs

871b25c

update changelog

8c28996

Merge branch 'METR-fix/summary-counting-bugs'

f0e51f8

update db schema docs

770d429

update changelog

1dc5b17

update changelog for release

ee74702

add mlflow to docs

0898579

Delete stray file and add frontend setup steps to README (meridianlab…

22ff4dc

…s-ai#358)

Update to take marker fix (meridianlabs-ai#360)

e0411aa

* Update to take marker fix * bump to latest --------- Co-authored-by: Charles Teague <charles@meridianlabs.ai>

Handle excluded input column in event expansion

e3a39a2

When input is excluded from scan results, _expand_events_in_df would crash with a KeyError because it checked for input_data but not input. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

rasmusfaber closed this Mar 25, 2026

rasmusfaber deleted the fix/handle-excluded-input branch March 25, 2026 10:21

rasmusfaber restored the fix/handle-excluded-input branch March 25, 2026 10:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle excluded input column in event expansion#6

Handle excluded input column in event expansion#6
rasmusfaber wants to merge 110 commits into
mainfrom
fix/handle-excluded-input

rasmusfaber commented Mar 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Conversation

rasmusfaber commented Mar 25, 2026

Summary

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants