feat: support generated-output flow in run_experiment (no prompt_temp… by BipinShetty · Pull Request #486 · rungalileo/galileo-python

BipinShetty · 2026-02-24T09:50:21Z

User description

…late)

Remove hard ValueError for prompt_template=None in run_experiment(). The API is the authority on flow determination: if the dataset has a generated_output column, the API accepts prompt_template_version_id=None and runs metrics without LLM generation. If not, the API returns 3512.
Experiments.run(): only set default prompt_settings when a template is provided; pass prompt_template_id=None for generated-output flow.
Jobs.create(): make prompt_template_id and prompt_settings Optional; only include them in CreateJobRequest when non-None so the API contract is respected for both flows.
Add 3 tests: generated-output flow, prompt-takes-precedence, and no-prompt-no-dataset raises ValueError.

Shortcut:

Description:

Tests:

Unit Tests Added
E2E Test Added (if it's a user-facing feature, or fixing a bug)

Generated description

Below is a concise technical summary of the changes proposed in this PR:
Enables the generated-output flow in experiment runs by allowing prompt_template to be optional, shifting flow determination to the API based on dataset content. Updates the experiment and job creation logic to conditionally include prompt-related parameters only when a template is provided.

Topic Details

Flow Validation

Add unit tests to verify the generated-output flow, ensure prompt templates take precedence when provided, and validate error handling for missing datasets.

Modified files (1)

tests/test_experiments.py

Latest Contributors(2)

User	Commit	Date
jweiler@galileo.ai	feat-Add-GalileoMetric...	December 19, 2025
david@rungalileo.io	feat-Add-batch-get-dat...	December 12, 2025

Generated Output

Modify run_experiment, Experiments.run, and Jobs.create to support optional prompt templates and settings, ensuring the API receives None values for generated-output flows.

Modified files (2)

src/galileo/experiments.py
src/galileo/jobs.py

Latest Contributors(2)

User	Commit	Date
vamaq@users.noreply.gi...	fix-Define-explicit-er...	February 03, 2026
jweiler@galileo.ai	feat-Add-GalileoMetric...	December 19, 2025

This pull request is reviewed by Baz. Review like a pro on (Baz).

…late) - Remove hard ValueError for prompt_template=None in run_experiment(). The API is the authority on flow determination: if the dataset has a generated_output column, the API accepts prompt_template_version_id=None and runs metrics without LLM generation. If not, the API returns 3512. - Experiments.run(): only set default prompt_settings when a template is provided; pass prompt_template_id=None for generated-output flow. - Jobs.create(): make prompt_template_id and prompt_settings Optional; only include them in CreateJobRequest when non-None so the API contract is respected for both flows. - Add 3 tests: generated-output flow, prompt-takes-precedence, and no-prompt-no-dataset raises ValueError. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…dataset_raises Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

codecov · 2026-02-24T10:01:53Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 81.87%. Comparing base (0e364eb) to head (38924d6).

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #486      +/-   ##
==========================================
+ Coverage   81.85%   81.87%   +0.02%     
==========================================
  Files          96       96              
  Lines        9164     9166       +2     
==========================================
+ Hits         7501     7505       +4     
+ Misses       1663     1661       -2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

BipinShetty requested a review from a team as a code owner February 24, 2026 09:50

BipinShetty requested a review from csurfer February 24, 2026 09:50

baz-reviewer bot approved these changes Feb 24, 2026

View reviewed changes

BipinShetty requested review from avalencia-galileo, fernandocorreia-galileo and jasmine-ab-tea February 24, 2026 09:56

fix: correct error message match in test_run_experiment_no_prompt_no_…

38924d6

…dataset_raises Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

style: ruff-format test_experiments.py

2fd04f8

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

dmcwhorter approved these changes Feb 25, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support generated-output flow in run_experiment (no prompt_temp…#486

feat: support generated-output flow in run_experiment (no prompt_temp…#486
BipinShetty wants to merge 3 commits intomainfrom
feat/generated-output-flow-sdk

BipinShetty commented Feb 24, 2026 •

edited by baz-reviewer bot

Loading

Uh oh!

codecov bot commented Feb 24, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

BipinShetty commented Feb 24, 2026 • edited by baz-reviewer bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

User description

Generated description

Uh oh!

codecov bot commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

BipinShetty commented Feb 24, 2026 •

edited by baz-reviewer bot

Loading

codecov bot commented Feb 24, 2026 •

edited

Loading