Improved _inc_exc_datasets() and paradigms handling in benchmark() #834

toncho11 · 2025-11-10T10:30:43Z

A re-written _inc_exc_datasets() that fixes issues and provides much needed checks. Before it was possible not to recognize correctly if the input is string or object and thus process the input incorrectly. Fixes: #654. It avoids some confusion induced by the old version.

Below is the code I used for testing:

# -------------------------------
# 1. Pipelines and datasets
# -------------------------------
pipelines_test = [
    # test the new router classifier
    {
        "paradigms": ["P300","LeftRightImagery"],
        "pipeline": make_pipeline(
            Covariances("oas"),
            CustomCspTransformer2(mode="high_electrodes_count"),
            CustomCspTransformer2(mode="low_electrodes_count"),
            MDM()
        ),
        "name": "TSLR"
    }
]

# P300 databases
from moabb.datasets import (
    BI2013a,
    BNCI2014_008,
    BNCI2014_009,
    BNCI2015_003,
    EPFLP300,
    Lee2019_ERP,
    BI2014a,
    BI2014b,
    BI2015a,
    BI2015b,
)

# Motor imagery databases
from moabb.datasets import (
    BNCI2014_001,
    Zhou2016,
    BNCI2015_001,
    BNCI2014_002,
    BNCI2014_004,
    #BNCI2015_004, #not tested
    AlexMI,
    Weibo2014,
    Cho2017,
    GrosseWentrup2009,
    PhysionetMI,
    Shin2017A,
    Lee2019_MI, #new
    Schirrmeister2017 #new
)

# -------------------------------
# 2. Run the benchmark
# -------------------------------

# Ensure the results folder exists
results_path = "./results/"
os.makedirs(results_path, exist_ok=True)

results = benchmark(
    pipelines=pipelines_test,
    evaluations=["WithinSession"], #, "CrossSession"
    paradigms=["P300", "LeftRightImagery"],
    results=results_path,
    output="./benchmark_results/",
    #exclude_datasets=["Stieger2021","Liu2024"], #must be OK
    #include_datasets=["BNCI2014-001"], # #must be OK
    #exclude_datasets=[Zhou2016(), Weibo2014()], #must be OK
    #include_datasets=[Zhou2016(), Weibo2014()], #must be OK
    #exclude_datasets=["Stieger2021","fsdfsdfs"], # should  fail  
    #include_datasets=[PhysionetMI(), Shin2017A(), Lee2019_MI()], #must be OK
    #include_datasets=[PhysionetMI(), Shin2017A(), "BNCI2014-001"], #should fail
    #exclude_datasets=[PhysionetMI(), Shin2017A(), "BNCI2014-001"], #should fail
    #exclude_datasets=[EPFLP300()], 
    #include_datasets=[Lee2019_ERP(), BI2015b()], #should be OK
    #include_datasets=[Lee2019_ERP(), "fsdfsdfdwwww"], #should fail
    #include_datasets=["fsdfsdfdwwww"], #should fail
    exclude_datasets = None, include_datasets = None, # should be OK
    overwrite=True,
    n_jobs=3,
    plot=False,
    n_splits=5
)

…checks.

Adds function filter_paradigms(). It provides better error messages.

…ithub.com/toncho11/moabb into improve_fix_inc_exc_datasets_in_benchmark

bruAristimunha · 2025-11-11T13:30:19Z

hey @toncho11,

can you fix the tests please:

FAILED moabb/tests/test_benchmark.py::TestBenchmark::test_benchmark_strdataset - ValueError: Invalid dataset codes in include_datasets: ['FakeDataset-p300-10-2--60-60--120-120--target-nontarget--c3-cz-c4', 'FakeDataset-ssvep-10-2--60-60--120-120--13-15--c3-cz-c4', 'FakeDataset-cvep-10-2--60-60--120-120--10-00--c3-cz-c4']
FAILED moabb/tests/test_benchmark.py::TestBenchmark::test_benchmark_objdataset - ValueError: Some datasets in include_datasets are not part of available datasets for the paradigms you requested in benchmark(): ['FakeDataset-p300-10-2--60-60--120-120--target-nontarget--c3-cz-c4', 'FakeDataset-ssvep-10-2--60-60--120-120--13-15--c3-cz-c4', 'FakeDataset-cvep-10-2--60-60--120-120--10-00--c3-cz-c4']
FAILED moabb/tests/test_benchmark.py::TestBenchmark::test_include_exclude - ValueError: Cannot specify both include_datasets and exclude_datasets.
===== 3 failed, 301 passed, 90 skipped, 207 war

toncho11 · 2025-11-13T10:33:20Z

I need some more time on the code.

Added one FakeDataset for P300 testing.

toncho11 and others added 8 commits November 10, 2025 10:33

A re-written _inc_exc_datasets() that fixes issues and provides more …

33c1ff5

…checks.

Add another improvement - handling of the paradigms in moabb.

e4d7fc6

Adds function filter_paradigms(). It provides better error messages.

small typo

109ad92

A number of fixes and improvements.

0ae75e8

[pre-commit.ci] auto fixes from pre-commit.com hooks

3df21c6

small docstring update

fdf8ea7

Merge branch 'improve_fix_inc_exc_datasets_in_benchmark' of https://g…

ce0729e

…ithub.com/toncho11/moabb into improve_fix_inc_exc_datasets_in_benchmark

small comment clarification

9a4853c

toncho11 and others added 10 commits November 12, 2025 15:52

Improved tests

9193304

[pre-commit.ci] auto fixes from pre-commit.com hooks

1ca36c7

optuna test enabled

7bc64bf

[pre-commit.ci] auto fixes from pre-commit.com hooks

6652208

There are 8 tests. This has been a lot of effort.

d8cf34b

Merge branch 'develop' into improve_fix_inc_exc_datasets_in_benchmark

b7215ab

updated whats_new file

8733fe4

Improved documentation

58a5da3

[pre-commit.ci] auto fixes from pre-commit.com hooks

210be29

comments updated

c7c05b6

toncho11 marked this pull request as draft November 13, 2025 10:33

toncho11 and others added 5 commits November 13, 2025 14:52

Multiple paradigms in the parameter "paradigms" are handled better now.

abd52c1

Added one FakeDataset for P300 testing.

[pre-commit.ci] auto fixes from pre-commit.com hooks

786d6d9

small improvements

ebd3916

All fake datasets are set to 2 subjects to reduce execution time.

ba08834

[pre-commit.ci] auto fixes from pre-commit.com hooks

2c7bd0a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improved _inc_exc_datasets() and paradigms handling in benchmark() #834

Improved _inc_exc_datasets() and paradigms handling in benchmark() #834

toncho11 commented Nov 10, 2025 •

edited

Loading

Uh oh!

bruAristimunha commented Nov 11, 2025

Uh oh!

toncho11 commented Nov 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Improved _inc_exc_datasets() and paradigms handling in benchmark() #834

Are you sure you want to change the base?

Improved _inc_exc_datasets() and paradigms handling in benchmark() #834

Conversation

toncho11 commented Nov 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bruAristimunha commented Nov 11, 2025

Uh oh!

toncho11 commented Nov 13, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

toncho11 commented Nov 10, 2025 •

edited

Loading