IOHClustering

IOHClustering provides an interface for clustering problems, allowing users to map a dataset and a specified number of clusters to the IOHprofiler problem format. This enables seamless integration with the IOHprofiler framework for further analysis and performance evaluation.

As part of the IOHprofiler framework, IOHClustering is under active development. Some features may still be evolving, with potential updates to functionality and interfaces.
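To make the mapping from a dataset and a cluster count to an optimization problem more concrete, here is a minimal, self-contained sketch. It is not IOHClustering's implementation: the flat centroid encoding, the mean-squared-distance objective, and the names `make_clustering_objective` and `toy_points` are all assumptions chosen for illustration.

```python
def make_clustering_objective(points, k):
    """Build a toy objective for a k-centroid clustering problem.

    Assumed encoding (illustrative only): a candidate solution is a flat
    list of k * d numbers, read as k centroids of dimension d.
    """
    d = len(points[0])
    n_variables = k * d

    def objective(x):
        if len(x) != n_variables:
            raise ValueError(f"expected {n_variables} values, got {len(x)}")
        # Split the flat vector into k centroids of d coordinates each
        centroids = [x[i * d:(i + 1) * d] for i in range(k)]
        total = 0.0
        for p in points:
            # Squared Euclidean distance from the point to its nearest centroid
            total += min(
                sum((pi - ci) ** 2 for pi, ci in zip(p, c)) for c in centroids
            )
        # Mean over all points; lower is better
        return total / len(points)

    return objective, n_variables


# Hypothetical toy dataset: two well-separated groups in 2D
toy_points = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
objective, n_variables = make_clustering_objective(toy_points, k=2)

good = objective([0.0, 0.5, 10.0, 10.5])  # one centroid at each group mean
bad = objective([5.0, 5.0, 5.0, 5.0])     # both centroids between the groups
print(n_variables, good, bad)
```

Under this assumed encoding, a dataset with d features and k clusters becomes a continuous problem with k * d variables, which is the kind of problem a black-box optimizer can work on.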

Features

- Transform datasets into clustering problems compatible with the IOHprofiler framework.
- Real-world datasets for clustering, such as:
  - Iris dataset
  - Wine dataset
  - KC1 dataset
  - Glass dataset

Installation

The minimum supported Python version is 3.10. Install IOHClustering via pip:

pip install iohclustering

Basic Usage

Examples and Tutorials

Below are several examples demonstrating how to use IOHClustering for various clustering tasks. These examples cover basic usage, working with benchmark problems, solving custom datasets, and defining custom evaluation functions.

Example: Clustering a Dataset

The following example shows how to get a benchmark clustering problem (by name or ID) from the IOHClustering framework and attach a logger to it:

import os

import ioh
from iohclustering import get_problem, download_benchmark_datasets


# Get benchmark problem by name (e.g., "iris_pca") with k=2 clusters
clustering_problem, retransform = get_problem(fid="iris_pca", k=2)

# Alternatively, get benchmark problem by its ID (e.g., ID=5) with k=2 clusters
clustering_problem, retransform = get_problem(fid=5, k=2)

# Print metadata of the clustering problem
print(clustering_problem.meta_data)

# Set up a logger to store results in the specified directory
logger = ioh.logger.Analyzer(
    root=os.getcwd(),  # Current working directory
    folder_name="AttachedLogger",  # Folder to store logs
    algorithm_name="None",  # Name of the algorithm (can be customized)
)

# Attach the logger to the created clustering problem
clustering_problem.attach_logger(logger)
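Once a logger is attached, something still has to evaluate candidate solutions for results to be recorded. The sketch below shows a plain random search written against a generic callable objective, so it runs without IOHClustering installed. The helper name `random_search`, the scalar bounds, and the stand-in `sphere` objective are hypothetical. It also assumes a minimization problem that can be called on a candidate vector, as IOHprofiler problems generally can.

```python
import random


def random_search(objective, n_variables, lower, upper, budget=1000, seed=0):
    """Sample uniformly at random within [lower, upper] and keep the best point.

    Assumes minimization and the same scalar bounds for every variable.
    """
    rng = random.Random(seed)
    best_x, best_y = None, float("inf")
    for _ in range(budget):
        x = [rng.uniform(lower, upper) for _ in range(n_variables)]
        y = objective(x)  # one function evaluation
        if y < best_y:
            best_x, best_y = x, y
    return best_x, best_y


# Stand-in objective so the sketch runs on its own; a clustering problem
# object would take its place, with bounds and dimension read from the problem.
def sphere(x):
    return sum(v * v for v in x)


best_x, best_y = random_search(sphere, n_variables=4, lower=-1.0, upper=1.0, budget=500)
print(best_y)
```

Fixing the seed keeps runs reproducible, which makes logged results easier to compare across experiments.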

Example: Listing Available Benchmark Problems

You can retrieve a list of all benchmark problems available in IOHClustering:

from iohclustering import load_problems

problems = load_problems()

for problem in problems.keys():
    print(problem)

Tutorials

Explore the following Jupyter notebooks for step-by-step tutorials on using IOHClustering:

Custom Dataset and Random Search Tutorial: Learn how to define clustering problems with your own datasets and explore solutions using random search.
Custom Metric and Random Search Tutorial: Understand how to define custom clustering metrics and solve clustering problems with random search.

License

This project is licensed under the BSD 3-Clause License. See the LICENSE file for details.
