Dataset Forge


The all-in-one, modular image dataset utility for ML, with a focus on HQ/LQ image pairs for SISR and general computer vision. CLI-first, highly extensible, and packed with advanced tools for dataset curation, analysis, transformation, and validation.


🚀 What is Dataset Forge?

Dataset Forge is a Python CLI tool for managing, analyzing, and transforming image datasets, especially high/low quality pairs for super-resolution and machine learning.
It streamlines dataset curation, analysis, transformation, and validation with an intuitive, extensible interface.


✨ Key Features

  • Clean and organize image datasets (HQ/LQ pairs for super-resolution)
  • Analyze dataset quality and generate reports
  • Process, augment, and transform images
  • Modular, CLI-first, and highly extensible
  • Robust parallel and GPU-accelerated processing
  • 🌐 Global Command System: Context-aware help and instant quit from any menu
  • 📚 Comprehensive Help: Menu-specific documentation and navigation assistance
  • See all features
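
To make the HQ/LQ pairing idea concrete, here is a minimal sketch of a pair-consistency check. It is not Dataset Forge's own code or API: the function name `find_pairs`, the extension list, and the convention that an HQ image and its LQ counterpart share a file stem are all assumptions made for illustration.

```python
from pathlib import Path

# Assumed set of image extensions; adjust to your dataset.
IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".webp", ".bmp"}


def find_pairs(hq_dir, lq_dir):
    """Match HQ and LQ images by file stem (an assumed naming convention).

    Returns three sorted lists of stems: paired, HQ-only, and LQ-only.
    """
    def index(folder):
        # Map each image's stem to its path, ignoring non-image files.
        return {
            p.stem: p
            for p in Path(folder).iterdir()
            if p.is_file() and p.suffix.lower() in IMAGE_EXTS
        }

    hq, lq = index(hq_dir), index(lq_dir)
    paired = sorted(hq.keys() & lq.keys())
    hq_only = sorted(hq.keys() - lq.keys())
    lq_only = sorted(lq.keys() - hq.keys())
    return paired, hq_only, lq_only


if __name__ == "__main__":
    # Hypothetical folder names; replace with your own HQ and LQ directories.
    paired, hq_only, lq_only = find_pairs("dataset/hq", "dataset/lq")
    print(f"{len(paired)} pairs, {len(hq_only)} HQ-only, {len(lq_only)} LQ-only")
```

Unpaired stems are reported rather than deleted, so a cleaning step can decide what to do with them.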

📦 Quickstart

git clone https://github.com/Courage-1984/Dataset-Forge.git
cd Dataset-Forge

📖 Documentation


🖥️ Requirements


💜 Credits


🧰 External Tools integrated:


🪪 License

This project is licensed under the Creative Commons CC-BY-SA-4.0 license. See LICENSE for details.




Project Architecture

A simplified overview of Dataset Forge's modular architecture:

flowchart TD
    A["🚀 CLI Entrypoint"] --> B["📋 Main Menu"]
    B --> C["🎛️ Menu System"]
    C --> D["⚡ Actions Layer"]
    D --> E["🛠️ Core Utilities"]
    D --> F["🔧 DPID Implementations"]

    subgraph "Core Components"
        G["📂 Dataset Management"]
        H["🔍 Analysis & Validation"]
        I["✨ Image Processing"]
        J["🛠️ Utilities & Tools"]
    end

    subgraph "Supporting Systems"
        K["💾 Memory Management"]
        L["⚡ Parallel Processing"]
        M["🎨 UI/CLI System"]
        N["🔧 External Libraries"]
    end
    end

    C --> G
    C --> H
    C --> I
    C --> J

    D --> K
    D --> L
    D --> M
    D --> N

For the full roadmap and advanced usage, see the Documentation Home.
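
The layering in the flowchart above (menu system, then actions layer, then core utilities) can be sketched roughly as follows. This is an illustrative sketch only, not the project's actual module layout: `count_by_extension`, `analyze_action`, `MENU`, and `dispatch` are hypothetical names chosen to show how a menu entry might delegate to an action, which in turn calls a utility.

```python
from typing import Callable, Dict, List


# Core utilities layer (hypothetical helper).
def count_by_extension(filenames: List[str]) -> Dict[str, int]:
    """Count files per lowercase extension."""
    counts: Dict[str, int] = {}
    for name in filenames:
        ext = name.rsplit(".", 1)[-1].lower() if "." in name else ""
        counts[ext] = counts.get(ext, 0) + 1
    return counts


# Actions layer: turns a utility result into something a menu can display.
def analyze_action(filenames: List[str]) -> str:
    counts = count_by_extension(filenames)
    return ", ".join(f"{ext or '(none)'}: {n}" for ext, n in sorted(counts.items()))


# Menu system: maps a menu choice to an action.
MENU: Dict[str, Callable[[List[str]], str]] = {"analyze": analyze_action}


def dispatch(choice: str, filenames: List[str]) -> str:
    """Look up the chosen menu entry and run its action."""
    action = MENU.get(choice)
    if action is None:
        return f"Unknown option: {choice}"
    return action(filenames)


if __name__ == "__main__":
    print(dispatch("analyze", ["a.png", "b.PNG", "c.jpg"]))
```

Keeping menus as thin lookups over actions means a new feature can be added by registering one more entry, which fits the modular, CLI-first design the README describes.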