@'
A repeatable benchmarking toolkit for PyTorch/CUDA workloads.
Planned features:
- CLI to run benchmark scenarios from YAML configs
- Throughput + latency, GPU/CPU memory, GPU utilization (NVML where available)
- JSONL logs + CSV export + plots
'@ | Set-Content -Encoding UTF8 README.md