Add multi MIG GPU support in release-v1.3 as fix for #1586 by Tishwings · Pull Request #1589 · nanoporetech/dorado

Tishwings · 2026-05-08T07:29:40Z

fix for #1586 in release-v1.3

Summary of changes

This PR improves support for NVIDIA MIG GPU devices by determining the maximum number of devices from both NVML and the CUDA runtime (torch). This ensures MIG instances are recognized correctly, even when CUDA exposes more devices than NVML.

Key points:

Device count now uses the maximum from NVML and torch (torch::cuda::device_count()), allowing proper enumeration of MIG instances.
Device mapping and validation now reference this unified count.
Instead of errors when too many CUDA_VISIBLE_DEVICES are specified, a warning is logged for more robust operation.
Deprecated NVML API warnings (CUDA 13+) are suppressed to reduce CI/CD build noise.

Why error suppression is needed

Suppressing NVML deprecation warnings is necessary because newer CUDA versions (13+) produce many build warnings due to outdated, but still required, APIs. This keeps our build logs clean and avoids unnecessary alarm for unavoidable warnings.

Next steps (out of scope for this PR)

Proper validation should include tests on hardware other than ours, including different MIG and non-MIG GPUs. Migrating to future NVIDIA APIs for device management would also be recommended once available. For now, these improvements safely extend current support for us without risking regressions elsewhere.

…he max device count from torch (incl MIGs)

add support for mig gpu devices on different parent GPUs by setting t…

59c4ed0

…he max device count from torch (incl MIGs)

Tishwings mentioned this pull request May 8, 2026

Add multi MIG GPU support in release-v1.4 as fix for #1586 #1590

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add multi MIG GPU support in release-v1.3 as fix for #1586#1589

Add multi MIG GPU support in release-v1.3 as fix for #1586#1589
Tishwings wants to merge 1 commit into
nanoporetech:release-v1.3from
Tishwings:issue-1586-mig-fix-v1.3

Tishwings commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Tishwings commented May 8, 2026

Summary of changes

Why error suppression is needed

Next steps (out of scope for this PR)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant