Skip to content

[phyai] MagiAttention Backend #16

@chenghuaWang

Description

@chenghuaWang

Summary

Add backend support for SandAI-org/MagiAttention, a distributed/context-parallel attention implementation designed for ultra-long-context training and heterogeneous attention masks.

Motivation

MagiAttention provides a scalable distributed attention backend with support for flexible mask patterns, computation load balancing, zero-redundant communication, and computation/communication overlap. Supporting it in phyai would make it easier to run long-context training workloads that require context parallelism and efficient distributed attention.

References

Metadata

Metadata

Assignees

No one assigned

    Labels

    help wantedExtra attention is needed
    No fields configured for Feature.

    Projects

    Status
    Todo

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions