Skip to content

[phyai] mxfp4 linear #18

@chenghuaWang

Description

@chenghuaWang

Summary

Add support for MXFP4-quantized Linear layers in phyai.

Motivation

MXFP4 is an OCP microscaling FP4 format that stores 4-bit floating-point values with block-wise shared scaling. It can significantly reduce memory footprint for large model inference while preserving better numerical behavior than simple INT4-style quantization.

Supporting MXFP4 linear layers would let phyai run models or checkpoints that are distributed with MXFP4 weights, and provide a hardware-agnostic FP4 path where runtime support is available.

References

Metadata

Metadata

Assignees

No one assigned

    Labels

    help wantedExtra attention is needed
    No fields configured for Feature.

    Projects

    Status
    Todo

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions