ISSUE/346: 支持CublasLT，实现了LT的linear，fp8 linear，fp8 block-wise linear，和fp8 的 group-wise quant #510

qinyiqun · 2025-10-16T07:20:56Z

A100 只支持F16及以上精度

H100 支持FP8E4M3和Fp8E5M2
FP8-Block-Wise

FP8

其他

FP8-Quantize-Group-Wise

qinyiqun · 2025-10-16T07:22:39Z

有些代码需要做一些格式的format

…cuBLASLt and fp8 group-wise quant

PanZezhong1725 · 2025-10-21T02:52:58Z

include/infinicore.h

    INFINI_STATUS_BAD_TENSOR_SHAPE = 11,
    INFINI_STATUS_BAD_TENSOR_STRIDES = 12,
    INFINI_STATUS_INSUFFICIENT_WORKSPACE = 13,
+    INFINI_STATUS_NOT_ALIGNED = 14,


需要在InfiniCore/src/utils/infini_status_string.h中添加string

PanZezhong1725 · 2025-10-21T02:54:05Z

include/infinicore.h

    INFINI_DTYPE_C64 = 17,
    INFINI_DTYPE_C128 = 18,
    INFINI_DTYPE_BF16 = 19,
+    INFINI_DTYPE_F8_E4M3 = 20,


这个会影响python前端接口，和 @voltjia 过一下

这个没关系，我在 infinicore Python 层还会再封装一次，所以只要能跟 torch 里的对上号就行。

PanZezhong1725 · 2025-10-21T02:54:33Z

include/infiniop/ops/linear.h

+__C __export infiniStatus_t
+infiniopDestroyLinearDescriptor(infiniopLinearDescriptor_t desc);
+
+#endif


末尾空行

PanZezhong1725 · 2025-10-21T02:56:35Z

test/infiniop-test/test_generate/testcases/add.py

 from numpy.lib.stride_tricks import as_strided

-from .. import InfiniopTestWriter, InfiniopTestCase, np_dtype_to_ggml, gguf_strides, contiguous_gguf_strides, process_zero_stride_tensor
+from .. import (


这些无关文件的更改都回滚吧

PanZezhong1725 · 2025-10-21T02:57:56Z

test/infiniop/linear_fp8_blockwise.py

@@ -0,0 +1,406 @@
+import torch


如果测试都是分成两个，那是不是分成两个算子会更好一点？

PanZezhong1725 · 2025-10-21T02:58:56Z

include/infiniop/ops/quantize.h

+
+typedef struct InfiniopDescriptor *infiniopQuantizeDescriptor_t;
+
+__C __export infiniStatus_t infiniopCreateQuantizeDescriptor(


这不是通用的quantize，在命名的时候应该注明是什么quantize

PanZezhong1725 · 2025-10-21T03:00:10Z

src/infiniop/elementwise/cpu/elementwise_cpu.h

 #include <utility>

 /**
- * @brief Define the process for initializing a Descriptor of an elementwise operation


请不要在无关文件进行格式修改

PanZezhong1725 · 2025-10-21T03:02:23Z

写一下接口设计文档，有的参数的含义需要解释

issue/346: 增加CublasLt支持

80212cb

qinyiqun added the 模块：算子 label Oct 16, 2025

qinyiqun force-pushed the fp8 branch 2 times, most recently from abb321d to 104c424 Compare October 16, 2025 07:37

issue/346: Implemented linear, fp8 linear, fp8 blockwise linear with …

8e07b04

…cuBLASLt and fp8 group-wise quant

qinyiqun force-pushed the fp8 branch from 104c424 to 8e07b04 Compare October 16, 2025 07:53

qinyiqun added the 准备好了 label Oct 16, 2025

qinyiqun requested review from PanZezhong1725 and Ziminli and removed request for PanZezhong1725 October 16, 2025 08:08

PanZezhong1725 requested a review from voltjia October 21, 2025 02:53

PanZezhong1725 requested changes Oct 21, 2025

View reviewed changes

PanZezhong1725 added 需要修改 and removed 准备好了 labels Oct 21, 2025

PanZezhong1725 force-pushed the main branch from 7300e69 to 37c76a9 Compare October 22, 2025 02:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ISSUE/346: 支持CublasLT，实现了LT的linear，fp8 linear，fp8 block-wise linear，和fp8 的 group-wise quant #510

ISSUE/346: 支持CublasLT，实现了LT的linear，fp8 linear，fp8 block-wise linear，和fp8 的 group-wise quant #510

Uh oh!

qinyiqun commented Oct 16, 2025

Uh oh!

qinyiqun commented Oct 16, 2025

Uh oh!

PanZezhong1725 Oct 21, 2025

Uh oh!

PanZezhong1725 Oct 21, 2025

Uh oh!

voltjia Oct 23, 2025

Uh oh!

PanZezhong1725 Oct 21, 2025

Uh oh!

PanZezhong1725 Oct 21, 2025

Uh oh!

PanZezhong1725 Oct 21, 2025

Uh oh!

PanZezhong1725 Oct 21, 2025

Uh oh!

PanZezhong1725 Oct 21, 2025

Uh oh!

PanZezhong1725 commented Oct 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		typedef struct InfiniopDescriptor *infiniopQuantizeDescriptor_t;

		__C __export infiniStatus_t infiniopCreateQuantizeDescriptor(

ISSUE/346: 支持CublasLT，实现了LT的linear，fp8 linear，fp8 block-wise linear，和fp8 的 group-wise quant #510

Are you sure you want to change the base?

ISSUE/346: 支持CublasLT，实现了LT的linear，fp8 linear，fp8 block-wise linear，和fp8 的 group-wise quant #510

Uh oh!

Conversation

qinyiqun commented Oct 16, 2025

Uh oh!

qinyiqun commented Oct 16, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

PanZezhong1725 commented Oct 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants