Skip to content

Conversation

@qinyiqun
Copy link
Collaborator

A100 只支持F16及以上精度
image
H100 支持FP8E4M3和Fp8E5M2
FP8-Block-Wise
image
FP8
image
其他
image
FP8-Quantize-Group-Wise
image

@qinyiqun
Copy link
Collaborator Author

有些代码需要做一些格式的format

@qinyiqun qinyiqun force-pushed the fp8 branch 2 times, most recently from abb321d to 104c424 Compare October 16, 2025 07:37
INFINI_STATUS_BAD_TENSOR_SHAPE = 11,
INFINI_STATUS_BAD_TENSOR_STRIDES = 12,
INFINI_STATUS_INSUFFICIENT_WORKSPACE = 13,
INFINI_STATUS_NOT_ALIGNED = 14,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

需要在InfiniCore/src/utils/infini_status_string.h中添加string

INFINI_DTYPE_C64 = 17,
INFINI_DTYPE_C128 = 18,
INFINI_DTYPE_BF16 = 19,
INFINI_DTYPE_F8_E4M3 = 20,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个会影响python前端接口,和 @voltjia 过一下

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个没关系,我在 infinicore Python 层还会再封装一次,所以只要能跟 torch 里的对上号就行。

__C __export infiniStatus_t
infiniopDestroyLinearDescriptor(infiniopLinearDescriptor_t desc);

#endif No newline at end of file
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

末尾空行

from numpy.lib.stride_tricks import as_strided

from .. import InfiniopTestWriter, InfiniopTestCase, np_dtype_to_ggml, gguf_strides, contiguous_gguf_strides, process_zero_stride_tensor
from .. import (
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这些无关文件的更改都回滚吧

@@ -0,0 +1,406 @@
import torch
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

如果测试都是分成两个,那是不是分成两个算子会更好一点?


typedef struct InfiniopDescriptor *infiniopQuantizeDescriptor_t;

__C __export infiniStatus_t infiniopCreateQuantizeDescriptor(
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这不是通用的quantize,在命名的时候应该注明是什么quantize

#include <utility>

/**
* @brief Define the process for initializing a Descriptor of an elementwise operation
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

请不要在无关文件进行格式修改

@PanZezhong1725
Copy link
Collaborator

写一下接口设计文档,有的参数的含义需要解释

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants