We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 3ec9069 commit 9e532c4Copy full SHA for 9e532c4
README.md
@@ -23,6 +23,7 @@ pip install wheel
23
export CUDA_HOME=/pkgs/cuda-12.3
24
25
pip install flash-attn --no-build-isolation
26
+pip install vllm-flash-attn
27
```
28
29
## Launch an inference server
0 commit comments