@@ -532,13 +532,13 @@ export PATH=$PATH:/usr/local/cuda/bin
 Then run:
 
 ``` shell
-# This can take a while as we need to compile a lot of cuda kernels
+# This can take a while as we need to compile a lot of CUDA kernels
 
 # On Turing GPUs (T4, RTX 2000 series ... )
-cargo install --path router -F candle-cuda-turing -F http --no-default-features
+cargo install --path router -F candle-cuda-turing
 
 # On Ampere and Hopper
-cargo install --path router -F candle-cuda -F http --no-default-features
+cargo install --path router -F candle-cuda
 ```
 
 You can now launch Text Embeddings Inference on GPU with:
@@ -39,18 +39,18 @@ curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
 
 ## Step 3: Install necessary packages
 
-This step can take a while as we need to compile a lot of cuda kernels.
+This step can take a while as we need to compile a lot of CUDA kernels.
 
 ### For Turing GPUs (T4, RTX 2000 series ... )
 
 ``` shell
-cargo install --path router -F candle-cuda-turing -F http --no-default-features
+cargo install --path router -F candle-cuda-turing
 ```
 
 ### For Ampere and Hopper
 
 ``` shell
-cargo install --path router -F candle-cuda -F http --no-default-features
+cargo install --path router -F candle-cuda
 ```
 
 ## Step 4: Launch Text Embeddings Inference