2 files changed: +6 -6 lines changed

@@ -532,13 +532,13 @@ export PATH=$PATH:/usr/local/cuda/bin
Then run:

``` shell
- # This can take a while as we need to compile a lot of cuda kernels
+ # This can take a while as we need to compile a lot of CUDA kernels

# On Turing GPUs (T4, RTX 2000 series ... )
- cargo install --path router -F candle-cuda-turing -F http --no-default-features
+ cargo install --path router -F candle-cuda-turing

# On Ampere and Hopper
- cargo install --path router -F candle-cuda -F http --no-default-features
+ cargo install --path router -F candle-cuda
```

You can now launch Text Embeddings Inference on GPU with:

@@ -39,18 +39,18 @@ curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh

## Step 3: Install necessary packages

- This step can take a while as we need to compile a lot of cuda kernels.
+ This step can take a while as we need to compile a lot of CUDA kernels.

### For Turing GPUs (T4, RTX 2000 series ... )

``` shell
- cargo install --path router -F candle-cuda-turing -F http --no-default-features
+ cargo install --path router -F candle-cuda-turing
```

### For Ampere and Hopper

``` shell
- cargo install --path router -F candle-cuda -F http --no-default-features
+ cargo install --path router -F candle-cuda
```

## Step 4: Launch Text Embeddings Inference