docs: fix broken link (#31370)

* docs: fix broken link

* fix link
This commit is contained in:
谭九鼎
2024-06-12 18:33:00 +08:00
committed by GitHub
parent 20fac1f249
commit 84351d57eb
7 changed files with 7 additions and 7 deletions

View File

@@ -71,7 +71,7 @@ model_id = "TheBloke/zephyr-7B-alpha-AWQ"
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float32)
```
AWQ quantization can also be combined with [FlashAttention-2](perf_infer_gpu_one#flashattention-2) to further accelerate inference:
AWQ quantization can also be combined with [FlashAttention-2](../perf_infer_gpu_one#flashattention-2) to further accelerate inference:
```py
from transformers import AutoModelForCausalLM, AutoTokenizer