docs: fix broken link (#31370)

* docs: fix broken link
* fix link
2024-06-12 18:33:00 +08:00
parent 20fac1f249
commit 84351d57eb
7 changed files with 7 additions and 7 deletions
--- a/docs/source/en/quantization/awq.md
+++ b/docs/source/en/quantization/awq.md
@@ -71,7 +71,7 @@ model_id = "TheBloke/zephyr-7B-alpha-AWQ"
 model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float32)
 ```

-AWQ quantization can also be combined with [FlashAttention-2](perf_infer_gpu_one#flashattention-2) to further accelerate inference:
+AWQ quantization can also be combined with [FlashAttention-2](../perf_infer_gpu_one#flashattention-2) to further accelerate inference:

 ```py
 from transformers import AutoModelForCausalLM, AutoTokenizer