Add training version check for AQLM quantizer. (#29142)

* training version check * warn old aqlm * aqlm 1.0.2 real * docs
2024-02-21 19:09:36 +03:00
parent ae49b218c3
commit fc37f38915
2 changed files with 16 additions and 2 deletions
--- a/docs/source/en/quantization.md
+++ b/docs/source/en/quantization.md
@@ -39,10 +39,14 @@ Inference support for AQLM is realised in the `aqlm` library. Make sure to insta
 pip install aqlm[gpu,cpu]
 ```

-The library provides efficient kernels for both GPU and CPU inference.
+The library provides efficient kernels for both GPU and CPU inference and training.

 The instructions on how to quantize models yourself, as well as all the relevant code can be found in the corresponding GitHub [repository](https://github.com/Vahe1994/AQLM).

+### PEFT
+
+Starting with version `aqlm 1.0.2`, AQLM supports Parameter-Efficient Fine-Tuning in a form of [LoRA](https://huggingface.co/docs/peft/package_reference/lora) integrated into the [PEFT](https://huggingface.co/blog/peft) library.
+
 ### AQLM configurations

 AQLM quantization setpus vary mainly on the number of codebooks used as well as codebook sizes in bits. The most popular setups, as well as inference kernels they support are: