HuggingFace_transformer/tests/quantization/aqlm_integration
Andrei Panferov e3fc90ae68 Cleaner Cache dtype and device extraction for CUDA graph generation for quantizers compatibility (#29079)
* input_layernorm as the beacon of hope

* cleaner dtype extraction

* AQLM + CUDA graph test

* is available check

* shorter text test
2024-02-27 09:32:39 +01:00