HuggingFace_transformer


g-prz fe484726aa Add falcon gguf (#33437)

* feat(gguf): add falcon q2 k

* fix(gguf): remove useless renaming

* feat(gguf): separate falcon 7b and 40b

* feat(gguf): apply fixup

* fix(test): error rebase

* feat(gguf): add fp16 weight comparison for falcon

* feat(gguf): test weight of all layers

* test(gguf): add falcon 40b under skip decorator

* feat(gguf): quick example for extracting model size

2024-10-02 14:10:39 +02:00

aqlm_integration

Cache: use batch_size instead of max_batch_size (#32657)

2024-08-16 11:48:45 +01:00

autoawq

Skip tests properly (#31308)

2024-06-26 21:59:08 +01:00

bnb

Enable BNB multi-backend support (#31098)

2024-09-24 03:40:56 -06:00

compressed_tensor

HFQuantizer implementation for compressed-tensors library (#31704)