HuggingFace_transformer/tests/quantization at 67890de3b86c81fb4775f41b4690b2abaf2a19cf - HuggingFace_transformer - Gitea: Git with SSUM

SUMIN/HuggingFace_transformer

Files

History

Marc Sun 67890de3b8 Torchao weights only + prequantized compability (#34355 )

* weights only compability

* better tests from code review

* ping torch version

* add weights_only check

2024-11-20 17:24:45 +01:00

..

aqlm_integration

Cache: use batch_size instead of max_batch_size (#32657 )

2024-08-16 11:48:45 +01:00

Enables CPU AWQ model with IPEX version. (#33460 )

2024-10-04 16:25:10 +02:00

bitnet_integration

FEAT : Adding BitNet quantization method to HFQuantizer (#33410 )

2024-10-09 17:51:41 +02:00

Fix bnb training test failure (#34414 )

2024-10-25 10:23:20 -04:00

compressed_tensor

HFQuantizer implementation for compressed-tensors library (#31704 )

2024-09-25 14:31:38 +02:00

eetq_integration

[FEAT]: EETQ quantizer support (#30262 )

2024-04-22 20:38:58 +01:00

Fix FbgemmFp8Linear not preserving tensor shape (#33239 )

2024-09-11 13:26:44 +02:00

Fix use_parallel_residual and qkv_bias for StableLM GGUF config extraction (#34450 )

2024-11-05 18:26:20 +01:00

🚨 Remove dataset with restrictive license (#31452 )

2024-06-17 17:56:51 +01:00

Hqq serialization (#33141 )

2024-09-30 14:47:18 +02:00

quanto_integration

[Quantization] Switch to optimum-quanto (#31732 )

2024-10-02 15:14:34 +02:00

torchao_integration

Torchao weights only + prequantized compability (#34355 )

2024-11-20 17:24:45 +01:00