SUMIN/HuggingFace_transformer
67890de3b86c81fb4775f41b4690b2abaf2a19cf
HuggingFace_transformer/tests/quantization
Marc Sun 67890de3b8 Torchao weights only + prequantized compability (#34355)
* weights only compability

* better tests from code review

* ping torch version

* add weights_only check
2024-11-20 17:24:45 +01:00
| Directory | Last commit | Date |
| --- | --- | --- |
| aqlm_integration | Cache: use batch_size instead of max_batch_size (#32657) | 2024-08-16 11:48:45 +01:00 |
| autoawq | Enables CPU AWQ model with IPEX version. (#33460) | 2024-10-04 16:25:10 +02:00 |
| bitnet_integration | FEAT : Adding BitNet quantization method to HFQuantizer (#33410) | 2024-10-09 17:51:41 +02:00 |
| bnb | Fix bnb training test failure (#34414) | 2024-10-25 10:23:20 -04:00 |
| compressed_tensor | HFQuantizer implementation for compressed-tensors library (#31704) | 2024-09-25 14:31:38 +02:00 |
| eetq_integration | [FEAT]: EETQ quantizer support (#30262) | 2024-04-22 20:38:58 +01:00 |
| fbgemm_fp8 | Fix FbgemmFp8Linear not preserving tensor shape (#33239) | 2024-09-11 13:26:44 +02:00 |
| ggml | Fix use_parallel_residual and qkv_bias for StableLM GGUF config extraction (#34450) | 2024-11-05 18:26:20 +01:00 |
| gptq | 🚨 Remove dataset with restrictive license (#31452) | 2024-06-17 17:56:51 +01:00 |
| hqq | Hqq serialization (#33141) | 2024-09-30 14:47:18 +02:00 |
| quanto_integration | [Quantization] Switch to optimum-quanto (#31732) | 2024-10-02 15:14:34 +02:00 |
| torchao_integration | Torchao weights only + prequantized compability (#34355) | 2024-11-20 17:24:45 +01:00 |