Logo
Explore Help
Register Sign In
SUMIN/HuggingFace_transformer
1
0
Fork 0
You've already forked HuggingFace_transformer
Code Issues Pull Requests Actions 7 Packages Projects Releases Wiki Activity
Files
65442718c478aed0183155cd69decb8fc7e47f5f
HuggingFace_transformer/tests/quantization
History
Vladislav Bronzov cb5ca3265f Add GGUF for starcoder2 (#34094)
* add starcoder2 arch support for gguf

* fix q6 test
2024-10-14 10:22:49 +02:00
..
aqlm_integration
Cache: use batch_size instead of max_batch_size (#32657)
2024-08-16 11:48:45 +01:00
autoawq
Enables CPU AWQ model with IPEX version. (#33460)
2024-10-04 16:25:10 +02:00
bitnet_integration
FEAT : Adding BitNet quantization method to HFQuantizer (#33410)
2024-10-09 17:51:41 +02:00
bnb
Enable BNB multi-backend support (#31098)
2024-09-24 03:40:56 -06:00
compressed_tensor
HFQuantizer implementation for compressed-tensors library (#31704)
2024-09-25 14:31:38 +02:00
eetq_integration
[FEAT]: EETQ quantizer support (#30262)
2024-04-22 20:38:58 +01:00
fbgemm_fp8
Fix FbgemmFp8Linear not preserving tensor shape (#33239)
2024-09-11 13:26:44 +02:00
ggml
Add GGUF for starcoder2 (#34094)
2024-10-14 10:22:49 +02:00
gptq
🚨 Remove dataset with restrictive license (#31452)
2024-06-17 17:56:51 +01:00
hqq
Hqq serialization (#33141)
2024-09-30 14:47:18 +02:00
quanto_integration
[Quantization] Switch to optimum-quanto (#31732)
2024-10-02 15:14:34 +02:00
torchao_integration
Add TorchAOHfQuantizer (#32306)
2024-08-14 16:14:24 +02:00
Powered by Gitea Version: 1.25.5 Page: 107ms Template: 5ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API