Logo
Explore Help
Register Sign In
SUMIN/HuggingFace_transformer
1
0
Fork 0
You've already forked HuggingFace_transformer
Code Issues Pull Requests Actions 7 Packages Projects Releases Wiki Activity
Files
b0f0c61899019d316db17a493023828aa44db06d
HuggingFace_transformer/tests/quantization
History
Vladislav Bronzov cb5ca3265f Add GGUF for starcoder2 (#34094)
* add starcoder2 arch support for gguf

* fix q6 test
2024-10-14 10:22:49 +02:00
..
aqlm_integration
Cache: use batch_size instead of max_batch_size (#32657)
2024-08-16 11:48:45 +01:00
autoawq
Enables CPU AWQ model with IPEX version. (#33460)
2024-10-04 16:25:10 +02:00
bitnet_integration
FEAT : Adding BitNet quantization method to HFQuantizer (#33410)
2024-10-09 17:51:41 +02:00
bnb
Enable BNB multi-backend support (#31098)
2024-09-24 03:40:56 -06:00
compressed_tensor
HFQuantizer implementation for compressed-tensors library (#31704)
2024-09-25 14:31:38 +02:00
eetq_integration
[FEAT]: EETQ quantizer support (#30262)
2024-04-22 20:38:58 +01:00
fbgemm_fp8
Fix FbgemmFp8Linear not preserving tensor shape (#33239)
2024-09-11 13:26:44 +02:00
ggml
Add GGUF for starcoder2 (#34094)
2024-10-14 10:22:49 +02:00
gptq
🚨 Remove dataset with restrictive license (#31452)
2024-06-17 17:56:51 +01:00
hqq
Hqq serialization (#33141)
2024-09-30 14:47:18 +02:00
quanto_integration
[Quantization] Switch to optimum-quanto (#31732)
2024-10-02 15:14:34 +02:00
torchao_integration
Add TorchAOHfQuantizer (#32306)
2024-08-14 16:14:24 +02:00
Powered by Gitea Version: 1.25.5 Page: 403ms Template: 11ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API