SUMIN/HuggingFace_transformer
54be2d7ae87e873482b984cc956e165ca4dc0ba3
HuggingFace_transformer/tests/quantization
Mohamed Mekkouri 54be2d7ae8 Bitnet test fix to avoid using gated model (#34863)
small test fix
2024-11-22 17:18:49 +01:00
Name                 Last commit                                                         Date
aqlm_integration     Cache: use batch_size instead of max_batch_size (#32657)            2024-08-16 11:48:45 +01:00
autoawq              Enables CPU AWQ model with IPEX version. (#33460)                   2024-10-04 16:25:10 +02:00
bitnet_integration   Bitnet test fix to avoid using gated model (#34863)                 2024-11-22 17:18:49 +01:00
bnb                  Fix bnb training test failure (#34414)                              2024-10-25 10:23:20 -04:00
compressed_tensor    HFQuantizer implementation for compressed-tensors library (#31704)  2024-09-25 14:31:38 +02:00
eetq_integration     [FEAT]: EETQ quantizer support (#30262)                             2024-04-22 20:38:58 +01:00
fbgemm_fp8           Fix FbgemmFp8Linear not preserving tensor shape (#33239)            2024-09-11 13:26:44 +02:00
ggml                 Add Nemotron GGUF Loading Support (#34725)                          2024-11-21 11:37:34 +01:00
gptq                 🚨 Remove dataset with restrictive license (#31452)                  2024-06-17 17:56:51 +01:00
hqq                  Hqq serialization (#33141)                                          2024-09-30 14:47:18 +02:00
quanto_integration   [Quantization] Switch to optimum-quanto (#31732)                    2024-10-02 15:14:34 +02:00
torchao_integration  Fix CI by tweaking torchao tests (#34832)                           2024-11-20 20:28:51 +01:00