Logo
Explore Help
Register Sign In
SUMIN/HuggingFace_transformer
1
0
Fork 0
You've already forked HuggingFace_transformer
Code Issues Pull Requests Actions 7 Packages Projects Releases Wiki Activity
Files
80bee7b11444a698894b114a923710ab8a772d30
HuggingFace_transformer/tests/quantization
History
Vladislav Bronzov c9afee5392 Add gguf support for gpt2 (#34044)
* add gpt2 gguf support

* add doc change

* small refactoring
2024-10-10 13:42:18 +02:00
..
aqlm_integration
Cache: use batch_size instead of max_batch_size (#32657)
2024-08-16 11:48:45 +01:00
autoawq
Enables CPU AWQ model with IPEX version. (#33460)
2024-10-04 16:25:10 +02:00
bitnet_integration
FEAT : Adding BitNet quantization method to HFQuantizer (#33410)
2024-10-09 17:51:41 +02:00
bnb
Enable BNB multi-backend support (#31098)
2024-09-24 03:40:56 -06:00
compressed_tensor
HFQuantizer implementation for compressed-tensors library (#31704)
2024-09-25 14:31:38 +02:00
eetq_integration
[FEAT]: EETQ quantizer support (#30262)
2024-04-22 20:38:58 +01:00
fbgemm_fp8
Fix FbgemmFp8Linear not preserving tensor shape (#33239)
2024-09-11 13:26:44 +02:00
ggml
Add gguf support for gpt2 (#34044)
2024-10-10 13:42:18 +02:00
gptq
🚨 Remove dataset with restrictive license (#31452)
2024-06-17 17:56:51 +01:00
hqq
Hqq serialization (#33141)
2024-09-30 14:47:18 +02:00
quanto_integration
[Quantization] Switch to optimum-quanto (#31732)
2024-10-02 15:14:34 +02:00
torchao_integration
Add TorchAOHfQuantizer (#32306)
2024-08-14 16:14:24 +02:00
Powered by Gitea Version: 1.25.5 Page: 528ms Template: 9ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API