HuggingFace_transformer

Files

Jerry Zhang 86777b5e2f Support AOPerModuleConfig and include_embedding (#37802)

* Support `AOPerModuleConfig` and include_embedding

Summary:
This PR adds support for per-module configuration for torchao.
It also adds per-module quantization examples:

1. Quantizing different layers with different quantization configs
2. Skipping quantization for certain layers
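
The two cases above can be illustrated with a minimal, library-free sketch of how a per-module mapping might be resolved. It does not use the torchao or transformers APIs: `resolve_config`, the string config names, and the module names are hypothetical, and the `_default` fallback key and the use of `None` to mean "skip quantization" are assumptions made for illustration.

```python
from typing import Optional

# Hypothetical stand-ins for quantization configs; in practice these would be
# config objects provided by the quantization library.
INT4_WEIGHT_ONLY = "int4_weight_only"
INT8_WEIGHT_ONLY = "int8_weight_only"


def resolve_config(module_fqn: str, per_module: dict[str, Optional[str]]) -> Optional[str]:
    """Return the config for a module by its fully qualified name.

    An exact match wins; otherwise the "_default" entry is used (assumed key).
    A value of None means the module is left unquantized.
    """
    if module_fqn in per_module:
        return per_module[module_fqn]
    return per_module.get("_default")


per_module = {
    "_default": INT4_WEIGHT_ONLY,
    # Case 1: a specific layer gets a different quantization config.
    "model.layers.0.self_attn.q_proj": INT8_WEIGHT_ONLY,
    # Case 2: a specific layer is skipped.
    "model.layers.0.self_attn.k_proj": None,
}

for name in (
    "model.layers.0.self_attn.q_proj",
    "model.layers.0.self_attn.k_proj",
    "model.layers.1.mlp.up_proj",
):
    print(name, "->", resolve_config(name, per_module))
```

Modules without an explicit entry fall through to the default, so only the exceptions need to be listed.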

Test Plan:
python tests/quantization/torchao_integration/test_torchao.py -k test_include_embedding
python tests/quantization/torchao_integration/test_torchao.py -k test_per_module_config_skip

* format

* format

* include embedding: remove input embedding from modules not to convert

* more docs

* Update docs/source/en/quantization/torchao.md

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

* Update src/transformers/quantizers/quantizer_torchao.py

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

* Update src/transformers/quantizers/quantizer_torchao.py

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

---------

Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

2025-04-30 20:16:29 +02:00

aqlm_integration

Use Python 3.9 syntax in tests (#37343)

2025-04-08 14:12:08 +02:00

autoawq

Fixing quantization tests (#37650)

2025-04-22 13:59:57 +02:00

autoround

Fix typos in strings and comments (#37799)

2025-04-28 11:39:11 +01:00

bitnet_integration

Add Bitnet model (#37742)