Shang Zhang
a59e7c1ed4
Add QDQBert model and quantization examples of SQUAD task (#14066)
* clean up branch for add-qdqbert-model
* README update for QAT example; update docstrings in modeling_qdqbert.py
* Update qdqbert.rst
* Update README.md
* Update README.md
* calibration data using training set; QAT example runs in fp32
* re-use BERTtokenizer for qdqbert
* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* Update docs/source/model_doc/qdqbert.rst
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
* remove qdqbert tokenizer
* Update qdqbert.rst
* update evaluate-hf-trt-qa.py
* update configuration_qdqbert.py
* update modeling_qdqbert.py: add copied statement; replace assert with ValueError
* update copied from statement
* add is_quantization_available; run make fix-copies
* unittest add require_quantization
* add backend dependency to qdqbert model
* update README; update evaluate script; make style
* lint
* docs qdqbert update
* circleci build_doc add pytorch-quantization for qdqbert
* update README
* update example readme with instructions to upgrade TensorRT to 8.2
* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* Update src/transformers/models/qdqbert/configuration_qdqbert.py
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
* change quantization to pytorch_quantization for backend requirement
* feed_forward_chunking not supported in QDQBert
* make style
* update model docstrings and comments in testing scripts
* rename example to quantization-qdqbert; rename example scripts from qat to quant
* Update src/transformers/models/qdqbert/modeling_qdqbert.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* rm experimental functions in quant_trainer
* qa cleanup
* make fix-copies for docs index.rst
* fix doctree; use post_init() for qdqbert
* fix early device assignment for qdqbert
* fix CI:Model templates runner
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
Co-authored-by: Lysandre Debut <lysandre@huggingface.co>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2021-11-19 13:33:39 -05:00