HuggingFace_transformer

Author	SHA1	Message	Date
DanielHesslow	607acd4fbd	Add Gated-SiLU to T5 (#17420 ) * Add gated-silu to t5 architecture to support UL2 * Fix error message * formatting * formatting again * refactor * fix classnames in _init_weights * remove is_gated * add test * fix test * Try without the test? * Add back the test. * Improve error message. Co-authored-by: Daniel Hesslow <daniel@lighton.ai>	2022-06-03 10:56:37 +02:00
lewtun	1c220ced8e	Update URL for Hub PR docs (#17532 )	2022-06-02 21:52:30 +02:00
Arthur	013462c57b	fix OPT-Flax CI tests (#17512 )	2022-06-02 18:52:46 +02:00
Stas Bekman	2f59ad1609	[trainer/deepspeed] load_best_model (reimplement re-init) (#17151 ) * [trainer/deepspeed] load_best_model * to sync with DS PR #1947 * simplify * rework load_best_model test * cleanup * bump deepspeed>=0.6.5 Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>	2022-06-02 09:14:21 -07:00
Moreno La Quatra	046c5ea906	Implemented loss for training AudioFrameClassification (#17513 ) * Implemented loss for training AudioFrameClassification * reported changes in wav2vec2 main class and used make copies to propagate * running black for code formatting	2022-06-02 17:40:02 +02:00
Kamal Raj	085321c9a1	Update configuration_auto.py (#17527 )	2022-06-02 10:37:00 -04:00
Sylvain Gugger	048dd73bba	Check list of models in the main README and sort it (#17517 ) * Script for README * Fix copies * Complete error message	2022-06-02 08:10:08 -04:00
Sylvain Gugger	588d8f1f26	Fix when Accelerate is not installed (#17518 )	2022-06-02 07:45:41 -04:00
Sylvain Gugger	f128ccb997	Clean README in post release job as well. (#17519 )	2022-06-02 07:44:03 -04:00
Yih-Dar	216499bfcc	Fix CI tests hang forever (#17471 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-02 10:30:54 +02:00
Yih-Dar	659b27fd26	Print more library versions in CI (#17384 ) * print more lib. versions and just befor test runs * update print_env_pt.py * rename to print_env * Disable warning + better job name * print python version Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-02 10:24:16 +02:00
Yih-Dar	0932adb3e8	Split push CI into 2 workflows (#17369 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-02 10:19:26 +02:00
Yih-Dar	58fb3c9f98	Fix Tapas tests (#17510 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-01 21:01:32 +02:00
Joao Gante	ca1f1c8685	CLI: tool to convert PT into TF weights and open hub PR (#17497 )	2022-06-01 18:52:07 +01:00
Zachary Mueller	3766df4fe1	Fix flakey no-trainer test (#17515 )	2022-06-01 13:40:49 -04:00
fireindark707	028d4b7c8b	Deal with the error when task is regression (#16330 )	2022-06-01 11:15:53 -04:00
Anugunj Naman	84aaadd8c5	Adding LeViT Model by Facebook (#17466 ) * levit files * levit tests * weights script * weights script * update * style fixes * few minor corrections * Added teacher model * edit docs * fix-copies * style fixes * pr error resolved * Update README.md Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/index.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/model_doc/levit.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/model_doc/levit.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/model_doc/levit.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update docs/source/en/model_doc/levit.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/__init__.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/__init__.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/configuration_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/configuration_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/feature_extraction_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * suggested pr changes * style fixes * minor bug * update * minor doc edit * style * Update src/transformers/models/levit/feature_extraction_levit.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/levit/feature_extraction_levit.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update tests/models/levit/test_modeling_levit.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/levit/modeling_levit.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/models/levit/feature_extraction_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * residual layer readable * style * Update docs/source/en/model_doc/levit.mdx Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/feature_extraction_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/feature_extraction_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/feature_extraction_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/feature_extraction_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/modeling_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/modeling_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/modeling_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update tests/models/levit/test_feature_extraction_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * change checkpoints and style * update * minor changes * Update src/transformers/models/levit/modeling_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * Update src/transformers/models/levit/modeling_levit.py Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-06-01 17:06:20 +02:00
Yih-Dar	1d2b57b8a2	Fix CTRL tests (#17508 ) * fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-01 16:27:23 +02:00
Yih-Dar	693720e567	Fix LayoutXLMProcessorTest (#17506 ) * fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-06-01 16:26:37 +02:00
Ryokan RI	4d1ce39683	Debug LukeForMaskedLM (#17499 ) * add a test for a word only input * make LukeForMaskedLM work without entity inputs * update test * add LukeForMaskedLM to MODEL_FOR_MASKED_LM_MAPPING_NAMES * restore pyproject.toml * empty line at the end of pyproject.toml	2022-06-01 10:03:06 -04:00
Sylvain Gugger	4390151ba2	Fix MP and CPU offload tests for Funnel and GPT-Neo (#17503 )	2022-06-01 09:59:40 -04:00
Sylvain Gugger	6813439fdc	Exclude Databricks from notebook env (#17496 )	2022-06-01 09:00:11 -04:00
Will Frey	3042ea4f6f	Fix `tokenizer` type annotation in `pipeline(...)` (#17500 ) I think you mean to accept either an instance of `PreTrainedTokenizer` or `PreTrainedTokenizerFast` inside of the `pipeline(...)` factory function, if the `tokenizer` argument isn't a `str`.	2022-06-01 08:43:28 -04:00
amyeroberts	bdc01711d6	Refactor classes to inherit from nn.Module instead of nn.Sequential (#17493 ) * Adapt Maskformer, VAN, ResNet and RegNet modules to inherit from nn.Module	2022-06-01 13:36:19 +01:00
nilboy	b1160c0b56	Fix wav2vec2 export onnx model with attention_mask error (#16004 ) * Fix wav2vec2 export onnx model with attention_mask error * fix repository_consistency	2022-06-01 13:30:58 +02:00
Xing Han Lu	d91da4c6df	Add warning when using older version of torch for ViltFeatureExtractor (#16756 ) * Update feature_extraction_vilt.py * apply black * Update imports * Change warning to logging * Use logger instead of logging.logging * make fixup * Move error message * Update src/transformers/models/vilt/feature_extraction_vilt.py Co-authored-by: Xing Han Lu <xhlperso@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co>	2022-06-01 07:15:38 -04:00
Kyeongpil Kang	24092b1464	Fix typo of variable names for key and query projection layer (#17155 ) self.pos_proj and self.pos_q_proj should be changed to self.pos_key_proj and self.pos_query_proj as same as PyTorch implements.	2022-06-01 11:38:44 +01:00
Jimin Park	811da2b8c2	Fixed wrong error message for missing weight file (#17216 )	2022-06-01 06:24:20 -04:00
Ruihua Fang	4f38808e9e	Add OnnxConfig for SqueezeBert iss17314 (#17315 ) * add onnx config for SqueezeBert * add test for onnx config for SqueezeBert * add automatically updated doc for onnx config for SqueezeBert * Update src/transformers/onnx/features.py Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> * Update src/transformers/models/squeezebert/configuration_squeezebert.py Co-authored-by: lewtun <lewis.c.tunstall@gmail.com> Co-authored-by: lewtun <lewis.c.tunstall@gmail.com>	2022-06-01 06:16:15 -04:00
Patrick von Platen	ba286fe7d5	[GPT2Tokenizer] Fix GPT2 with bos token (#17498 )	2022-05-31 20:06:48 +02:00
Arthur	7822a9b7a7	Opt in flax and tf (#17388 ) * initial commit * add init file * update globakl init * update index and dummy objects * style * update modelling auto * fix initi typo in src/transformers * fix typo in modeling tf auto, opt was in wrong mapping name * fixed a slow test : saved_model * style * fix positionnal embedding if no position id is provided * update tf test * update test flax requirements * fixed serialization * update * update tf name to allow smooth convertion * update flax tests * style * fix test typo * fix tf typo test * add xla for generate support in causal LM * fixed bug * cleaned tf tests * style * removed from PT for slow tests * fix typp * opt test as slow * trying to fix GPT2 undefined * correct documentation and add to test doc * update tf doc * fix doc * fake commit * Apply suggestions from code review Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> * update test based on review * merged main layer for functionning test * fixup + quality * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * update long comment * make fix copies Co-authored-by: Arthur <arthur@huggingface.co> Co-authored-by: Joao Gante <joaofranciscocardosogante@gmail.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-05-31 18:41:22 +02:00
Patrick von Platen	f394a2a50d	[Json configs] Make json prettier for all saved tokenizer files & ensure same json format for all processors (tok + feat_extract) (#17457 ) * [Json dump] Make json prettier * correct more tokenizeirs * more patterns * add aggressive test * the aggressive test was actually useful :-) * more tests * Apply suggestions from code review	2022-05-31 17:07:30 +02:00
Vít Novotný	6ee1474b67	Accumulate tokens into batches in `PreTrainedTokenizerBase.add_tokens()` (#17119 ) * Accumulate tokens into batches in PreTrainedTokenizerBase.add_tokens() For tokenizers with a small number of special tokens or special tokens with consecutive token IDs, this reduces the time complexity of creating the trie from quadratic to linear, see also #16936. * Extend explanation of batching added tokens	2022-05-31 16:36:45 +02:00
Patrick von Platen	52e7c92920	Add HF.co for PRs / Issues regarding specific model checkpoints (#17485 ) * Add HF.co for PRs / Issues regarding specific model checkpoints * Update .github/ISSUE_TEMPLATE/config.yml Co-authored-by: Julien Chaumond <julien@huggingface.co> Co-authored-by: Julien Chaumond <julien@huggingface.co>	2022-05-31 15:58:39 +02:00
Martina Fumanelli	dfc38463b8	Setup for Italian translation and add quicktour.mdx translation (#17472 ) * Setup for Italian translation and add first document - Add 'it' folder for files translated into Italian - Add _config.py and _toctree.yml files - Add translation of quicktour.mdx * Fix style issue of italian documentation files * Add 'it' to the languages section in the .github/workflows * Remove - installation from _toctree for Italian * Translation for index file - Add index to _toctree.yml - Add translation of index.mdx * Fix typo in docs/source/it/index.mdx * Translate code comments in docs/source/it/_config.py Co-authored-by: Martina Fumanelli <martinafumanelli@Martinas-MBP.homenet.telecomitalia.it>	2022-05-31 09:57:43 -04:00
Yih-Dar	8f8b3cbce4	Fix checkpoint name (#17484 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-31 15:40:48 +02:00
Yih-Dar	400b30936a	Docker image build in parallel (#17434 ) * docker image build in parallel Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-31 15:39:03 +02:00
Ritik Nandwal	5af38953bb	Added XLM onnx config (#17030 ) * Add onnx configuration for xlm * Add supported features for xlm * Add xlm to models exportable with onnx * Add xlm architecture to test file * Modify docs * Make code quality fixes	2022-05-31 09:26:06 -04:00
Sylvain Gugger	567d9c061d	Disk offload fix (#17428 ) * Fix offload to disk for big models * Add test * Fix test for other models	2022-05-31 09:16:18 -04:00
Joao Gante	975dd2bbbc	TF: GPT-2 generation supports left-padding (#17426 ) * TF GPT-2 now properly works with left padding * throw a warning when eos token == pad token and there is no attention mask	2022-05-31 14:06:44 +01:00
Yih-Dar	c1a138613d	Fix ViTMAEModelTester (#17470 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-05-31 15:01:54 +02:00
Patrick von Platen	b0e0ac8a67	[Generate] Fix output scores greedy search (#17442 )	2022-05-31 14:59:49 +02:00
Omar U. Espejel	2ef09ecfb8	Fix nits (#17349 )	2022-05-31 08:41:54 -04:00
Michael Benayoun	28d0048218	Fx support for multiple model architectures (#17393 ) * Support for Bart and LayoutLM, and partial support for XLNet * Support for mbart * A lot of new models supported * Support for other models * LayoutLM fix * Use strings instead of classes	2022-05-31 10:02:55 +02:00
Ivan Gonzalez	04681c1d81	typo IBERT in __repr__ quant_mode (#17398 ) fix #17397	2022-05-31 03:48:10 -04:00
Michele Conti	13fd67346a	Fix typo (remove parenthesis) (#17415 )	2022-05-31 03:21:32 -04:00
Sourab Mangrulkar	d156898f3b	Improve notrainer examples (#17449 ) * improve no-trainer examples * Trigger CI * adding comment to clarify tracker init on main process * Trigger CI * Trigger CI * Trigger CI	2022-05-28 00:06:31 +05:30
Patrick von Platen	7999ec125f	[OPT] Fix bos token id default (#17441 )	2022-05-26 18:24:12 +02:00
Sylvain Gugger	98f6e1ee87	Fix model parallelism test (#17439 )	2022-05-26 09:57:12 -04:00
Sylvain Gugger	7535d92e71	Pin protobouf that breaks TensorBoard in PyTorch (#17440 )	2022-05-26 09:56:55 -04:00

1 2 3 4 5 ...

9923 Commits