HuggingFace_transformer

Author	SHA1	Message	Date
Sylvain Gugger	66fd3a8d62	Patch: v4.30.2 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details v4.30.2	2023-06-13 14:24:02 -04:00
NielsRogge	8f9f1efaf8	Fix push to hub (#24187 ) Add fix	2023-06-13 14:23:39 -04:00
Matt	497d66740b	Fix how we detect the TF package (#24255 ) * Fix how we detect the TF package * Add a comment as a talisman warding against future harm * Actually put the comment in the right place	2023-06-13 14:22:33 -04:00
Sylvain Gugger	65a1ec05ca	Patch: v4.30.1 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details v4.30.1	2023-06-09 10:48:44 -04:00
Younes Belkada	fd59fc1a7f	[`bnb`] Fix bnb config json serialization (#24137 ) * fix bnb config json serialization * forward contrib credits from discussions --------- Co-authored-by: Andrechang <Andrechang@users.noreply.github.com>	2023-06-09 08:50:23 -04:00
Sourab Mangrulkar	a272e4135c	fix bugs with trainer (#24134 ) * fix the deepspeed test failures * apex fix * FSDP save ckpt fix * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-06-09 08:31:09 -04:00
Matt	50ed79312d	Correctly build models and import call_context for older TF versions (#24138 )	2023-06-09 08:30:54 -04:00
Younes Belkada	fe861e578f	[`GPT2`] Add correct keys on `_keys_to_ignore_on_load_unexpected` on all child classes of `GPT2PreTrainedModel` (#24113 ) Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details * add correct keys on `_keys_to_ignore_on_load_unexpected` * oops v4.30.0	2023-06-08 10:22:12 -04:00
Sylvain Gugger	b3e27a8057	Update the pin on Accelerate (#24110 )	2023-06-08 10:22:05 -04:00
Younes Belkada	53e1f5cf66	[`Trainer`] Correct behavior of `_load_best_model` for PEFT models (#24103 ) * v1 * some refactor - add ST format as well * fix * add `ADAPTER_WEIGHTS_NAME` & `ADAPTER_SAFE_WEIGHTS_NAME`	2023-06-08 09:40:01 -04:00
Sourab Mangrulkar	17db177714	reset accelerate env variables after each test (#24107 )	2023-06-08 09:36:42 -04:00
Sylvain Gugger	905892f090	Release: v4.30.0	2023-06-07 16:48:28 -04:00
Sylvain Gugger	c3572e6bfb	Add AzureOpenAiAgent (#24058 ) * Add AzureOpenAiAgent * quality * Update src/transformers/tools/agents.py Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> --------- Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2023-06-07 16:34:53 -04:00
Zachary Mueller	5eb3d3c702	Up pinned accelerate version (#24089 ) * Min accelerate * Also min version * Min accelerate * Also min version * To different minor version * Empty	2023-06-07 16:21:51 -04:00
Sourab Mangrulkar	d1c039e398	fix accelerator prepare during eval only mode (#24014 ) * fix mixed precision prep during eval only mode * update to address comments * update to reflect the changes in accelerate	2023-06-08 01:03:13 +05:30
Sylvain Gugger	2c887cf8e0	Do not prepare lr scheduler as it as the right number of steps (#24088 ) * Do not prepare lr scheduler as it as the right number of steps * Trigger CI * Trigger CI * Trigger CI * Add fake comment * Remove fake comment * Trigger CI please!	2023-06-07 15:31:32 -04:00
Sourab Mangrulkar	12298cb65c	fix executable batch size issue (#24067 ) * fix executable batch size issue * fix * undo	2023-06-07 22:08:04 +05:30
Mishig	ef010071ee	Update delete_doc_comment_trigger.yml (#24084 ) fix base workflow name	2023-06-07 17:55:48 +02:00
Sylvain Gugger	89b00eef94	Fix expected value in tests of the test fetcher (#24077 ) * Fix expected value in tests of the test fetcher * Fix trigger for repo util tests	2023-06-07 11:38:56 -04:00
Mishig	5c9394b54c	[doc build] Use secrets (#24079 )	2023-06-07 17:33:39 +02:00
Matt	1fc832b454	Make the TF dummies even smaller (#24071 ) * Let's see if we can use the smallest possible dummies * Make GPT-2's dummies a little longer * Just use (1,2) as the default shape * Update other dummies in sync * Correct imports for Keras 2.13 * Shrink the Wav2Vec2 dummies	2023-06-07 16:23:05 +01:00
Yih-Dar	092c14c37d	Be nice to TF (#24076 ) * fix * fix --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-07 16:18:13 +02:00
Younes Belkada	4795219228	[`bnb`] Fix bnb skip modules (#24043 ) * fix skip modules test * oops * address comments	2023-06-07 15:27:46 +02:00
Michael Benayoun	a1160185ff	Fix `is_optimum_neuron_available` (#23961 ) Fix is_optimum_neuron_available	2023-06-07 09:13:01 -04:00
Younes Belkada	6b548129b1	[`Hub`] Add `safe_serialization` in push_to_hub (#24074 ) add `safe_serialization` in push_to_hub	2023-06-07 09:07:33 -04:00
Younes Belkada	6daf7c311b	Support PEFT models when saving the model using trainer (#24073 ) * support PEFT models when saving the model using trainer * fixup	2023-06-07 14:30:55 +02:00
YangLiu	1e4a7737ed	Add support for non-rust implemented tokenization for `__getitem__` method. (#24039 ) * Add support for non-rust implemented tokenization for `__getitem__` method. * Update for error message on adding new sub-branch for `__item__` method. --------- Co-authored-by: liuyang17 <liuyang17@zhihu.com>	2023-06-07 12:29:19 +01:00
Patrick von Platen	52972e70c7	[Wav2Vec2] Fix torch srcipt (#24062 ) * [Wav2Vec2] Fix torch srcipt * fix more	2023-06-07 07:27:07 -04:00
Joao Gante	612b2a1a6d	Generate: increase left-padding test atol (#23448 ) increase atol	2023-06-07 11:56:57 +01:00
Sylvain Gugger	f1660d7e23	Remote code improvements (#23959 ) * Fix model load when it has both code on the Hub and locally * Add input check with timeout * Add tests * Apply suggestions from code review Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> * Some non-saved stuff * Add feature extractors * Add image processor * Add model * Add processor and tokenizer * Reduce timeout --------- Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2023-06-06 14:31:14 -04:00
Sylvain Gugger	60825f2c6e	Fix device placement for model-parallelism in generate for encoder/de… (#24025 ) * Fix device placement for model-parallelism in generate for encoder/decoders * Remove debug statements	2023-06-06 14:30:59 -04:00
Yih-Dar	02d255db26	bring back `filtered_test_list_cross_tests.txt` (#24055 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-06 19:35:24 +02:00
Edward Z. Yang	bc9ecef942	Use new parametrization based weight norm if available (#24030 ) * Use new parametrization based weight norm if available See https://github.com/pytorch/pytorch/pull/103001 Signed-off-by: Edward Z. Yang <ezyang@meta.com> * handle copies Signed-off-by: Edward Z. Yang <ezyang@meta.com> * black Signed-off-by: Edward Z. Yang <ezyang@meta.com> --------- Signed-off-by: Edward Z. Yang <ezyang@meta.com>	2023-06-06 13:34:57 -04:00
Matt	4a55e47877	Move TF building to an actual build() method (#23760 ) * A fun new PR where I break the entire codebase again * A fun new PR where I break the entire codebase again * Handle cross-attention * Move calls to model(model.dummy_inputs) to the new build() method * Seeing what fails with the build context thing * make fix-copies * Let's see what fails with new build methods * Fix the pytorch crossload build calls * Fix the overridden build methods in vision_text_dual_encoder * Make sure all our build methods set self.built or call super().build(), which also sets it * make fix-copies * Remove finished TODO * Tentatively remove unneeded (?) line * Transpose b in deberta correctly and remove unused threading local * Get rid of build_with_dummies and all it stands for * Rollback some changes to TF-PT crossloading * Correctly call super().build()	2023-06-06 18:30:51 +01:00
Zachary Mueller	cbf6bc2350	Oops, missed one (#24054 ) Oops	2023-06-06 13:30:19 -04:00
Matt	7203ea6797	Reduce memory usage in TF building (#24046 ) * Make the default dummies (2, 2) instead of (3, 3) * Fix for Funnel * Actually fix Funnel	2023-06-06 18:29:54 +01:00
Zachary Mueller	072188d638	Act on deprecations in Accelerate no_trainer examples (#24053 ) Act on deprecation	2023-06-06 13:04:38 -04:00
Yih-Dar	ff4c0fc7d2	Tiny fix for `check_self_hosted_runner.py` (#24052 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-06 18:17:41 +02:00
amyeroberts	a717e0318c	Add TimmBackbone model (#22619 ) * Add test_backbone for convnext * Add TimmBackbone model * Add check for backbone type * Tidying up - config checks * Update convnextv2 * Tidy up * Fix indices & clearer comment * Exceptions for config checks * Correclty update config for tests * Safer imports * Safer safer imports * Fix where decorators go * Update import logic and backbone tests * More import fixes * Fixup * Only import all_models if torch available * Fix kwarg updates in from_pretrained & main rebase * Tidy up * Add tests for AutoBackbone * Tidy up * Fix import error * Fix up * Install nattan in doc_test_job * Revert back to setting self._out_xxx directly * Bug fix - out_indices mapping from out_features * Fix tests * Dont accept output_loading_info for Timm models * Set out_xxx and don't remap * Use smaller checkpoint for test * Don't remap timm indices - check out_indices based on stage names * Skip test as it's n/a * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Cleaner imports / spelling is hard --------- Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2023-06-06 17:11:30 +01:00
Sylvain Gugger	b8935980a2	Modification of one text example file should trigger said test (#24051 )	2023-06-06 12:02:56 -04:00
Tom Aarsen	02fe3af275	Prevent ZeroDivisionError on `trainer.evaluate` if model and dataset are tiny (#24049 ) Prevent ZeroDivisionError if evaluation is too quick	2023-06-06 11:31:05 -04:00
Roy Hvaara	d924390d5b	Use TruncatedNormal from Keras initializers (#24036 ) Co-authored-by: Andrey Voynov <avoin@google.com>	2023-06-06 14:51:44 +01:00
Nicolas Patry	c2e3fa0b2a	Fixing single candidate_label return. (#24023 )	2023-06-06 15:26:10 +02:00
Marc Sun	6307312dfc	Add check for tied parameters (#24029 ) * Add check for tied parameters * Fix style * fix style * Fix versioning * Change if to elif	2023-06-06 09:12:46 -04:00
Wonhyeong Seo	7da3ce04a6	🌐 [i18n-KO] Translated `bertology.mdx` to Korean (#23968 ) * docs: ko: `bertology.mdx` * feat: nmt draft * fix: manual edits * fix: resolve suggestions Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com> --------- Co-authored-by: Hyeonseo Yun <0525yhs@gmail.com>	2023-06-06 09:08:45 -04:00
Wonhyeong Seo	c938597657	🌐 [i18n-KO] Translated `language-modeling.mdx` (#23969 ) * docs: ko: `language_modeling.mdx` * feat: nmt draft * fix: manual edits * fix: add inline toc * fix: typo in toc_tree.yml * fix: resolve suggestions Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com> --------- Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>	2023-06-06 09:08:26 -04:00
Yih-Dar	7631db0fdc	Pin `deepspeed` to `0.9.2` for now (#24024 ) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2023-06-05 20:00:28 +02:00
Yih-Dar	17846646f2	Fix `MobileViTV2` checkpoint name (#24018 ) * fix * fix * Apply suggestions from code review Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>	2023-06-05 18:12:45 +02:00
Hyeonseo Yun	649ffbf575	🌐 [i18n-KO] Translated `tasks_explained.mdx` to Korean (#23844 ) * docs: ko: tasks_explained.mdx * feat: nmt and manual edit `tasks_explained.mdx` * revised: resolve suggestions task_explained.mdx * fixed: added draft of reference docs Co-Authored-By: Kihoon Son <75935546+KIHOON71@users.noreply.github.com> Co-Authored-By: Nayeon Han <nayeon2.han@gmail.com> * revised: resolve suggestions(voca, spell check) task_explained.mdx Co-Authored-By: Sohyun Sim <96299403+sim-so@users.noreply.github.com> * revised: remove duplicate sentence in task_explained.mdx * fixed: remove draft of reference docs - I think it will be confusing in the translation process. - This issue is included in #23971. --------- Co-authored-by: Kihoon Son <75935546+KIHOON71@users.noreply.github.com> Co-authored-by: Nayeon Han <nayeon2.han@gmail.com> Co-authored-by: Sohyun Sim <96299403+sim-so@users.noreply.github.com>	2023-06-05 12:02:03 -04:00
Brian Yu	2872f9671b	TensorBoard callback no longer adds hparams (#23999 ) tensorboard callback no longer adds hparams	2023-06-05 11:53:45 -04:00

1 2 3 4 5 ...

13113 Commits