HuggingFace_transformer

Author	SHA1	Message	Date
Patrick von Platen	2bef3433e5	[Flax] Correct all return tensors to numpy (#13307 ) * fix_torch_device_generate_test * remove @ * finish find and replace	2021-08-27 17:38:34 +02:00
Stefan Schweter	319d840b46	examples: add keep_linebreaks option to CLM examples (#13150 ) * examples: add keep_linebreaks option to text dataset loader for all CLM examples * examples: introduce new keep_linebreaks option as data argument in CLM examples	2021-08-27 11:35:45 +02:00
dependabot[bot]	0245cee469	Bump notebook from 6.1.5 to 6.4.1 in /examples/research_projects/lxmert (#13226 ) Bumps [notebook](http://jupyter.org) from 6.1.5 to 6.4.1. --- updated-dependencies: - dependency-name: notebook dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2021-08-24 09:52:39 -04:00
Allan Lin	91ff480e26	Update namespaces inside torch.utils.data to the latest. (#13167 ) * Update torch.utils.data namespaces to the latest. * Format * Update Dataloader. * Style	2021-08-19 14:29:51 +02:00
Suraj Patil	f5cd27694a	[FlaxCLIP] allow passing params to image and text feature methods (#13099 ) * allow passing params to image and text feature method * ifx for hybrid clip as well	2021-08-12 18:35:01 +05:30
Sylvain Gugger	9a498c37a2	Rely on huggingface_hub for common tools (#13100 ) * Remove hf_api module and use hugginface_hub * Style * Fix to test_fetcher * Quality	2021-08-12 14:59:02 +02:00
Gunjan Chhablani	c71f73f438	Add VisualBERT demo notebook (#12263 ) * Initialize VisualBERT demo * Update demo * Add commented URL * Update README * Update README	2021-08-11 10:10:59 -04:00
Patrick von Platen	13a9c9a354	[Flax] Refactor gpt2 & bert example docs (#13024 ) * fix_torch_device_generate_test * remove @ * improve docs for clm * speed-ups * correct t5 example as well * push final touches * Update examples/flax/language-modeling/README.md * correct docs for mlm * Update examples/flax/language-modeling/README.md Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-08-09 13:37:50 +02:00
abhishek thakur	3ff2cde5ca	tfhub.de -> tfhub.dev (#12565 )	2021-08-09 08:11:17 +02:00
Patrick von Platen	24cbf6bc5a	Update README.md	2021-08-08 17:11:19 +02:00
Sylvain Gugger	7fcee113c1	Tpu tie weights (#13030 ) * Fix tied weights on TPU * Manually tie weights in no trainer examples * Fix for test * One last missing * Gettning owned by my scripts * Address review comments * Fix test * Fix tests * Fix reformer tests	2021-08-06 20:41:39 +02:00
Patrick von Platen	2e4082364e	[Flax T5] Speed up t5 training (#13012 ) * fix_torch_device_generate_test * remove @ * update * up * fix * remove f-stings * correct readme * up Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-08-06 11:21:37 +02:00
Patrick von Platen	da9754a3a0	[Flax] Align jax flax device name (#12987 ) * [Flax] Align device name in docs * make style * fix import error	2021-08-04 16:00:09 +02:00
Chungman Lee	75b8990d90	fix typo in example/text-classification README (#12974 ) * fix typo in example/text-classification README * add space to align the table	2021-08-02 12:58:43 +02:00
Stefan Schweter	3d4b3bc3fd	examples: use correct way to get vocab size in flax lm readme (#12947 )	2021-07-30 21:57:53 +05:30
21jun	5c673efad7	fix typo in gradient_checkpointing arg (#12855 ) help for `ModelArguments.gradient_checkpointing` should be "If True, use gradient checkpointing to save memory at the expense of slower backward pass." not "Whether to freeze the feature extractor layers of the model." (which is duplicated from `freeze_feature_extractor` arg)	2021-07-30 15:06:33 +08:00
chutaklee	c164064eef	Fix distiller.py (#12910 ) * fix distiller * fix style	2021-07-29 02:11:38 +08:00
Sylvain Gugger	3ec851dc5e	Fix QA examples for roberta tokenizer (#12928 )	2021-07-28 09:47:49 -04:00
Sylvain Gugger	fd85734e0e	Add option to set max_len in run_ner (#12929 )	2021-07-28 09:38:12 -04:00
Elysium1436	f3d0866ed9	Correct validation_split_percentage argument from int (ex:5) to float (0.05) (#12897 ) * Fixed train_test_split test_size argument * `Seq2SeqTrainer` set max_length and num_beams only when non None (#12899) * set max_length and num_beams only when non None * fix instance variables * fix code style * [FLAX] Minor fixes in CLM example (#12914) * readme: fix retrieval of vocab size for flax clm example * examples: fix flax clm example when using training/evaluation files * Fix module path for symbolic_trace example Co-authored-by: cchen-dialpad <47165889+cchen-dialpad@users.noreply.github.com> Co-authored-by: Stefan Schweter <stefan@schweter.it> Co-authored-by: Sylvain Gugger <sylvain.gugger@gmail.com>	2021-07-27 21:01:40 -04:00
Stefan Schweter	d3c3e722d6	[FLAX] Minor fixes in CLM example (#12914 ) * readme: fix retrieval of vocab size for flax clm example * examples: fix flax clm example when using training/evaluation files	2021-07-27 19:48:04 +05:30
Matt	569f61a760	Add TF multiple choice example (#12865 ) * Add new multiple-choice example, remove old one	2021-07-26 15:15:51 +01:00
Sylvain Gugger	303989de0e	Add accelerate to examples requirements (#12888 )	2021-07-26 09:57:34 -04:00
Stas Bekman	98364ea74f	[tests] fix logging_steps requirements (#12860 )	2021-07-23 08:05:48 -07:00
Lysandre	40de2d5a4f	Docs for v4.10.0dev0	2021-07-22 12:52:25 +02:00
Lysandre	72aee83ced	Release: v4.9.0 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details	2021-07-22 12:11:55 +02:00
Maxwell Forbes	fcf83011df	Fix type of max_seq_length arg in run_swag.py (#12832 )	2021-07-22 02:14:14 -04:00
Patrick von Platen	acdd78db08	Update README.md	2021-07-20 16:48:37 +02:00
Patrick von Platen	31d06729f4	Update README.md	2021-07-20 14:19:37 +02:00
Patrick von Platen	13fefdf340	Update README.md cc @patil-suraj	2021-07-20 13:51:15 +02:00
fgaim	66197adc98	Flax MLM: Allow validation split when loading dataset from local file (#12689 ) * Allow validation split when loading dataset from local file * Flax clm & t5, enable validation split for datasets loaded from local file	2021-07-20 13:38:25 +02:00
Patrick von Platen	c6b9095cb2	Update README.md	2021-07-17 19:22:26 +02:00
Patrick von Platen	b4b562d834	[Wav2Vec2] Padded vectors should not allowed to be sampled (#12764 ) * fix_torch_device_generate_test * remove @ * finish * correct script * correct script	2021-07-16 19:07:08 +02:00
Suraj Patil	8ef3f36561	fix typos (#12757 )	2021-07-16 16:44:59 +05:30
Patrick von Platen	a76dd7ee82	Update README.md	2021-07-16 00:16:30 +01:00
Patrick von Platen	2e9fb13fb1	[Wav2Vec2] Correctly pad mask indices for PreTraining (#12748 ) * fix_torch_device_generate_test * remove @ * start adding tests * correct wav2vec2 pretraining * up * up Co-authored-by: Patrick von Platen <patrick@huggingface.co>	2021-07-15 21:40:25 +01:00
Suraj Patil	44f5b260fe	flax model parallel training (#12590 ) * update scripts * add copyright * add logging * cleanup * add z loss * add readme * shard description * update readme	2021-07-14 22:55:44 +05:30
Matt	f9ac677eba	Update TF examples README (#12703 ) * Update Transformers README, rename token_classification example to token-classification to be consistent with the others * Update examples/tensorflow/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Add README for TF token classification * Update examples/tensorflow/token-classification/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update examples/tensorflow/token-classification/README.md Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2021-07-14 15:15:25 +01:00
Patrick von Platen	f4399ec570	Update README.md	2021-07-14 12:54:31 +01:00
Matt	65bf05cd18	Adding TF translation example (#12667 ) * Adding TF translation example * Fixes and style pass for TF translation example * Remove unused postprocess_text copied from run_summarization * Adding README * Review fixes * Move changes to model.config to after we've initialized the model	2021-07-13 19:08:25 +01:00
Nick Doiron	5803a2a7ac	Add ByT5 option to example run_t5_mlm_flax.py (#12634 ) * Allow ByT5 type in Flax T5 script * use T5TokenizerFast * change up tokenizer config * model_args * reorder imports * Update run_t5_mlm_flax.py	2021-07-13 13:39:57 +01:00
Omar Sanseviero	c523b241c2	Update timeline for Flax event evaluation	2021-07-12 21:24:58 +02:00
Matt	379f649434	TF summarization example (#12617 ) * Adding a TF summarization example * Style pass * Style fixes * Updates for review comments * Adding README * Style pass * Remove unused import	2021-07-12 15:58:38 +01:00
Eduardo Gonzalez Ponferrada	2dd9440d08	Point to the right file for hybrid CLIP (#12599 )	2021-07-12 12:16:22 +05:30
Bhadresh Savani	de23ecea36	added test file (#12630 )	2021-07-12 12:15:14 +05:30
Patrick von Platen	deecdd4939	[Flax] Fix cur step flax examples (#12608 ) * fix_torch_device_generate_test * remove @ * fix save problem	2021-07-09 13:51:28 +01:00
Omar Sanseviero	8fe836af5a	Add Flax sprint project evaluation section (#12592 )	2021-07-09 08:52:30 +02:00
Sylvain Gugger	6f1adc4334	Fix group_lengths for short datasets (#12558 )	2021-07-08 07:23:41 -04:00
Ibraheem Moosa	122d7dc34f	Remove logging of GPU count etc logging. (#12569 ) Successfully logging this requires Pytorch. For the purposes of this script we are not using Pytorch.	2021-07-07 23:05:47 +01:00
Suraj Patil	d7e156bd1a	fix loading clip vision model (#12566 )	2021-07-07 22:50:27 +05:30

1 2 3 4 5 ...

1752 Commits