HuggingFace_transformer

Author	SHA1	Message	Date
Sylvain Gugger	b9a768b3ff	Enable doc in Spanish (#16518 ) * Reorganize doc for multilingual support * Fix style * Style * Toc trees * Adapt templates	2022-04-04 10:25:46 -04:00
Sylvain Gugger	3951b9f390	Add utility to find model labels (#16526 ) * Add utility to find model labels * Use it in the Trainer * Update src/transformers/utils/generic.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com> * Quality Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2022-04-04 10:06:57 -04:00
Daniel Stancl	ec4da72fe9	Fix flax import in __init__.py: modeling_xglm -> modeling_flax_xglm (#16556 )	2022-04-04 14:54:25 +02:00
Nicolas Patry	013a7dbe3d	Making the impossible to connect error actually report the right URL. (#16446 )	2022-04-04 14:26:23 +02:00
Patrick von Platen	ad0cba08ea	[FlaxSpeechEncoderDecoder] Fix dtype bug (#16581 ) * [FlaxSpeechEncoderDecoder] Fix dtype bug * more fixes	2022-04-04 13:53:54 +02:00
Yih-Dar	60d27b1f15	Add code samples for TF speech models (#16494 ) Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-01 17:54:01 +02:00
Lysandre Debut	53a4d6b115	Pin tokenizers version <0.13 (#16539 ) * Pin tokenizers version <0.13 * Style	2022-04-01 11:53:18 -04:00
NielsRogge	61ee26a892	Improve code example (#16450 ) Co-authored-by: Niels Rogge <nielsrogge@nielss-mbp.home>	2022-04-01 17:19:36 +02:00
Yih-Dar	2199382dfd	Use random_attention_mask for TF tests (#16517 ) * use random_attention_mask for TF tests * Fix for TFCLIP test (for now). Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-04-01 16:53:07 +02:00
Gunjan Chhablani	823dbf8a41	Remove MBart subclass of XLMRoberta in tokenzier docs (#16546 ) * Remove MBart subclass of XLMRoberta in tokenzier * Fix style * Copy docs from MBart50 tokenizer	2022-04-01 16:39:28 +02:00
Rishav Chandra Varma	5fe06b9bdd	Adding missing type hints for mBART model (PyTorch) (#16429 ) * added type hints for mbart tensorflow tf implementation * Adding missing type hints for mBART model Tensorflow Implementation model added with missing type hints * Missing Type hints - correction For TF model * Code fixup using make quality tests * Hint types - typo error * make fix-copies and make fixup * type hints * updated files * type hints update * making dependent modesls coherent Co-authored-by: matt <rocketknight1@gmail.com>	2022-04-01 15:21:26 +01:00
Gunjan Chhablani	9947dd077c	Add VisualBert type hints (#16544 )	2022-04-01 15:02:58 +01:00
Gunjan Chhablani	59a9c83e40	Fix Bart type hints (#16297 ) * Add type hints to PLBart PyTorch * Remove pending merge conflicts * Fix PLBart Type Hints * Add changes from review	2022-04-01 14:50:22 +01:00
Dahlbomii	afc5a1ea3a	Type hints added (#16529 )	2022-04-01 14:27:41 +01:00
Ferdinand Schlatt	483a9450a0	call on_train_end when trial is pruned (#16536 )	2022-04-01 08:50:47 -04:00
Jim Rohrer	9de70f213e	Add ONNX export for BeiT (#16498 ) * Add beit onnx conversion support * Updated docs * Added cross reference to ViT ONNX config	2022-04-01 10:52:42 +02:00
Cathy	bfeff6cc6a	Fixed a typo in legacy seq2seq_trainer.py (#16531 )	2022-04-01 09:17:31 +02:00
Anton Lozhkov	5807054bd3	[research] link to the XTREME-S paper (#16519 ) * [research] link to the XTREME-S paper * Update examples/research_projects/xtreme-s/README.md Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr> Co-authored-by: Lysandre Debut <lysandre.debut@reseau.eseo.fr>	2022-03-31 23:26:50 +04:00
Sylvain Gugger	e4b234834a	Fix syntax error in generate docstrings (#16516 )	2022-03-31 08:45:47 -04:00
Mowaninuola Osifeso	b808d8a596	added type hints to xglm pytorch (#16500 ) * added type hints to xglm pytorch * Update src/transformers/models/xglm/modeling_xglm.py * Update src/transformers/models/xglm/modeling_xglm.py Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>	2022-03-31 13:43:04 +01:00
Bhadresh Savani	05b4c32908	fixed a typo (#16508 )	2022-03-31 07:49:02 -04:00
Santiago Gómez	6a4dbba1a3	Translate accelerate.mdx from english to spanish (#16176 ) * Translate accelerate.mdx from english to spanish * Update docs/source_es/accelerate.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Apply suggestions from code review Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Apply suggestions from code review Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Fix nits and finish translation Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>	2022-03-31 07:45:18 -04:00
Liliana Badillo	c551addeb0	Translate installation.mdx to Spanish (#16229 ) * Translate installation.mdx to Spanish * Update docs/source_es/installation.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/installation.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/installation.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/installation.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/installation.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/installation.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/installation.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/installation.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/installation.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/installation.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Fix nits and finish translation Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>	2022-03-31 07:44:47 -04:00
Juanjo do Olmo	98939e6aee	Spanish translation of the file multilingual.mdx (#16329 ) * Duplication of the source eng file * Spanish translation of the file multilingual.mdx * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Update docs/source_es/multilingual.mdx Co-authored-by: Omar U. Espejel <espejelomar@gmail.com> * Fix nits and finish translation Co-authored-by: Omar U. Espejel <espejelomar@gmail.com>	2022-03-31 07:43:31 -04:00
chenbohua3	99a01423b9	make tuple annotation more specific to avoid failures during symbolic_trace (#16490 ) * make tuple annotation more specific to avoid failures during symbolic_trace * make tuple annotation more specific to avoid failures during symbolic_trace	2022-03-31 12:39:46 +01:00
Francesco Saverio Zuppichini	a8b6443e06	Refactor Modeling Outputs (#16341 ) * first proposal * replace model outputs in various models * conflicts * docstring * update poolformer * minor change in docstring * CI * removed poolformer specific outputs from doc * removed convnext specific outputs from doc * CI * weird char in segformer * conversations * reverted docstring for BaseModelOutputWithPooling * update outputs * changed docstring in BaseModelOutput * updated docstring in modeling outputs * typos :) * fixed typo after copy & paste it all around * CI * Apply suggestions from code review Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com> * segformer Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com>	2022-03-31 09:32:33 +02:00
Manuel R. Ciosici	857eb87cc4	Support reduce_bucket_size=auto for deepspeed stages <3 (#16496 )	2022-03-30 14:12:29 -07:00
Lai Wei	81ac45f85c	update smddp api to v1.4.0 (#16371 ) * update smddp api to v1.4.0 * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Update src/transformers/trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * address comments * fix style * remove unused import * fix indent * disable style check for import * fix space Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-30 16:28:35 -04:00
Stas Bekman	a73281e3e4	[examples] max samples can't be bigger than the len of dataset (#16501 ) * [examples] max samples can't be bigger than then len of dataset * do tf and flax	2022-03-30 12:33:16 -07:00
Francesco Saverio Zuppichini	c4deb7b3ae	Feature Extractor accepts `segmentation_maps` (#15964 ) * feature extractor accepts * resolved conversations * added examples in test for ADE20K * num_classes -> num_labels * Apply suggestions from code review Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * resolving conversations * resolving conversations * removed ADE * CI * minor changes in conversion script * reduce_labels in feature extractor * minor changes * correct preprocess for instace segmentation maps * minor changes * minor changes * CI * debugging * better padding * going to update labels inside the model * going to update labels inside the model * minor changes * tests * removed changes in feature_extractor_utils * conversation * conversation * example in feature extractor * more docstring in modeling * test * make style * doc Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-30 18:46:51 +02:00
Joao Gante	c2f8eaf6bc	TF: unpack inputs on Convbert, GPTJ, LED, and templates (#16491 ) * Add unpack_inputs to remaining models * remove stray use of inputs in the templates; fix tf.debugging of attn masks	2022-03-30 17:12:27 +01:00
tomerip	ae189ef991	Add support for exporting GPT-J to ONNX-TRT (#16492 ) Add support for exporting GPT-J to ONNX-TRT Co-authored-by: Tomer Stav <stavt@amazon.com>	2022-03-30 17:56:03 +02:00
dctelus	d04adc3521	Add length to PreTrainedTokenizer train_new_from_iterator (#16493 )	2022-03-30 11:41:04 -04:00
Aditya Kane	147c816685	Nit: MCSCOCO -> MS COCO (#16481 )	2022-03-30 10:06:32 -04:00
Dahlbomii	ffd19ee1de	TF GPT-J Type hints and TF decorator (#16488 ) * Type hints and TF decorator added * Type hints and TF decorator added * make style Co-authored-by: matt <rocketknight1@gmail.com>	2022-03-30 14:03:54 +01:00
Antoni Baum	277d49a590	Do not initialize `torch.distributed` process group if one is already initailized (#16487 ) * Do not initialize torch process group twice * Apply suggestions from code review	2022-03-29 19:07:31 -04:00
Yih-Dar	2b483230a1	Raise diff tolerance value for TFViTMAEModelTest (#16483 ) * Raise diff tolerance value Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-29 22:12:27 +02:00
Christopher Akiki	ee18d4d2a9	TF GPT2: clearer model variable naming with @unpack_inputs (#16311 ) * add unpack_inputs decorator to Main Layer * add unpack_inputs decorator to Model * add unpack_inputs decorator to LMHead Model * add unpack_inputs decorator to Double Head Model * add unpack_inputs decorator to Sequence Classification Model * run fixup recipe * make unpack_inputs the first decorator	2022-03-29 20:35:25 +01:00
Sander Land	d7c8ce57d4	Avoid accessing .dataset of a DataLoader in Trainer (#16451 ) * Avoid accessing .dataset of a dataloader * style * fix * cleaning up, reverting some misunderstandings * black * add train_dataset argument to get_train_dataloader, and fix other instances of length checks * flake8 * address comments * fix bug * cleanup * add test * Update tests/trainer/test_trainer.py Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * under torch * merge * stylistic suggestion Co-authored-by: Sander Land <sander@chatdesk.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>	2022-03-29 15:00:18 -04:00
akashe	781af7362b	added typehints for RAG pytorch models (#16416 )	2022-03-29 18:24:25 +01:00
Sayak Paul	5b40a37bc4	Add TF ViT MAE (#16255 ) * ported TFViTMAEIntermediate and TFViTMAEOutput. * added TFViTMAEModel and TFViTMAEDecoder. * feat: added a noise argument in the implementation for reproducibility. * feat: vit mae models with an additional noise argument for reproducibility. Co-authored-by: ariG23498 <aritra.born2fly@gmail.com> Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-29 18:24:15 +01:00
Joao Gante	7a9ef8181c	TF: properly handle kwargs in encoder_decoder architectures (#16465 ) * properly handle kwargs in encoder_decoder architectures * make fixup	2022-03-29 18:17:47 +01:00
Dan Tegzes	0540d1b6c0	Add type hints for UniSpeech (#16399 ) * Add type hints for UniSpeech * Added type hints for UniSpeechSat * Added type hints for Wave2Vec2 (PT) * Added type hints for models dependent of wave2vec	2022-03-29 18:02:46 +01:00
Wesley A. Cheng	875e07a9e3	[doc] Fix missing trainer import (#16469 )	2022-03-29 18:57:43 +02:00
Yih-Dar	6358a4c8ec	Add TF vision model code samples (#16477 ) * add code samples Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-29 18:57:16 +02:00
Wesley A. Cheng	3015d12bfb	fix wrong variable name (#16467 )	2022-03-29 18:55:40 +02:00
Sylvain Gugger	b62ac4d240	Fix example test and test_fetcher for examples (#16478 )	2022-03-29 12:21:19 -04:00
Yih-Dar	86cff21cf6	Fix some TF GPT-J CI testings (#16454 ) * Fix for test_mixed_precision * Fix test_saved_model_creation by using shape_list instead of shape * skit test_model_from_pretrained on GPU for now to avoid GPU OOM * skip test_gptj_sample_max_time for now Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>	2022-03-29 18:04:20 +02:00
Yih-Dar	aebca696af	Fix missing output_attentions in PT/Flax equivalence test (#16271 ) * fix - set output_attentions to True * Update tests/test_modeling_flax_common.py * update for has_attentions * overwrite check_outputs in FlaxBigBirdModelTest Co-authored-by: ydshieh <ydshieh@users.noreply.github.com> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-03-29 17:51:48 +02:00
Steven Liu	45abb37ac9	Remove duplicate mLuke (#16460 ) * Remove duplicate mLuke * 🖍 apply feedback	2022-03-29 10:34:30 -05:00

1 2 3 4 5 ...

9455 Commits