NielsRogge
ec4e421b7d
Update expected slices for pillow > 9 ( #16117 )
...
* Update expected slices for pillow > 9
* Add expected slices depending on pillow version
* Add different slices depending on pillow version for other models
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local >
2022-03-18 09:46:45 +01:00
Kshitiz Sharma
12d1f07770
integrations: mlflow: skip start_run() if a run is already active and sanity check on enabling integration ( #16131 )
...
* integrations: mlflow: skip start_run() call if a run is already active
* integrations: typo fix
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2022-03-17 16:39:57 -04:00
Stas Bekman
47cccb5318
[Deepspeed] non-HF Trainer doc update ( #16238 )
2022-03-17 13:33:55 -07:00
Patrick von Platen
8a96b0f10a
[Generate Docs] Correct docs ( #16133 )
...
* [Generate Docs] Correct docs
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2022-03-17 20:05:28 +01:00
Suraj Patil
632ff3c39e
[FlaxSpeechEncoderDecoderModel] Skip from_encoder_decoder_pretrained ( #16236 )
...
* skip the test
* fix
* fix skip
2022-03-17 20:05:14 +01:00
Boris Dayma
b6e06c845f
fix(flax): generate with logits processor/warper ( #16231 )
2022-03-17 19:39:16 +01:00
Johannes Kolbe
1c1e377e99
TF - add unpack_inputs decorator for marian ( #16226 )
...
* add unpack_inputs decorator
* small fix for attn_mask string
Co-authored-by: Johannes Kolbe <johannes.kolbe@tech.better.team >
2022-03-17 18:23:40 +00:00
罗崚骁(LUO Lingxiao)
81643edda5
Support PEP 563 for HfArgumentParser ( #15795 )
...
* Support PEP 563 for HfArgumentParser
* Fix issues for Python 3.6
* Add test for string literal annotation for HfArgumentParser
* Remove wrong comment
* Fix typo
* Improve code readability
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
* Use `isinstance` to compare types to pass quality check
* Fix style
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2022-03-17 13:51:37 -04:00
Suraj Patil
93d3fd8645
remove jax.ops.index ( #16220 )
2022-03-17 17:51:43 +01:00
Ulaş "Sophylax" Sert
8481ecefbd
Fix Type Hint of Nan/Inf Logging Filter Arg ( #16227 )
2022-03-17 11:05:38 -04:00
Lysandre Debut
5a6b3ccd28
Skip equivalence test for TransfoXL ( #16224 )
...
* Skip test for TransfoXL
* Single list
2022-03-17 09:03:07 -04:00
Rahul
abd503d939
TF - Adding Unpack Decorator For DPR model ( #16212 )
...
* Adding Unpack Decorator
* Adding Unpack Decorator-moved it on top
2022-03-17 12:33:02 +00:00
Francesco Saverio Zuppichini
d9b8d1a9f5
update test ( #16219 )
2022-03-17 08:11:55 -04:00
Li-Huai (Allan) Lin
7e0d04bed1
Fix readmes ( #16217 )
2022-03-17 07:47:01 -04:00
Sylvain Gugger
e1da89ccb8
Fix reproducibility in Training for PyTorch 1.11 ( #16209 )
2022-03-17 07:42:58 -04:00
Dayyan Smith
e5101c2e27
Fix typo ( #16208 )
2022-03-17 07:21:20 -04:00
Yih-Dar
25b8f9a85b
Fix FlaxRoFormerClassificationHead activation ( #16168 )
...
* fix activation
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2022-03-17 11:45:50 +01:00
NielsRogge
03c14a515f
[Tests] Fix DiT test ( #16218 )
...
* Fix device
* Clean up
Co-authored-by: Niels Rogge <nielsrogge@Nielss-MacBook-Pro.local >
2022-03-17 10:53:57 +01:00
Lysandre Debut
73f0a5d1f6
Fixes Loss for TransfoXL when using Trainer API v2 ( #16140 )
...
* fix(transfo_xl): Fixes TransfoXL support when using Trainer.
* fix(tests): Uses losses_1 and losses_2 pattern with TransfoXL test.
* fix(transfo_xl): Adds requested changes to allow for backward compatibility.
fix(transfo_xl): Adds requested changes to allow for backward compatibility.
fix(transfo_xl): Fixes code styling.
* Backward compatibility
* Update src/transformers/models/transfo_xl/modeling_transfo_xl.py
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
Co-authored-by: Gustavo de Rosa <gth.rosa@uol.com.br >
Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com >
2022-03-17 05:49:24 -04:00
Francesco Saverio Zuppichini
76c74b37c1
VAN: update modules names ( #16201 )
...
* done
* done
2022-03-17 10:25:09 +01:00
João Gustavo A. Amorim
99e2982f3e
Add/type annotations/model vision ( #16151 )
...
* add types annotations for Beit (PyTorch)
* add types annotations for ViT (PyTorch)
* add types annotations for Deit (PyTorch)
* change Optional[bool] to bool into some places at Beit
* change Optional[bool] to bool into some places at ViT
2022-03-16 20:27:54 +00:00
Patrick von Platen
2410d0f8ed
Fix generation min length ( #16206 )
...
* up
* fix min lengths
2022-03-16 18:49:23 +01:00
Francesco Saverio Zuppichini
667b823b89
Swin support for any input size ( #15986 )
...
* padding done
* correctly return one attention per layer
* almost correct, attentions are not flatten one tuple per stage
* tests green
* doc
* conversations
* reshaping hidden_states
* view in the test
* reshape_hidden_states in Encoder and Model
* new outputs with reshaped_hidden_states
* conversations
* doc
* Update docs/source/model_doc/swin.mdx
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com >
* Apply suggestions from code review
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com >
* conversations
* fix tests
* minor changes
* resolved conversations
* attentions one per stage
* typo
* typos
* typos
* function signature
* CI
* clean up tests
Co-authored-by: NielsRogge <48327001+NielsRogge@users.noreply.github.com >
2022-03-16 18:38:25 +01:00
Joao Gante
204c54d411
TF: add beam search tests ( #16202 )
2022-03-16 15:44:33 +00:00
Suraj Patil
190994573a
Fix loading CLIPVisionConfig and CLIPTextConfig ( #16198 )
...
* override from_pretrained
* add tests
* remove docstrings
* fix typo
* Trigger CI
2022-03-16 16:24:01 +01:00
Yih-Dar
09013efdf1
Update step name ( #16189 )
...
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2022-03-16 11:19:38 -04:00
Francesco Saverio Zuppichini
36f8c42519
ResNet: update modules names ( #16196 )
...
* updated names
* fit in one line
* typo
2022-03-16 15:59:56 +01:00
John Ryan
5bdf3313ef
Adding type hints for Distilbert ( #16090 )
...
* Distillbert type - squash
* Update src/transformers/models/distilbert/modeling_distilbert.py
Undo cleanup
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com >
* Update src/transformers/models/distilbert/modeling_distilbert.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com >
* Update src/transformers/models/distilbert/modeling_distilbert.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com >
* Update src/transformers/models/distilbert/modeling_distilbert.py
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com >
* Remove type
Co-authored-by: Matt <Rocketknight1@users.noreply.github.com >
2022-03-16 14:54:50 +00:00
Utku Saglam
0b8b06185d
clearer model variable naming: blenderbot_small ( #16194 )
...
Co-authored-by: utku saglam <utkusaglam@utku-MacBook-Pro.local >
2022-03-16 14:03:58 +00:00
Johannes Kolbe
f06c2c2ba1
TF unpack_input decorator for convnext ( #16181 )
...
* unpack_input decorator for tf_convnext
* set unpack_input as top decorator
Co-authored-by: Johannes Kolbe <johannes.kolbe@tech.better.team >
2022-03-16 14:01:32 +00:00
Anton Lozhkov
d35e0c6247
Minor fixes to XTREME-S ( #16193 )
...
* Minor fixes
* Fix vocab union
* Update examples/research_projects/xtreme-s/README.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update README
* unused import
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2022-03-16 17:23:00 +04:00
Utku Saglam
8cc925a241
TF clearer model variable naming: blenderbot ( #16192 )
...
Co-authored-by: utku saglam <utkusaglam@utku-MacBook-Pro.local >
2022-03-16 12:37:08 +00:00
Utku Saglam
0f35cda459
TF clearer model variable naming: funnel ( #16178 )
...
Co-authored-by: utku saglam <utkusaglam@utku-MacBook-Pro.local >
2022-03-16 10:37:47 +00:00
Sanchit Gandhi
ee27b3d7df
Replace all deprecated jax.ops operations with jnp's at ( #16078 )
...
* Replace all deprecated `jax.ops` operations with jnp's `at`
* np to jnp scores
* suggested changes
2022-03-16 09:08:55 +00:00
Patrick von Platen
c2dc89be62
[Xtreme-S] fix some namings ( #16183 )
2022-03-16 01:21:31 +01:00
Anton Lozhkov
99fd3eb4a5
Add the XTREME-S fine-tuning example ( #15985 )
...
* CTC+classification draft
* CTC+classification draft
* style
* multilingual runs
* Fix race condition during processor.from_reatrained
* Merge covost experiments
* Add README
* Quality
* Switch to .all configs
* Fix typos
2022-03-16 00:21:06 +01:00
Sylvain Gugger
db4dd44ae3
Trigger doc build
2022-03-15 17:00:31 -04:00
Yih-Dar
ea05d67164
Fix some Flax models' hidden_states ( #16167 )
...
* fix the last element in `hidden_states`
* fix missing elements in outputs for FlaxWav2Vec2EncoderLayerStableLayerNormCollection
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2022-03-15 19:06:46 +01:00
Dan Tegzes
88f7c564f0
Added type hints for Reformer ( #16175 )
2022-03-15 17:59:59 +00:00
Jack McDonald
16399d6197
Add type annotations for Perceiver ( #16174 )
2022-03-15 17:56:57 +00:00
Kamal Raj
015de6f081
TF clearer model variable naming: xlnet ( #16150 )
2022-03-15 17:50:30 +00:00
Thomas Chaigneau
a23a7c0cd6
Add flaubert types ( #16118 )
...
* Add type hints for FlauBERT PyTorch Base model. Others downstream tasks are inherited from XLM RoBERTa.
* Add type hints for FlaubERT Tensorflow models.
* fix output for TFFlaubertWithLMHeadModel
2022-03-15 16:57:45 +00:00
Kamal Raj
366c18f473
TF clearer model variable naming: Deberta ( #16146 )
2022-03-15 16:53:25 +00:00
Kamal Raj
79465ac521
TF clearer model variable naming: Tapas ( #16145 )
2022-03-15 16:52:56 +00:00
Suraj Patil
a78565b7aa
[MT5Config] add relative_attention_max_distance in config ( #16170 )
2022-03-15 16:26:52 +01:00
Sylvain Gugger
4f4e5ddbcb
Framework split ( #16030 )
...
* First files
* More files
* Last files
* Style
2022-03-15 10:13:34 -04:00
mowafess
4a353cacb7
added type hints to yoso ( #16163 )
2022-03-15 14:04:32 +00:00
Joydeep Bhattacharjee
c1c17bd0b3
update transformer XL with tf decorator ( #16166 )
...
* update transformer XL with tf decorator
* code fixup
* remove unused variables
2022-03-15 14:00:18 +00:00
Minh Chien Vu
611d3a09b2
Change unpacking of TF inputs: layoutlm, mpnet, rag, and roformer ( #16112 )
...
Co-authored-by: ChienVM <chien_vm@detomo.co.jp >
2022-03-15 13:47:45 +00:00
Kamal Raj
0d7322c1b7
TF clearer model variable naming: pegasus ( #16152 )
2022-03-15 13:45:59 +00:00