HuggingFace_transformer

Files

Matt d6cec45801 XLA train step fixes (#17973 )

* Copy inputs to train and test step before modifying them, as this breaks things

* Add XLA tests, fix our loss functions to be XLA-compatible

* make fixup

* Update loss computation test to expect vector of per-sample losses

* Patch loss for TFLED

* Patch loss for TFAlbert

* Add a tf_legacy_loss config flag that enables old loss functions

* Stop using config.get() because it's not a dict

* Skip loss computation test for RAG because its loss is very strange and I'm afraid to rewrite it

* make fixup

* Add XLA-compatible RAG loss

* Fix dtype of loss mask for TFAlbert

* Fix test for XLNet too because it overrides the default one

* make fixup

* Fix config test

* No more depending on GPU NaN behaviour

* Add test, avoid potential zero division

* Fix test item assignment

* Fix loss computation masking test

* make fixup

* Fix dtype bugs

2022-07-01 19:11:14 +01:00

__init__.py

[Test refactor 1/5] Per-folder tests reorganization (#15725 )

2022-02-23 15:46:28 -05:00

test_activations_tf.py

TF: Add sigmoid activation function (#16819 )

2022-04-19 16:13:08 +01:00

test_activations.py

TF: Add sigmoid activation function (#16819 )

2022-04-19 16:13:08 +01:00

test_add_new_model_like.py

Reorganize file utils (#16264 )