Lysandre Debut
d5d7d88612
ELECTRA (#3257)
* Electra wip
* helpers
* Electra wip
* Electra v1
* ELECTRA may be saved/loaded
* Generator & Discriminator
* Embedding size instead of halving the hidden size
* ELECTRA Tokenizer
* Revert BERT helpers
* ELECTRA Conversion script
* Archive maps
* PyTorch tests
* Start fixing tests
* Tests pass
* Same configuration for both models
* Compatible with base + large
* Simplification + weight tying
* Archives
* Auto + Renaming to standard names
* ELECTRA is uncased
* Tests
* Slight API changes
* Update tests
* wip
* ElectraForTokenClassification
* temp
* Simpler arch + tests
Removed ElectraForPreTraining which will be in a script
* Conversion script
* Auto model
* Update links to S3
* Split ElectraForPreTraining and ElectraForTokenClassification
* Actually test PreTraining model
* Remove num_labels from configuration
* wip
* wip
* From discriminator and generator to electra
* Slight API changes
* Better naming
* TensorFlow ELECTRA tests
* Accurate conversion script
* Added to conversion script
* Fast ELECTRA tokenizer
* Style
* Add ELECTRA to README
* Modeling PyTorch Doc + Real style
* TF Docs
* Docs
* Correct links
* Correct model initialized
* random fixes
* style
* Addressing Patrick's and Sam's comments
* Correct links in docs
2020-04-03 14:10:54 -04:00