Rémi Louf
f8e98d6779
load pretrained embeddings in Bert decoder
In Rothe et al.'s "Leveraging Pre-trained Checkpoints for Sequence
Generation Tasks", Bert2Bert is initialized with pre-trained weights for
the encoder, and only pre-trained embeddings for the decoder. The
current version of the code completely randomizes the weights of the
decoder.
We write a custom function to initialize the weights of the decoder: we
first load the pre-trained weights into the decoder and then randomize
everything but the embeddings.
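
The two-step initialization described above can be sketched without any
framework. This is a minimal illustration and not the code from this
commit: the function name init_decoder_from_pretrained, the plain
dict-of-lists standing in for a model's weights, the parameter names, and
the 0.02 standard deviation are all assumptions chosen for the example.

```python
import random


def init_decoder_from_pretrained(pretrained, std=0.02, seed=0):
    """Copy pre-trained weights, then re-randomize all but the embeddings.

    `pretrained` is a hypothetical mapping from parameter name to a flat
    list of floats; a real model would hold tensors instead.
    """
    rng = random.Random(seed)

    # Step 1: start from a full copy of the pre-trained weights.
    decoder = {name: list(values) for name, values in pretrained.items()}

    # Step 2: overwrite every non-embedding parameter with random values.
    for name in list(decoder):
        if "embeddings" in name:
            continue  # keep the pre-trained embeddings untouched
        decoder[name] = [rng.gauss(0.0, std) for _ in decoder[name]]

    return decoder


# Toy checkpoint with illustrative parameter names and values.
pretrained = {
    "embeddings.word_embeddings.weight": [0.11, -0.42, 0.37],
    "encoder.layer.0.attention.self.query.weight": [0.5, 0.6, 0.7],
    "encoder.layer.0.output.dense.weight": [-0.1, 0.2, -0.3],
}
decoder = init_decoder_from_pretrained(pretrained)
```

Loading everything first and then randomizing selectively means only the
parameters to keep need to be named; any parameter whose name does not
contain "embeddings" is reset.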
2019-10-11 16:48:11 +02:00