HuggingFace_transformer/transformers
Rémi Louf f8e98d6779 load pretrained embeddings in Bert decoder
In Rothe et al.'s "Leveraging Pre-trained Checkpoints for Sequence
Generation Tasks", Bert2Bert is initialized with pre-trained weights for
the encoder, and only pre-trained embeddings for the decoder. The
current version of the code completely randomizes the weights of the
decoder.

We write a custom function to initialize the weights of the decoder: we
first load the pre-trained weights into the decoder and then randomize
everything but the embeddings.
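
The two-step procedure described above (load everything, then re-randomize
all but the embeddings) can be sketched without any framework. This is only
an illustration under stated assumptions, not the function added by this
commit: the helper name `init_decoder_from_pretrained`, the toy parameter
names, the use of plain lists in place of tensors, and the normal
initialization with `std=0.02` are all hypothetical choices made for the
example.

```python
import copy
import random


def init_decoder_from_pretrained(pretrained, keep_substring="embeddings",
                                 std=0.02, seed=None):
    """Copy pre-trained weights, then re-randomize all but the embeddings.

    `pretrained` is a toy stand-in for a state dict: it maps parameter
    names to flat lists of floats (hypothetical layout, for illustration).
    """
    rng = random.Random(seed)
    # Step 1: start from a full copy of the pre-trained weights.
    decoder = copy.deepcopy(pretrained)
    for name, values in decoder.items():
        if keep_substring in name:
            # Step 2a: embeddings keep their pre-trained values.
            continue
        # Step 2b: every other parameter is redrawn from N(0, std^2)
        # (assumed initializer; the real code may use a different one).
        decoder[name] = [rng.gauss(0.0, std) for _ in values]
    return decoder


# Illustrative parameter names and values, not taken from a real checkpoint.
pretrained = {
    "embeddings.word_embeddings.weight": [0.1, -0.2, 0.3, 0.4],
    "encoder.layer.0.attention.self.query.weight": [0.5, 0.6, 0.7, 0.8],
    "encoder.layer.0.output.dense.bias": [0.9, 1.0],
}
decoder = init_decoder_from_pretrained(pretrained, seed=0)
```

Copying first and randomizing second keeps the selection logic in one
place: only the name filter decides which pre-trained values survive, so
parameter shapes and names always match the checkpoint.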
2019-10-11 16:48:11 +02:00