HuggingFace_transformer/transformers
Simon Layton 899883644f Fix test fails and warnings
Attention output was in bnij ordering instead of the ijbn ordering that
everything else expects. This was an oversight on my part; the fix keeps
the attention inputs/outputs identical to the original code.

Also moved back from tensor slicing to index_select in rel_shift_bnij to
make the tracer happy.
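
For illustration only, here is a minimal PyTorch sketch of the two changes described above. It is not the committed code: the body of `rel_shift_bnij` is an assumed reconstruction of a relative-shift step, the helper `bnij_to_ijbn` is a hypothetical name, and the tensor sizes are arbitrary. It shows that selecting the first `klen` positions with `torch.index_select` gives the same values as the slice `x[..., :klen]`, and how a bnij tensor can be permuted to ijbn.

```python
import torch


def rel_shift_bnij(x, klen):
    """Relative shift on a (b, n, i, j) tensor (assumed sketch, not the committed code)."""
    b, n, i, j = x.shape
    # Swap the sizes of the last two axes, drop the first row, and fold back.
    x = x.reshape(b, n, j, i)
    x = x[:, :, 1:, :]
    x = x.reshape(b, n, i, j - 1)
    # Select the first klen positions with index_select rather than x[..., :klen].
    idx = torch.arange(klen, device=x.device, dtype=torch.long)
    return torch.index_select(x, 3, idx)


def bnij_to_ijbn(x):
    """Hypothetical helper: reorder axes from (b, n, i, j) to (i, j, b, n)."""
    return x.permute(2, 3, 0, 1).contiguous()


# Arbitrary sizes chosen for the example: b=2, n=3, i=4, j=9.
x = torch.arange(2 * 3 * 4 * 9, dtype=torch.float32).reshape(2, 3, 4, 9)
shifted = rel_shift_bnij(x, klen=5)

# Slice-based reference for comparison.
ref = x.reshape(2, 3, 9, 4)[:, :, 1:, :].reshape(2, 3, 4, 8)[:, :, :, :5]

out_ijbn = bnij_to_ijbn(shifted)
```

Both forms produce the same values in eager mode. The difference matters only when the function is traced, where passing the length as an index tensor is one way to avoid depending on a Python-level slice bound.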
2019-10-03 12:05:15 -04:00