HuggingFace_transformer/tests
Pradhy729 b25cec13c5 Feed forward chunking (#6024)
* Chunked feed forward for Bert

This is an initial implementation to test feed forward chunking for BERT.
It will need further changes based on the output and benchmark results.

* Black and cleanup

* Feed forward chunking in BertLayer class.

* Isort

* add chunking for all models

* fix docs

* Fix typo

Co-authored-by: patrickvonplaten <patrick.v.platen@gmail.com>
2020-08-11 03:12:45 -04:00
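
For readers unfamiliar with the technique named in this commit, the following is a minimal, dependency-free sketch of the idea and not the code from the PR. It assumes the feed-forward block is position-wise, meaning each token vector is transformed independently of the others. Under that assumption the sequence can be processed in slices and the results concatenated, which gives the same output while only one slice's intermediate activations are held at a time. The names `feed_forward`, `chunked_feed_forward`, `chunk_size`, `w_in` and `w_out` are hypothetical and chosen for illustration only.

```python
# Illustrative sketch only: plain Python lists stand in for tensors,
# and every name here is hypothetical rather than taken from the library.

def feed_forward(tokens, w_in, w_out):
    """Position-wise feed-forward: each token vector is transformed on its own."""
    outputs = []
    for x in tokens:
        # Intermediate activation with ReLU (the large buffer in a real model).
        hidden = [max(0.0, sum(xi * w for xi, w in zip(x, col))) for col in w_in]
        # Project back down to the output size.
        outputs.append([sum(h * w for h, w in zip(hidden, col)) for col in w_out])
    return outputs


def chunked_feed_forward(tokens, w_in, w_out, chunk_size):
    """Apply feed_forward to slices of the sequence and concatenate the results."""
    if chunk_size <= 0:
        # Assumption for this sketch: a non-positive size disables chunking.
        return feed_forward(tokens, w_in, w_out)
    outputs = []
    for start in range(0, len(tokens), chunk_size):
        chunk = tokens[start:start + chunk_size]
        outputs.extend(feed_forward(chunk, w_in, w_out))
    return outputs


# Toy data: 5 tokens of size 2, intermediate size 3, output size 2.
tokens = [[1.0, 2.0], [0.5, -1.0], [3.0, 0.0], [-2.0, 1.0], [0.0, 4.0]]
w_in = [[1.0, 0.0], [0.0, 1.0], [1.0, -1.0]]  # 3 weight vectors of length 2
w_out = [[1.0, 1.0, 1.0], [0.5, -0.5, 2.0]]   # 2 weight vectors of length 3

full = feed_forward(tokens, w_in, w_out)
chunked = chunked_feed_forward(tokens, w_in, w_out, chunk_size=2)
assert chunked == full  # chunking changes memory use, not the result
```

The trade-off in this sketch is that smaller chunks lower peak memory for the intermediate activation at the cost of more, smaller calls, which is presumably why the commit message defers further changes to benchmark results.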