Quentin Lhoest
fbd8792195
Add DPR model (#5279)
* beginning of dpr modeling
* wip
* implement forward
* remove biencoder + better init weights
* export dpr model to embed model for nlp lib
* add new api
* remove old code
* make style
* fix dumb typo
* don't load bert weights
* docs
* docs
* style
* move the `k` parameter
* fix init_weights
* add pretrained configs
* minor
* update config names
* style
* better config
* style
* clean code based on PR comments
* change Dpr to DPR
* fix config
* switch encoder config to a dict
* style
* inheritance -> composition
* add messages in assert startements
* add dpr reader tokenizer
* one tokenizer per model
* fix base_model_prefix
* fix imports
* typo
* add convert script
* docs
* change tokenizers conf names
* style
* change tokenizers conf names
* minor
* minor
* fix wrong names
* minor
* remove unused convert functions
* rename convert script
* use return_tensors in tokenizers
* remove n_questions dim
* move generate logic to tokenizer
* style
* add docs
* docs
* quality
* docs
* add tests
* style
* add tokenization tests
* DPR full tests
* Stay true to the attention mask building
* update docs
* missing param in bert input docs
* docs
* style
Co-authored-by: Lysandre <lysandre.debut@reseau.eseo.fr>
2020-07-07 08:56:12 -04:00
..
2020-06-12 14:20:19 -04:00
2020-06-22 14:43:52 -04:00
2020-06-12 15:47:57 -04:00
2020-04-18 13:43:57 +02:00
2020-06-08 11:28:19 -04:00
2020-06-08 11:28:19 -04:00
2020-04-28 14:32:31 +02:00
2020-06-15 18:31:41 -04:00
2020-07-07 08:56:12 -04:00
2020-06-10 15:17:52 -04:00
2020-06-16 16:50:02 -04:00
2020-06-08 11:28:19 -04:00
2020-04-18 13:43:57 +02:00
2020-06-17 14:01:10 -04:00
2020-06-08 11:28:19 -04:00
2020-06-08 11:28:19 -04:00
2020-06-22 14:43:52 -04:00
2020-07-01 22:43:18 +02:00
2020-06-16 16:36:58 -04:00
2020-06-08 11:28:19 -04:00
2020-06-18 09:16:29 +02:00
2020-04-18 13:43:57 +02:00
2020-06-08 11:28:19 -04:00
2020-06-08 21:22:37 -04:00
2020-06-08 11:28:19 -04:00