Files
HuggingFace_transformer/docs/source
Amit Garg e3775539c8 PhiMoE (#33363)
* onboard phimoe model

* removed debug code

* added unit tests

* updated docs

* formatted

* fixed unit tests

* fixed test case

* fixed format

* refactored code

* fixed expected outputs in the integration tests

* Added a warning msg

* Addressed comments

* Addressed comments

* fixed test cases

* added paper link

* Addressed comments

* Refactored PhimoeForCausalLM forward fn

* Refactored PhimoeRotaryEmbedding class

* fixed test cases

* fixed testcase

* fixed test case

* Addressed comments

* fixed test cases

* fixed testcases

* Used cache position instead to get the seq len
2024-10-04 21:39:45 +02:00
..
2024-06-26 21:59:08 +01:00
2024-10-04 21:39:45 +02:00
2024-04-16 11:58:55 +02:00
2024-04-23 16:06:20 +01:00
2023-11-08 08:35:20 -05:00