Add Qwen2MoE (#29377)
* add support for qwen2 MoE models
* update docs
* add support for qwen2 MoE models
* update docs
* update model name & test
* update readme
* update class names & readme & model_doc of Qwen2MoE.
* update architecture name
* fix qwen2_moe tests
* use Qwen2Tokenizer instead of Qwen2MoeTokenizer
* update modeling_qwen2_moe.py
* fix model architecture
* fix qwen2_moe tests
* use Qwen2Tokenizer instead of Qwen2MoeTokenizer
* update modeling_qwen2_moe.py
* fix model architecture
* fix style
* fix test when there are sparse and non sparse layers
* fixup
* Update README.md

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* fixup
* fixup
* add archive back
* add support for qwen2 MoE models
* update docs
* update model name & test
* update readme
* update class names & readme & model_doc of Qwen2MoE.
* update architecture name
* fix qwen2_moe tests
* use Qwen2Tokenizer instead of Qwen2MoeTokenizer
* update modeling_qwen2_moe.py
* fix model architecture
* fixup
* fix qwen2_moe tests
* use Qwen2Tokenizer instead of Qwen2MoeTokenizer
* fix style
* fix test when there are sparse and non sparse layers
* fixup
* add archive back
* fix integration test
* fixup

---------

Co-authored-by: bozheng-hit <dsoul0621@gmail.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>
@@ -201,6 +201,7 @@ docs/source/en/model_doc/prophetnet.md
 docs/source/en/model_doc/pvt.md
 docs/source/en/model_doc/qdqbert.md
 docs/source/en/model_doc/qwen2.md
+docs/source/en/model_doc/qwen2_moe.md
 docs/source/en/model_doc/rag.md
 docs/source/en/model_doc/realm.md
 docs/source/en/model_doc/reformer.md
@@ -759,6 +760,8 @@ src/transformers/models/qwen2/configuration_qwen2.py
 src/transformers/models/qwen2/modeling_qwen2.py
 src/transformers/models/qwen2/tokenization_qwen2.py
 src/transformers/models/qwen2/tokenization_qwen2_fast.py
+src/transformers/models/qwen2_moe/configuration_qwen2_moe.py
+src/transformers/models/qwen2_moe/modeling_qwen2_moe.py
 src/transformers/models/rag/configuration_rag.py
 src/transformers/models/rag/modeling_rag.py
 src/transformers/models/rag/modeling_tf_rag.py