[WIP] Emu3: add model (#33770)

* model can convert to HF and be loaded back

* nit

* works in single batch generation but hallucinates

* use the image tokens

* add image generation

* now it works

* add tests

* update

* add modulare but it doesn't work for porting docstring :(

* skip some tests

* add slow tests

* modular removed the import?

* guess this works

* update

* update

* fix copies

* fix test

* fix copies

* update

* docs

* fix tests

* last fix tests?

* pls

* repo consistency

* more style

* style

* remove file

* address comments

* tiny bits

* update after the new modular

* fix tests

* add one more cond in check attributes

* decompose down/up/mid blocks

* allow static cache generation in VLMs

* nit

* fix copies

* Update docs/source/en/model_doc/emu3.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/model_doc/emu3.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/model_doc/emu3.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/model_doc/emu3.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/model_doc/emu3.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/model_doc/emu3.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/model_doc/emu3.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* Update docs/source/en/model_doc/emu3.md

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

* fix VAE upsampling

* Update src/transformers/models/emu3/modular_emu3.py

Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

* address comments

* state overwritten stuff explicitly

* fix copies

* add the flag for flex attn

---------

Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Arthur <48595927+ArthurZucker@users.noreply.github.com>

This commit is contained in:

Raushan Turganbay

2025-01-10 12:23:00 +01:00

committed by

GitHub

parent ccc0381d36

commit 52e1f87c7d

28 changed files with 5722 additions and 5 deletions

									
										2

tests/test_modeling_common.py
									
												View File
												
				@@ -3894,7 +3894,7 @@ class ModelTesterMixin:

				                for name, submodule in model_eager.named_modules():

				                    class_name = submodule.__class__.__name__

				                    if "SdpaAttention" in class_name or "SdpaSelfAttention" in class_name:

				                        raise ValueError("The eager model should not have SDPA attention layers")

				                        raise ValueError(f"The eager model should not have SDPA attention layers but got {class_name}")

				    @require_torch_sdpa

				    def test_sdpa_can_dispatch_composite_models(self):

[WIP] Emu3: add model (#33770)

2 tests/test_modeling_common.py Unescape Escape View File

2

tests/test_modeling_common.py

View File