Yih-Dar
cd8a041a4f
Update expected values (after switching to A10) - part 7 ( #39218 )
...
* fix
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-07-04 12:48:10 +02:00
Yao Matrix
2100ee6545
fix UT failures on XPU w/ stock PyTorch 2.7 & 2.8 ( #39116 )
...
* fix UT failures on XPU w/ stock PyTorch 2.7 & 2.8
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
* zamba2
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
* xx
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
* internvl
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
* tp cases
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
---------
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
2025-06-30 11:49:03 +02:00
Cyril Vallez
4b8ec667e9
Remove all traces of low_cpu_mem_usage ( #38792 )
...
* remove it from all py files
* remove it from the doc
* remove it from examples
* style
* remove traces of _fast_init
* Update test_peft_integration.py
* CIs
2025-06-12 16:39:33 +02:00
Yih-Dar
ccc859620a
Fix Gemma2IntegrationTest ( #38492 )
...
* fix
* fix
* skip-ci
* skip-ci
* skip-ci
* skip-ci
* skip-ci
* skip-ci
* skip-ci
* skip-ci
* skip-ci
* skip-ci
* skip-ci
* update
* fix
* fix
---------
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-06-02 22:45:09 +02:00
Yao Matrix
fb82a98717
enable large_gpu and torchao cases on XPU ( #38355 )
...
* cohere2 done
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* enable torchao cases on XPU
Signed-off-by: Matrix YAO <matrix.yao@intel.com >
* fix
Signed-off-by: Matrix YAO <matrix.yao@intel.com >
* fix
Signed-off-by: Matrix YAO <matrix.yao@intel.com >
* fix
Signed-off-by: Matrix YAO <matrix.yao@intel.com >
* rename
Signed-off-by: Matrix YAO <matrix.yao@intel.com >
* fix
Signed-off-by: Matrix YAO <matrix.yao@intel.com >
* fix comments
Signed-off-by: Matrix YAO <matrix.yao@intel.com >
---------
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
Signed-off-by: Matrix YAO <matrix.yao@intel.com >
2025-05-28 10:30:16 +02:00
Yao Matrix
a5a0c7b888
switch to device agnostic device calling for test cases ( #38247 )
...
* use device agnostic APIs in test cases
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* fix style
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* add one more
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
* xpu now supports integer device id, aligning to CUDA behaviors
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* update to use device_properties
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* fix style
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* update comment
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* fix comments
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
* fix style
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
---------
Signed-off-by: Matrix Yao <matrix.yao@intel.com >
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
Co-authored-by: ydshieh <ydshieh@users.noreply.github.com >
2025-05-26 10:18:53 +02:00
Joao Gante
40a493c7ed
[tests] remove test_sdpa_equivalence (redundant) ( #37911 )
...
* rm test_sdpa_equivalence
* make fixup
---------
Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com >
2025-05-16 18:37:27 +01:00
cyyever
1e6b546ea6
Use Python 3.9 syntax in tests ( #37343 )
...
Signed-off-by: cyy <cyyever@outlook.com >
2025-04-08 14:12:08 +02:00
Joao Gante
179d02ffb8
[generate] ✨ vectorized beam search ✨ ( #35802 )
2025-03-18 18:39:36 +00:00
Joao Gante
fc8764c9a6
[Generation, Gemma 3] When passing a custom generation_config, overwrite default values with the model's base generation_config ( #36684 )
2025-03-15 12:40:09 +00:00
Joao Gante
42ebb6c23e
[tests] Parameterized test_eager_matches_sdpa_inference ( #36650 )
2025-03-14 14:41:27 +00:00
Joao Gante
62c7ea0201
CI: avoid human error, automatically infer generative models ( #33212 )
...
* tmp commit
* move tests to the right class
* remove ALL all_generative_model_classes = ...
* skip tf roberta
* skip InstructBlipForConditionalGenerationDecoderOnlyTest
* videollava
* reduce diff
* reduce diff
* remove on vlms
* fix a few more
* manual rebase bits
* more manual rebase
* remove all manual generative model class test entries
* fix up to ernie
* a few more removals
* handle remaining cases
* recurrent gemma
* it's better here
* make fixup
* tf idefics is broken
* tf bert + generate is broken
* don't touch tf :()
* don't touch tf :(
* make fixup
* better comments for test skips
* revert tf changes
* remove empty line removal
* one more
* missing one
2025-02-13 16:27:11 +01:00
Joao Gante
be2ac0916a
[generate] shape checks in tests compatible with fixed-length caches (+ some minor fixes) ( #35993 )
...
* shape checks compatible with static cache
* add test
* tmp
* manually turn on eager attn when we want to output attn
* typo
* generalize to encoder-decoder models
* force compilation on cpu
* tmp commit
* fix static cache shape checks
* models with odd caches
* fix copies
* shorter cache search loop
* use decoder_past_key_values everywhere
* better test variable names and comments
* signature
* rename _check_outputs into _check_generate_outputs
* add comments
* HybridCache future test note
2025-02-10 17:50:54 +00:00
Yaswanth Gali
7aee036e54
Iterative generation using Input embeds and past_key_values ( #35890 )
...
* Iterative generation using input embeds
* ruff fix
* Added Testcase
* Updated comment
* ♻️ Refactored testcase
* Skip test for these models
* Continue generation using input embeds and cache
* Skip generate_continue_from_embeds test
* Refactor `prepare_input_for_generation` func
* Continue generation using input embeds and cache
* Modular changes fix
* Overwrite 'prepare_inputs_for_generation' function
2025-02-06 11:06:05 +01:00
Cyril Vallez
3f860dba55
Fix mask slicing for models with HybridCache ( #35681 )
...
* correctly slice
* check mask
* Update modular_gemma2.py
* fix
* add tests
* fix typo
* finally fix mask slicing
* Finally correctly slice in all cases!!
* add test for all attention functions
* small fix in tests
* trick around dynamo tracing issue
* last update
* more robust
* kwargs propagation
* make it explicit for checkpointing
* apply modular
2025-01-28 14:35:00 +01:00
Cyril Vallez
ab1afd56f5
Fix some tests ( #35682 )
...
* cohere tests
* glm tests
* cohere2 model name
* create decorator
* update
* fix cohere2 completions
* style
* style
* style
* add cuda in comments
2025-01-17 12:10:43 +00:00
Joao Gante
94af1c0aa2
[generate] return Cache object even if passed in a legacy format ( #35673 )
...
* generate returns a Cache object by default
* fix tests
* fix test for encoder-decoder models
2025-01-16 17:06:24 +00:00
Cyril Vallez
3a4ae6eace
Refactor/fix Cohere2 ( #35594 )
...
* refactor/fix cohere2
* add kwargs
* tests
* remove func and import it
2025-01-09 17:54:57 +01:00
alexrs-cohere
64478c7631
Add Cohere2 model ( #35224 )
2024-12-13 09:35:50 +01:00