HuggingFace_transformer/docs/source/en/model_doc
Yu Chin Fabian Lim 9613933b02 Add the Bamba Model (#34982)
* initial commit for PR

Co-authored-by: Gabe Goodhart <gabe.l.hart@gmail.com>

* rename dynamic cache

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* add more unit tests

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* add integration test

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* add integration test

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* Add modular bamba file

* Remove trainer changes from unrelated PR

* Modify modular and config to get model running

* Fix some CI errors and beam search

* Fix a plethora of bugs from CI/docs/etc

* Add bamba to models with special caches

* Update to newer mamba PR for mamba sublayer

* fix test_left_padding_compatibility

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* fix style

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* fix remaining tests

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* missed this test

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* ran make style

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* move slow tag to integration obj

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* make style

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* address comments

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* fix modular

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* left out one part of modular

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* change model

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* Make Rotary modular as well

* Update bamba.md

Added overview, updated model inference card, and added config

* Update bamba.md

* Update bamba.md

* Update bamba.md

Minor fixes

* Add docs for config and model back

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

* Add warning when using fast kernels

* replaced generate example

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>

* Address comments from PR

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

* Propagate attention fixes

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

* Fix attention interfaces to the new API

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

* Fix API for decoder layer

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

* Remove extra weights

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

---------

Signed-off-by: Yu Chin Fabian Lim <flim@sg.ibm.com>
Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>
Co-authored-by: Gabe Goodhart <gabe.l.hart@gmail.com>
Co-authored-by: Antoni Viros i Martin <aviros@ibm.com>
Co-authored-by: divya-kumari32 <72085811+divya-kumari32@users.noreply.github.com>
Co-authored-by: Antoni Viros <ani300@gmail.com>
2024-12-18 20:18:17 +01:00