Files
HuggingFace_transformer/docs/source/en/model_doc
Gustavo de Rosa c9693db2fc Phi-3 (#30423)
* chore(root): Initial commit of Phi-3 files.

* fix(root): Fixes Phi-3 missing on readme.

* fix(root): Ensures files are consistent.

* fix(phi3): Fixes unit tests.

* fix(tests): Fixes style of phi-3 test file.

* chore(tests): Adds integration tests for Phi-3.

* fix(phi3): Removes additional flash-attention usage, .e.g, swiglu and rmsnorm.

* fix(phi3): Fixes incorrect docstrings.

* fix(phi3): Fixes docstring typos.

* fix(phi3): Adds support for Su and Yarn embeddings.

* fix(phi3): Improves according first batch of reviews.

* fix(phi3): Uses up_states instead of y in Phi3MLP.

* fix(phi3): Uses gemma rotary embedding to support torch.compile.

* fix(phi3): Improves how rotary embedding classes are defined.

* fix(phi3): Fixes inv_freq not being re-computed for extended RoPE.

* fix(phi3): Adds last suggestions to modeling file.

* fix(phi3): Splits inv_freq calculation in two lines.
2024-04-24 17:32:09 +02:00
..
2024-03-11 17:26:38 +00:00
2023-11-06 19:45:03 +00:00
2023-11-10 13:49:10 +00:00
2024-04-16 11:58:55 +02:00
2024-03-15 14:29:11 +01:00
2024-04-18 15:18:52 +02:00
2023-11-23 17:44:08 +00:00
2023-11-23 17:44:08 +00:00
2024-03-12 10:16:21 +00:00
2024-02-21 14:21:28 +01:00
2024-04-15 17:03:03 +01:00
2024-04-18 11:04:02 +02:00
2023-10-30 21:42:19 +01:00
2023-12-20 14:25:07 +05:30
2024-04-24 10:11:19 +02:00
2024-04-22 10:41:03 +01:00
2024-04-17 17:59:07 +02:00
2024-02-19 15:22:29 +01:00
2024-02-19 15:22:29 +01:00
2024-04-24 17:32:09 +02:00
2024-03-13 19:05:20 +00:00
2024-02-23 10:43:31 +01:00
2023-11-06 19:45:03 +00:00
2023-07-13 11:46:54 -04:00
2024-04-19 21:03:07 +02:00
2024-04-22 10:41:03 +01:00
2024-02-19 15:22:29 +01:00
2024-04-19 18:31:43 +01:00
2023-11-06 19:45:03 +00:00
2023-11-23 17:02:16 +00:00
2023-12-15 20:16:47 +01:00
2024-02-19 15:22:29 +01:00