Files
HuggingFace_transformer/docs/source/en/model_doc
Ao Tang 6a03942db7 Add Nemotron HF Support (#31699)
* Add nemotron support

* fix inference

* add unit test

* add layernorm1p as a class to avoid meta device mismatch

* test fixed

* Add copied_from statements

* remove pretraining_tp args

* remove nemotronlayernorm

* force LN computation done in FP32

* remove nemotrontokenizer and use llamatokenizer

* license update

* add option for kv_channels for minitron8b

* remove assert

* o_proj fixed

* o_proj reshape

* add gated_proj option

* typo

* remove todos

* fix broken test after merging latest main

* remove nezha/nat after meging main

* chnage default config to 15b model

* add nemo conversion script

* rename conversion script

* remove gate_proj option

* pr comment resolved

* fix unit test

* rename kv_channels to head_dim

* resolve PR issue

* add nemotron md

* fix broken tests

* refactor rope for nemotron

* test fix

* remove linearscaling

* whitespace and import

* fix some copied-from

* code style fix

* reformatted

* add position_embedding to nemotronattention

* rope refactor to only use config, copied-from fix

* format

* Run make fix-copies

* nemotron md with autodoc

* doc  fix

* fix order

* pass check_config_docstrings.py

* fix config_attributes

* remove all llama BC related code

* Use PreTrainedTokenizerFast

* ruff check examples

* conversion script update

* add nemotron to toctree
2024-08-06 15:42:05 +02:00
..
2024-04-26 16:23:44 +01:00
2024-06-04 18:29:45 +02:00
2024-08-02 09:54:16 +05:00
2023-11-06 19:45:03 +00:00
2024-07-18 10:30:37 +05:30
2023-11-10 13:49:10 +00:00
2024-03-15 14:29:11 +01:00
2024-05-28 18:07:07 +01:00
2023-11-23 17:44:08 +00:00
2024-05-28 18:07:07 +01:00
2024-03-12 10:16:21 +00:00
2024-06-19 09:40:57 +02:00
2024-05-21 19:38:02 +02:00
2024-05-13 15:59:46 +01:00
2024-04-18 11:04:02 +02:00
2024-05-14 16:32:01 +02:00
2024-05-28 18:07:07 +01:00
2023-10-30 21:42:19 +01:00
2023-12-20 14:25:07 +05:30
2024-06-27 10:32:51 -07:00
2024-05-28 18:07:07 +01:00
2024-05-28 18:07:07 +01:00
2024-08-06 15:42:05 +02:00
2024-05-28 18:07:07 +01:00
2024-04-17 17:59:07 +02:00
2024-05-14 08:36:11 +02:00
2024-02-19 15:22:29 +01:00
2024-02-19 15:22:29 +01:00
2024-03-13 19:05:20 +00:00
2024-05-28 18:07:07 +01:00
2024-07-24 11:54:41 +01:00
2024-05-28 18:07:07 +01:00
2023-11-06 19:45:03 +00:00
2023-07-13 11:46:54 -04:00
2024-07-10 16:34:53 +01:00
2024-05-31 12:34:29 +02:00
2024-04-19 18:31:43 +01:00
2023-11-06 19:45:03 +00:00
2024-05-28 18:07:07 +01:00
2023-11-23 17:02:16 +00:00
2024-06-11 15:47:38 +01:00
2024-05-28 18:07:07 +01:00
2024-07-08 11:43:33 +02:00