Helium documentation fixes (#36170)

* Helium documentation fixes
* Update helium.md
* Update helium.md
* Update helium.md
2025-02-13 12:20:53 +01:00
parent 8f137b2427
commit c82319b493
1 changed file with 4 additions and 8 deletions
--- a/docs/source/en/model_doc/helium.md
+++ b/docs/source/en/model_doc/helium.md
@@ -107,7 +107,7 @@ Tips:
  
 ## Usage tips

-`Helium` can be found on the [Huggingface Hub](https://huggingface.co/collections/kyutai/helium-1-preview)
+`Helium` can be found on the [Huggingface Hub](https://huggingface.co/models?other=helium)

 In the following, we demonstrate how to use `helium-1-preview` for the inference. 

@@ -115,16 +115,12 @@ In the following, we demonstrate how to use `helium-1-preview` for the inference
 >>> from transformers import AutoModelForCausalLM, AutoTokenizer
 >>> device = "cuda" # the device to load the model onto

->>> model = AutoModelForCausalLM.from_pretrained("helium-1-preview", device_map="auto")
->>> tokenizer = AutoTokenizer.from_pretrained("helium-1-preview")
+>>> model = AutoModelForCausalLM.from_pretrained("kyutai/helium-1-preview-2b", device_map="auto")
+>>> tokenizer = AutoTokenizer.from_pretrained("kyutai/helium-1-preview-2b")

 >>> prompt = "Give me a short introduction to large language model."

->>> messages = [{"role": "user", "content": prompt}]
-
->>> text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
-
->>> model_inputs = tokenizer([text], return_tensors="pt").to(device)
+>>> model_inputs = tokenizer(prompt, return_tensors="pt").to(device)

 >>> generated_ids = model.generate(model_inputs.input_ids, max_new_tokens=512, do_sample=True)