From a6ae2bd059563505c6e08765295ac96ae056bfbd Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?=EA=B9=80=EC=A4=80=EC=9E=AC=5FT3056?= <55151385+junejae@users.noreply.github.com> Date: Thu, 14 Sep 2023 00:27:41 +0900 Subject: [PATCH] docs: feat: add llama2 notebook resources from OSSCA community (#26076) --- docs/source/en/model_doc/llama2.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/docs/source/en/model_doc/llama2.md b/docs/source/en/model_doc/llama2.md index 5f8428b13c..11cb26f0dc 100644 --- a/docs/source/en/model_doc/llama2.md +++ b/docs/source/en/model_doc/llama2.md @@ -78,6 +78,10 @@ A list of official Hugging Face and community (indicated by 🌎) resources to h - A [notebook](https://colab.research.google.com/drive/1PEQyJO1-f6j0S_XJ8DV50NkpzasXkrzd?usp=sharing) on how to fine-tune Llama 2 in Google Colab using QLoRA and 4-bit precision. 🌎 - A [notebook](https://colab.research.google.com/drive/134o_cXcMe_lsvl15ZE_4Y75Kstepsntu?usp=sharing) on how to fine-tune the "Llama-v2-7b-guanaco" model with 4-bit QLoRA and generate Q&A datasets from PDFs. 🌎 + + +- A [notebook](https://colab.research.google.com/drive/1ggaa2oRFphdBmqIjSEbnb_HGkcIRC2ZB?usp=sharing) on how to fine-tune the Llama 2 model with QLoRa, TRL, and Korean text classification dataset. 🌎🇰🇷 + ⚗️ Optimization - [Fine-tune Llama 2 with DPO](https://huggingface.co/blog/dpo-trl), a guide to using the TRL library's DPO method to fine tune Llama 2 on a specific dataset. - [Extended Guide: Instruction-tune Llama 2](https://www.philschmid.de/instruction-tune-llama-2), a guide to training Llama 2 to generate instructions from inputs, transforming the model from instruction-following to instruction-giving.