Update TF fine-tuning docs (#18654)

* Update TF fine-tuning docs * Fix formatting * Add some section headers so the right sidebar works better * Squiggly it * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/training.mdx Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com> * Explain things in the text, not the comments * Make the two dataset creation methods into a list * Move the advice about collation out of a <Tip> * Edits for clarity * Edits for clarity * Edits for clarity * Replace `to_tf_dataset` with `prepare_tf_dataset` in the fine-tuning pages * Restructure the page a little bit * Restructure the page a little bit * Restructure the page a little bit Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
2022-09-07 13:30:07 +01:00
parent d842f2d5b9
commit 2b9513fdab
8 changed files with 130 additions and 87 deletions
--- a/docs/source/en/tasks/language_modeling.mdx
+++ b/docs/source/en/tasks/language_modeling.mdx
@@ -245,20 +245,18 @@ At this point, only three steps remain:
 ```
 </pt>
 <tf>
-To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~datasets.Dataset.to_tf_dataset`]. Specify inputs and labels in `columns`, whether to shuffle the dataset order, batch size, and the data collator:
+To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~TFPreTrainedModel.prepare_tf_dataset`].

 ```py
->>> tf_train_set = lm_dataset["train"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "labels"],
-...     dummy_labels=True,
+>>> tf_train_set = model.prepare_tf_dataset(
+...     lm_dataset["train"],
 ...     shuffle=True,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
 ... )

->>> tf_test_set = lm_dataset["test"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "labels"],
-...     dummy_labels=True,
+>>> tf_test_set = model.prepare_tf_dataset(
+...     lm_dataset["test"],
 ...     shuffle=False,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
@@ -352,20 +350,18 @@ At this point, only three steps remain:
 ```
 </pt>
 <tf>
-To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~datasets.Dataset.to_tf_dataset`]. Specify inputs and labels in `columns`, whether to shuffle the dataset order, batch size, and the data collator:
+To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~TFPreTrainedModel.prepare_tf_dataset`].

 ```py
->>> tf_train_set = lm_dataset["train"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "labels"],
-...     dummy_labels=True,
+>>> tf_train_set = model.prepare_tf_dataset(
+...     lm_dataset["train"],
 ...     shuffle=True,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
 ... )

->>> tf_test_set = lm_dataset["test"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "labels"],
-...     dummy_labels=True,
+>>> tf_test_set = model.prepare_tf_dataset(
+...     lm_dataset["test"],
 ...     shuffle=False,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
--- a/docs/source/en/tasks/multiple_choice.mdx
+++ b/docs/source/en/tasks/multiple_choice.mdx
@@ -224,21 +224,19 @@ At this point, only three steps remain:
 ```
 </pt>
 <tf>
-To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~datasets.Dataset.to_tf_dataset`]. Specify inputs in `columns`, targets in `label_cols`, whether to shuffle the dataset order, batch size, and the data collator:
+To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~TFPreTrainedModel.prepare_tf_dataset`].

 ```py
 >>> data_collator = DataCollatorForMultipleChoice(tokenizer=tokenizer)
->>> tf_train_set = tokenized_swag["train"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids"],
-...     label_cols=["labels"],
+>>> tf_train_set = model.prepare_tf_dataset(
+...     tokenized_swag["train"],
 ...     shuffle=True,
 ...     batch_size=batch_size,
 ...     collate_fn=data_collator,
 ... )

->>> tf_validation_set = tokenized_swag["validation"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids"],
-...     label_cols=["labels"],
+>>> tf_validation_set = model.prepare_tf_dataset(
+...     tokenized_swag["validation"],
 ...     shuffle=False,
 ...     batch_size=batch_size,
 ...     collate_fn=data_collator,
@@ -273,10 +271,7 @@ Load BERT with [`TFAutoModelForMultipleChoice`]:
 Configure the model for training with [`compile`](https://keras.io/api/models/model_training_apis/#compile-method):

 ```py
->>> model.compile(
-...     optimizer=optimizer,
-...     loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
-... )
+>>> model.compile(optimizer=optimizer)
 ```

 Call [`fit`](https://keras.io/api/models/model_training_apis/#fit-method) to fine-tune the model:
--- a/docs/source/en/tasks/question_answering.mdx
+++ b/docs/source/en/tasks/question_answering.mdx
@@ -199,20 +199,18 @@ At this point, only three steps remain:
 ```
 </pt>
 <tf>
-To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~datasets.Dataset.to_tf_dataset`]. Specify inputs and the start and end positions of an answer in `columns`, whether to shuffle the dataset order, batch size, and the data collator:
+To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~TFPreTrainedModel.prepare_tf_dataset`].

 ```py
->>> tf_train_set = tokenized_squad["train"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "start_positions", "end_positions"],
-...     dummy_labels=True,
+>>> tf_train_set = model.prepare_tf_dataset(
+...     tokenized_squad["train"],
 ...     shuffle=True,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
 ... )

->>> tf_validation_set = tokenized_squad["validation"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "start_positions", "end_positions"],
-...     dummy_labels=True,
+>>> tf_validation_set = model.prepare_tf_dataset(
+...     tokenized_squad["validation"],
 ...     shuffle=False,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
--- a/docs/source/en/tasks/sequence_classification.mdx
+++ b/docs/source/en/tasks/sequence_classification.mdx
@@ -144,18 +144,19 @@ At this point, only three steps remain:
 </Tip>
 </pt>
 <tf>
-To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~datasets.Dataset.to_tf_dataset`]. Specify inputs and labels in `columns`, whether to shuffle the dataset order, batch size, and the data collator:
+To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~TFPreTrainedModel.prepare_tf_dataset`].
+

 ```py
->>> tf_train_set = tokenized_imdb["train"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "label"],
+>>> tf_train_set = model.prepare_tf_dataset(
+...     tokenized_imdb["train"],
 ...     shuffle=True,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
 ... )

->>> tf_validation_set = tokenized_imdb["test"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "label"],
+>>> tf_validation_set = model.prepare_tf_dataset(
+...     tokenized_imdb["test"],
 ...     shuffle=False,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
--- a/docs/source/en/tasks/summarization.mdx
+++ b/docs/source/en/tasks/summarization.mdx
@@ -159,18 +159,18 @@ At this point, only three steps remain:
 ```
 </pt>
 <tf>
-To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~datasets.Dataset.to_tf_dataset`]. Specify inputs and labels in `columns`, whether to shuffle the dataset order, batch size, and the data collator:
+To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~TFPreTrainedModel.prepare_tf_dataset`].

 ```py
->>> tf_train_set = tokenized_billsum["train"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "labels"],
+>>> tf_train_set = model.prepare_tf_dataset(
+...     tokenized_billsum["train"],
 ...     shuffle=True,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
 ... )

->>> tf_test_set = tokenized_billsum["test"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "labels"],
+>>> tf_test_set = model.prepare_tf_dataset(
+...     tokenized_billsum["test"],
 ...     shuffle=False,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
--- a/docs/source/en/tasks/token_classification.mdx
+++ b/docs/source/en/tasks/token_classification.mdx
@@ -199,18 +199,18 @@ At this point, only three steps remain:
 ```
 </pt>
 <tf>
-To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~datasets.Dataset.to_tf_dataset`]. Specify inputs and labels in `columns`, whether to shuffle the dataset order, batch size, and the data collator:
+To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~TFPreTrainedModel.prepare_tf_dataset`].

 ```py
->>> tf_train_set = tokenized_wnut["train"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "labels"],
+>>> tf_train_set = model.prepare_tf_dataset(
+...     tokenized_wnut["train"],
 ...     shuffle=True,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
 ... )

->>> tf_validation_set = tokenized_wnut["validation"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "labels"],
+>>> tf_validation_set = model.prepare_tf_dataset(
+...     tokenized_wnut["validation"],
 ...     shuffle=False,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
--- a/docs/source/en/tasks/translation.mdx
+++ b/docs/source/en/tasks/translation.mdx
@@ -175,18 +175,18 @@ At this point, only three steps remain:
 ```
 </pt>
 <tf>
-To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~datasets.Dataset.to_tf_dataset`]. Specify inputs and labels in `columns`, whether to shuffle the dataset order, batch size, and the data collator:
+To fine-tune a model in TensorFlow, start by converting your datasets to the `tf.data.Dataset` format with [`~TFPreTrainedModel.prepare_tf_dataset`].

 ```py
->>> tf_train_set = tokenized_books["train"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "labels"],
+>>> tf_train_set = model.prepare_tf_dataset(
+...     tokenized_books["train"],
 ...     shuffle=True,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
 ... )

->>> tf_test_set = tokenized_books["test"].to_tf_dataset(
-...     columns=["attention_mask", "input_ids", "labels"],
+>>> tf_test_set = model.prepare_tf_dataset(
+...     tokenized_books["test"],
 ...     shuffle=False,
 ...     batch_size=16,
 ...     collate_fn=data_collator,
@@ -216,7 +216,7 @@ Configure the model for training with [`compile`](https://keras.io/api/models/mo
 Call [`fit`](https://keras.io/api/models/model_training_apis/#fit-method) to fine-tune the model:

 ```py
->>> model.fit(x=tf_train_set, validation_data=tf_test_set, epochs=3)
+>>> model.fit(tf_train_set, validation_data=tf_test_set, epochs=3)
 ```
 </tf>
 </frameworkcontent>