From 16d4acbfdb547cb922361ba07a13de12e1503fb8 Mon Sep 17 00:00:00 2001
From: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Date: Fri, 28 Jan 2022 19:01:37 -0600
Subject: [PATCH] Get started docs (#15098)
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
* clean commit of changes
* apply review feedback, make edits
* fix backticks, minor formatting
* π make fixup and minor edits
* π fix # in header
* π update code sample without from_pt
* π final review
---
docs/source/index.mdx | 52 +----
docs/source/installation.mdx | 211 ++++++++++-------
docs/source/quicktour.mdx | 433 +++++++++++++----------------------
3 files changed, 286 insertions(+), 410 deletions(-)
diff --git a/docs/source/index.mdx b/docs/source/index.mdx
index 3192ce1142..562a94121a 100644
--- a/docs/source/index.mdx
+++ b/docs/source/index.mdx
@@ -12,25 +12,18 @@ specific language governing permissions and limitations under the License.
# π€ Transformers
-State-of-the-art Machine Learning for Jax, Pytorch and TensorFlow
+State-of-the-art Machine Learning for PyTorch, TensorFlow and JAX.
-π€ Transformers (formerly known as _pytorch-transformers_ and _pytorch-pretrained-bert_) provides thousands of pretrained models to perform tasks on different modalities such as text, vision, and audio.
+π€ Transformers provides APIs to easily download and train state-of-the-art pretrained models. Using pretrained models can reduce your compute costs, carbon footprint, and save you time from training a model from scratch. The models can be used across different modalities such as:
-These models can applied on:
+* π Text: text classification, information extraction, question answering, summarization, translation, and text generation in over 100 languages.
+* πΌοΈ Images: image classification, object detection, and segmentation.
+* π£οΈ Audio: speech recognition and audio classification.
+* π Multimodal: table question answering, optical character recognition, information extraction from scanned documents, video classification, and visual question answering.
-* π Text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages.
-* πΌοΈ Images, for tasks like image classification, object detection, and segmentation.
-* π£οΈ Audio, for tasks like speech recognition and audio classification.
+Our library supports seamless integration between three of the most popular deep learning libraries: [PyTorch](https://pytorch.org/), [TensorFlow](https://www.tensorflow.org/) and [JAX](https://jax.readthedocs.io/en/latest/). Train your model in three lines of code in one framework, and load it for inference with another.
-Transformer models can also perform tasks on **several modalities combined**, such as table question answering, optical character recognition, information extraction from scanned documents, video classification, and visual question answering.
-
-π€ Transformers provides APIs to quickly download and use those pretrained models on a given text, fine-tune them on your own datasets and then share them with the community on our [model hub](https://huggingface.co/models). At the same time, each python module defining an architecture is fully standalone and can be modified to enable quick research experiments.
-
-π€ Transformers is backed by the three most popular deep learning libraries β [Jax](https://jax.readthedocs.io/en/latest/), [PyTorch](https://pytorch.org/) and [TensorFlow](https://www.tensorflow.org/) β with a seamless integration between them. It's straightforward to train your models with one before loading them for inference with the other.
-
-This is the documentation of our repository [transformers](https://github.com/huggingface/transformers). You can
-also follow our [online course](https://huggingface.co/course) that teaches how to use this library, as well as the
-other libraries developed by Hugging Face and the Hub.
+Each π€ Transformers architecture is defined in a standalone Python module so they can be easily customized for research and experiments.
## If you are looking for custom support from the Hugging Face team
@@ -38,35 +31,6 @@ other libraries developed by Hugging Face and the Hub.
-## Features
-
-1. Easy-to-use state-of-the-art models:
- - High performance on natural language understanding & generation, computer vision, and audio tasks.
- - Low barrier to entry for educators and practitioners.
- - Few user-facing abstractions with just three classes to learn.
- - A unified API for using all our pretrained models.
-
-1. Lower compute costs, smaller carbon footprint:
- - Researchers can share trained models instead of always retraining.
- - Practitioners can reduce compute time and production costs.
- - Dozens of architectures with over 20,000 pretrained models, some in more than 100 languages.
-
-1. Choose the right framework for every part of a model's lifetime:
- - Train state-of-the-art models in 3 lines of code.
- - Move a single model between TF2.0/PyTorch/JAX frameworks at will.
- - Seamlessly pick the right framework for training, evaluation and production.
-
-1. Easily customize a model or an example to your needs:
- - We provide examples for each architecture to reproduce the results published by its original authors.
- - Model internals are exposed as consistently as possible.
- - Model files can be used independently of the library for quick experiments.
-
-[All the model checkpoints](https://huggingface.co/models) are seamlessly integrated from the huggingface.co [model
-hub](https://huggingface.co) where they are uploaded directly by [users](https://huggingface.co/users) and
-[organizations](https://huggingface.co/organizations).
-
-Current number of checkpoints:
-
## Contents
The documentation is organized in five parts:
diff --git a/docs/source/installation.mdx b/docs/source/installation.mdx
index 0b1a710434..cd48721663 100644
--- a/docs/source/installation.mdx
+++ b/docs/source/installation.mdx
@@ -1,5 +1,5 @@