From 16d4acbfdb547cb922361ba07a13de12e1503fb8 Mon Sep 17 00:00:00 2001 From: Steven Liu <59462357+stevhliu@users.noreply.github.com> Date: Fri, 28 Jan 2022 19:01:37 -0600 Subject: [PATCH] Get started docs (#15098) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit * clean commit of changes * apply review feedback, make edits * fix backticks, minor formatting * πŸ– make fixup and minor edits * πŸ– fix # in header * πŸ“ update code sample without from_pt * πŸ“ final review --- docs/source/index.mdx | 52 +---- docs/source/installation.mdx | 211 ++++++++++------- docs/source/quicktour.mdx | 433 +++++++++++++---------------------- 3 files changed, 286 insertions(+), 410 deletions(-) diff --git a/docs/source/index.mdx b/docs/source/index.mdx index 3192ce1142..562a94121a 100644 --- a/docs/source/index.mdx +++ b/docs/source/index.mdx @@ -12,25 +12,18 @@ specific language governing permissions and limitations under the License. # πŸ€— Transformers -State-of-the-art Machine Learning for Jax, Pytorch and TensorFlow +State-of-the-art Machine Learning for PyTorch, TensorFlow and JAX. -πŸ€— Transformers (formerly known as _pytorch-transformers_ and _pytorch-pretrained-bert_) provides thousands of pretrained models to perform tasks on different modalities such as text, vision, and audio. +πŸ€— Transformers provides APIs to easily download and train state-of-the-art pretrained models. Using pretrained models can reduce your compute costs, carbon footprint, and save you time from training a model from scratch. The models can be used across different modalities such as: -These models can applied on: +* πŸ“ Text: text classification, information extraction, question answering, summarization, translation, and text generation in over 100 languages. +* πŸ–ΌοΈ Images: image classification, object detection, and segmentation. +* πŸ—£οΈ Audio: speech recognition and audio classification. +* πŸ™ Multimodal: table question answering, optical character recognition, information extraction from scanned documents, video classification, and visual question answering. -* πŸ“ Text, for tasks like text classification, information extraction, question answering, summarization, translation, text generation, in over 100 languages. -* πŸ–ΌοΈ Images, for tasks like image classification, object detection, and segmentation. -* πŸ—£οΈ Audio, for tasks like speech recognition and audio classification. +Our library supports seamless integration between three of the most popular deep learning libraries: [PyTorch](https://pytorch.org/), [TensorFlow](https://www.tensorflow.org/) and [JAX](https://jax.readthedocs.io/en/latest/). Train your model in three lines of code in one framework, and load it for inference with another. -Transformer models can also perform tasks on **several modalities combined**, such as table question answering, optical character recognition, information extraction from scanned documents, video classification, and visual question answering. - -πŸ€— Transformers provides APIs to quickly download and use those pretrained models on a given text, fine-tune them on your own datasets and then share them with the community on our [model hub](https://huggingface.co/models). At the same time, each python module defining an architecture is fully standalone and can be modified to enable quick research experiments. - -πŸ€— Transformers is backed by the three most popular deep learning libraries β€” [Jax](https://jax.readthedocs.io/en/latest/), [PyTorch](https://pytorch.org/) and [TensorFlow](https://www.tensorflow.org/) β€” with a seamless integration between them. It's straightforward to train your models with one before loading them for inference with the other. - -This is the documentation of our repository [transformers](https://github.com/huggingface/transformers). You can -also follow our [online course](https://huggingface.co/course) that teaches how to use this library, as well as the -other libraries developed by Hugging Face and the Hub. +Each πŸ€— Transformers architecture is defined in a standalone Python module so they can be easily customized for research and experiments. ## If you are looking for custom support from the Hugging Face team @@ -38,35 +31,6 @@ other libraries developed by Hugging Face and the Hub. HuggingFace Expert Acceleration Program
-## Features - -1. Easy-to-use state-of-the-art models: - - High performance on natural language understanding & generation, computer vision, and audio tasks. - - Low barrier to entry for educators and practitioners. - - Few user-facing abstractions with just three classes to learn. - - A unified API for using all our pretrained models. - -1. Lower compute costs, smaller carbon footprint: - - Researchers can share trained models instead of always retraining. - - Practitioners can reduce compute time and production costs. - - Dozens of architectures with over 20,000 pretrained models, some in more than 100 languages. - -1. Choose the right framework for every part of a model's lifetime: - - Train state-of-the-art models in 3 lines of code. - - Move a single model between TF2.0/PyTorch/JAX frameworks at will. - - Seamlessly pick the right framework for training, evaluation and production. - -1. Easily customize a model or an example to your needs: - - We provide examples for each architecture to reproduce the results published by its original authors. - - Model internals are exposed as consistently as possible. - - Model files can be used independently of the library for quick experiments. - -[All the model checkpoints](https://huggingface.co/models) are seamlessly integrated from the huggingface.co [model -hub](https://huggingface.co) where they are uploaded directly by [users](https://huggingface.co/users) and -[organizations](https://huggingface.co/organizations). - -Current number of checkpoints: - ## Contents The documentation is organized in five parts: diff --git a/docs/source/installation.mdx b/docs/source/installation.mdx index 0b1a710434..cd48721663 100644 --- a/docs/source/installation.mdx +++ b/docs/source/installation.mdx @@ -1,5 +1,5 @@