Release: v4.29.2

Build with non Python files (#23405 )
* Add a test of the built release * Polish everything * Trigger CI
2023-05-16 14:24:29 -04:00 · 2023-05-16 14:23:55 -04:00 · 2023-05-11 15:32:41 -04:00 · 2023-05-11 14:42:49 -04:00 · 2023-05-11 14:41:53 -04:00 · 2023-05-11 14:41:48 -04:00
11 changed files with 112 additions and 52 deletions
--- a/MANIFEST.in
+++ b/MANIFEST.in
@@ -1 +0,0 @@
-include LICENSE
--- a/7
+++ b/7
@@ -111,3 +111,10 @@ post-release:

 post-patch:
 	python utils/release.py --post_release --patch
+
+build-release:
+	rm -rf dist
+	rm -rf build
+	python setup.py bdist_wheel
+	python setup.py sdist
+	python utils/check_build.py
--- a/docs/source/en/custom_tools.mdx
+++ b/docs/source/en/custom_tools.mdx
@@ -503,7 +503,7 @@ print("\n".join([f"- {a}" for a in agent.toolbox.keys()]))

 Note how `image_upscaler` is now part of the agents' toolbox.

-Let's now try out the new tools! We will re-use the image we generated in (Transformers Agents Quickstart)[./transformers_agents#single-execution-run].
+Let's now try out the new tools! We will re-use the image we generated in [Transformers Agents Quickstart](./transformers_agents#single-execution-run).

 ```py
 from diffusers.utils import load_image
@@ -726,7 +726,7 @@ We pass that instance to the `Tool.from_gradio` method:
 ```python
 from transformers import Tool

-tool = Tool.from_gradio(gradio_tools)
+tool = Tool.from_gradio(gradio_tool)
 ```

 Now we can manage it exactly as we would a usual custom tool. We leverage it to improve our prompt
--- a/docs/source/en/main_classes/agent.mdx
+++ b/docs/source/en/main_classes/agent.mdx
@@ -19,7 +19,7 @@ can vary as the APIs or underlying models are prone to change.

 </Tip>

-To learn more about agents and tools make sure to read the [introductory guide](../agents_and_tools). This page
+To learn more about agents and tools make sure to read the [introductory guide](../transformers_agents). This page
 contains the API docs for the underlying classes.

 ## Agents
--- a/docs/source/en/transformers_agents.mdx
+++ b/docs/source/en/transformers_agents.mdx
@@ -65,7 +65,17 @@ We provide support for openAI models as well as opensource alternatives from Big
 models perform better (but require you to have an openAI API key, so cannot be used for free); Hugging Face is
 providing free access to endpoints for BigCode and OpenAssistant models.

-To use openAI models, you instantiate an [`OpenAiAgent`]:
+To start with, please install the `agents` extras in order to install all default dependencies.
+```bash
+pip install transformers[agents]
+```
+
+To use openAI models, you instantiate an [`OpenAiAgent`] after installing the `openai` dependency:
+
+```bash
+pip install openai
+```
+

 ```py
 from transformers import OpenAiAgent
@@ -256,16 +266,16 @@ with the code generated by the agent.
 We identify a set of tools that can empower such agents. Here is an updated list of the tools we have integrated 
 in `transformers`:

- **Document question answering**: given a document (such as a PDF) in image format, answer a question on this document ([Donut](../model_doc/donut))
- **Text question answering**: given a long text and a question, answer the question in the text ([Flan-T5](../model_doc/flan-t5))
- **Unconditional image captioning**: Caption the image! ([BLIP](../model_doc/blip))
- **Image question answering**: given an image, answer a question on this image ([VILT](../model_doc/vilt))
- **Image segmentation**: given an image and a prompt, output the segmentation mask of that prompt ([CLIPSeg](../model_doc/clipseg))
- **Speech to text**: given an audio recording of a person talking, transcribe the speech into text ([Whisper](../model_doc/whisper))
- **Text to speech**: convert text to speech ([SpeechT5](../model_doc/speecht5))
- **Zero-shot text classification**: given a text and a list of labels, identify to which label the text corresponds the most ([BART](../model_doc/bart))
- **Text summarization**: summarize a long text in one or a few sentences ([BART](../model_doc/bart))
- **Translation**: translate the text into a given language ([NLLB](../model_doc/nllb))
+- **Document question answering**: given a document (such as a PDF) in image format, answer a question on this document ([Donut](./model_doc/donut))
+- **Text question answering**: given a long text and a question, answer the question in the text ([Flan-T5](./model_doc/flan-t5))
+- **Unconditional image captioning**: Caption the image! ([BLIP](./model_doc/blip))
+- **Image question answering**: given an image, answer a question on this image ([VILT](./model_doc/vilt))
+- **Image segmentation**: given an image and a prompt, output the segmentation mask of that prompt ([CLIPSeg](./model_doc/clipseg))
+- **Speech to text**: given an audio recording of a person talking, transcribe the speech into text ([Whisper](./model_doc/whisper))
+- **Text to speech**: convert text to speech ([SpeechT5](./model_doc/speecht5))
+- **Zero-shot text classification**: given a text and a list of labels, identify to which label the text corresponds the most ([BART](./model_doc/bart))
+- **Text summarization**: summarize a long text in one or a few sentences ([BART](./model_doc/bart))
+- **Translation**: translate the text into a given language ([NLLB](./model_doc/nllb))

 These tools have an integration in transformers, and can be used manually as well, for example:

@@ -283,7 +293,7 @@ the ability to quickly create and share custom tools.

 By pushing the code of a tool to a Hugging Face Space or a model repository, you're then able to leverage the tool 
 directly with the agent. We've added a few 
-**transformers-agnostic** tools to the `huggingface-tools` organization:
+**transformers-agnostic** tools to the [`huggingface-tools` organization](https://huggingface.co/huggingface-tools):

 - **Text downloader**: to download a text from a web URL
 - **Text to image**: generate an image according to a prompt, leveraging stable diffusion
@@ -294,7 +304,7 @@ The text-to-image tool we have been using since the beginning is a remote tool t
 [*huggingface-tools/text-to-image*](https://huggingface.co/spaces/huggingface-tools/text-to-image)! We will
 continue releasing such tools on this and other organizations, to further supercharge this implementation.

-The agents have by default access to tools that reside on `huggingface-tools`.
+The agents have by default access to tools that reside on [`huggingface-tools`](https://huggingface.co/huggingface-tools).
 We explain how to you can write and share your tools as well as leverage any custom tool that resides on the Hub in [following guide](custom_tools).

 ### Code generation
--- a/setup.py
+++ b/setup.py
@@ -38,14 +38,9 @@ To create the package for pypi.
 7. Build both the sources and the wheel. Do not change anything in setup.py between
   creating the wheel and the source distribution (obviously).

-   Clean up your build and dist folders (to avoid re-uploading oldies):
-   rm -rf dist
-   rm -rf build
+   Run `make build-release`. This will build the release and do some sanity checks for you. If this ends with an error
+   message, you need to fix things before going further.

-   For the wheel, run: "python setup.py bdist_wheel" in the top level directory.
-   (this will build a wheel for the python version you use to build it).
-
-   For the sources, run: "python setup.py sdist"
   You should now have a /dist directory with both .whl and .tar.gz source versions.

 8. Check that everything looks correct by uploading the package to the pypi test server:
@@ -61,6 +56,7 @@ To create the package for pypi.
   Check you can run the following commands:
   python -c "from transformers import pipeline; classifier = pipeline('text-classification'); print(classifier('What a nice release'))"
   python -c "from transformers import *"
+   python utils/check_build.py --check_lib

   If making a patch release, double check the bug you are patching is indeed resolved.

@@ -112,6 +108,7 @@ _deps = [
    "datasets!=2.5.0",
    "decord==0.6.0",
    "deepspeed>=0.8.3",
+    "diffusers",
    "dill<0.3.5",
    "evaluate>=0.2.0",
    "fairscale>0.3",
@@ -123,7 +120,7 @@ _deps = [
    "fugashi>=1.0",
    "GitPython<3.1.19",
    "hf-doc-builder>=0.3.0",
-    "huggingface-hub>=0.11.0,<1.0",
+    "huggingface-hub>=0.14.1,<1.0",
    "importlib_metadata",
    "ipadic>=1.0.0,<2.0",
    "isort>=5.5.4",
@@ -140,6 +137,7 @@ _deps = [
    "onnxconverter-common",
    "onnxruntime-tools>=1.4.2",
    "onnxruntime>=1.4.0",
+    "opencv-python",
    "optuna",
    "optax>=0.0.8,<=0.1.4",
    "packaging>=20.0",
@@ -412,6 +410,10 @@ extras["torchhub"] = deps_list(
    "tqdm",
 )

+extras["agents"] = deps_list(
+    "diffusers", "accelerate", "datasets", "torch", "sentencepiece", "opencv-python", "Pillow"
+)
+
 # when modifying the following list, make sure to update src/transformers/dependency_versions_check.py
 install_requires = [
    deps["importlib_metadata"] + ";python_version<'3.8'",  # importlib_metadata for Python versions that don't have it
@@ -428,7 +430,7 @@ install_requires = [

 setup(
    name="transformers",
-    version="4.29.0",  # expected format is one of x.y.z.dev0, or x.y.z.rc1 or x.y.z (no to dashes, yes to dots)
+    version="4.29.2",  # expected format is one of x.y.z.dev0, or x.y.z.rc1 or x.y.z (no to dashes, yes to dots)
    author="The Hugging Face team (past and future) with the help of all our contributors (https://github.com/huggingface/transformers/graphs/contributors)",
    author_email="transformers@huggingface.co",
    description="State-of-the-art Machine Learning for JAX, PyTorch and TensorFlow",
@@ -440,7 +442,7 @@ setup(
    package_dir={"": "src"},
    packages=find_packages("src"),
    include_package_data=True,
-    package_data={"transformers": ["*.cu", "*.cpp", "*.cuh", "*.h", "*.pyx"]},
+    package_data={"": ["**/*.cu", "**/*.cpp", "**/*.cuh", "**/*.h", "**/*.pyx"]},
    zip_safe=False,
    extras_require=extras,
    entry_points={"console_scripts": ["transformers-cli=transformers.commands.transformers_cli:main"]},
--- a/src/transformers/init.py
+++ b/src/transformers/init.py
@@ -18,7 +18,7 @@
 # to defer the actual importing for when the objects are requested. This way `import transformers` provides the names
 # in the namespace without actually importing anything (and especially none of the backends).

-__version__ = "4.29.0"
+__version__ = "4.29.2"

 from typing import TYPE_CHECKING

--- a/src/transformers/dependency_versions_table.py
+++ b/src/transformers/dependency_versions_table.py
@@ -13,6 +13,7 @@ deps = {
    "datasets": "datasets!=2.5.0",
    "decord": "decord==0.6.0",
    "deepspeed": "deepspeed>=0.8.3",
+    "diffusers": "diffusers",
    "dill": "dill<0.3.5",
    "evaluate": "evaluate>=0.2.0",
    "fairscale": "fairscale>0.3",
@@ -24,7 +25,7 @@ deps = {
    "fugashi": "fugashi>=1.0",
    "GitPython": "GitPython<3.1.19",
    "hf-doc-builder": "hf-doc-builder>=0.3.0",
-    "huggingface-hub": "huggingface-hub>=0.11.0,<1.0",
+    "huggingface-hub": "huggingface-hub>=0.14.1,<1.0",
    "importlib_metadata": "importlib_metadata",
    "ipadic": "ipadic>=1.0.0,<2.0",
    "isort": "isort>=5.5.4",
@@ -41,6 +42,7 @@ deps = {
    "onnxconverter-common": "onnxconverter-common",
    "onnxruntime-tools": "onnxruntime-tools>=1.4.2",
    "onnxruntime": "onnxruntime>=1.4.0",
+    "opencv-python": "opencv-python",
    "optuna": "optuna",
    "optax": "optax>=0.0.8,<=0.1.4",
    "packaging": "packaging>=20.0",
--- a/src/transformers/modeling_utils.py
+++ b/src/transformers/modeling_utils.py
@@ -207,29 +207,21 @@ def get_parameter_dtype(parameter: Union[nn.Module, GenerationMixin, "ModuleUtil
        # if no floating dtype was found return whatever the first dtype is
        return last_dtype

-    for t in parameter.buffers():
-        last_dtype = t.dtype
-        if t.is_floating_point():
-            return t.dtype
+    else:
+        # For nn.DataParallel compatibility in PyTorch > 1.5
+        def find_tensor_attributes(module: nn.Module) -> List[Tuple[str, Tensor]]:
+            tuples = [(k, v) for k, v in module.__dict__.items() if torch.is_tensor(v)]
+            return tuples

-    if last_dtype is not None:
-        # if no floating dtype was found return whatever the first dtype is
-        return last_dtype
+        gen = parameter._named_members(get_members_fn=find_tensor_attributes)
+        last_tuple = None
+        for tuple in gen:
+            last_tuple = tuple
+            if tuple[1].is_floating_point():
+                return tuple[1].dtype

-    # For nn.DataParallel compatibility in PyTorch > 1.5
-    def find_tensor_attributes(module: nn.Module) -> List[Tuple[str, Tensor]]:
-        tuples = [(k, v) for k, v in module.__dict__.items() if torch.is_tensor(v)]
-        return tuples
-
-    gen = parameter._named_members(get_members_fn=find_tensor_attributes)
-    last_tuple = None
-    for tuple in gen:
-        last_tuple = tuple
-        if tuple[1].is_floating_point():
-            return tuple[1].dtype
-
-    # fallback to the last dtype
-    return last_tuple[1].dtype
+        # fallback to the last dtype
+        return last_tuple[1].dtype


 def get_state_dict_float_dtype(state_dict):
--- a/tests/tools/test_image_segmentation.py
+++ b/tests/tools/test_image_segmentation.py
@@ -44,10 +44,10 @@ class ImageSegmentationToolTester(unittest.TestCase, ToolTesterMixin):

    def test_exact_match_kwarg(self):
        image = Image.open(Path(get_tests_dir("fixtures/tests_samples/COCO")) / "000000039769.png").resize((512, 512))
-        result = self.tool(image=image, prompt="cat")
+        result = self.tool(image=image, label="cat")
        self.assertTrue(isinstance(result, Image.Image))

    def test_exact_match_kwarg_remote(self):
        image = Image.open(Path(get_tests_dir("fixtures/tests_samples/COCO")) / "000000039769.png").resize((512, 512))
-        result = self.remote_tool(image=image, prompt="cat")
+        result = self.remote_tool(image=image, label="cat")
        self.assertTrue(isinstance(result, Image.Image))
--- a/utils/check_build.py
+++ b/utils/check_build.py
@@ -0,0 +1,48 @@
+# coding=utf-8
+# Copyright 2023 The HuggingFace Inc. team.
+#
+# Licensed under the Apache License, Version 2.0 (the "License");
+# you may not use this file except in compliance with the License.
+# You may obtain a copy of the License at
+#
+#     http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+import argparse
+import importlib
+from pathlib import Path
+
+
+# Test all the extensions added in the setup
+FILES_TO_FIND = [
+    "kernels/rwkv/wkv_cuda.cu",
+    "kernels/rwkv/wkv_op.cpp",
+    "models/deformable_detr/custom_kernel/ms_deform_attn.h",
+    "models/deformable_detr/custom_kernel/cuda/ms_deform_im2col_cuda.cuh",
+    "models/graphormer/algos_graphormer.pyx",
+]
+
+
+def test_custom_files_are_present(transformers_path):
+    # Test all the extensions added in the setup
+    for file in FILES_TO_FIND:
+        if not (transformers_path / file).exists():
+            return False
+    return True
+
+
+if __name__ == "__main__":
+    parser = argparse.ArgumentParser()
+    parser.add_argument("--check_lib", action="store_true", help="Whether to check the build or the actual package.")
+    args = parser.parse_args()
+    if args.check_lib:
+        transformers_module = importlib.import_module("transformers")
+        transformers_path = Path(transformers_module.__file__).parent
+    else:
+        transformers_path = Path.cwd() / "build/lib/transformers"
+    if not test_custom_files_are_present(transformers_path):
+        raise ValueError("The built release does not contain the custom files. Fix this before going further!")
Author	SHA1	Message	Date
Sylvain Gugger	ba7054533f	Release: v4.29.2 Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details	2023-05-16 14:24:29 -04:00
Sylvain Gugger	38bda6a321	Build with non Python files (#23405 ) * Add a test of the built release * Polish everything * Trigger CI	2023-05-16 14:23:55 -04:00
Sylvain Gugger	118e981068	Revert "search buffers for dtype" (#23308 ) Some checks failed Release - Conda / build_and_package (push) Has been cancelled Details Revert "search buffers for dtype (#23159)" This reverts commit `ef42c2c487`.	2023-05-11 15:32:41 -04:00
Sylvain Gugger	37a508cd86	Release: v4.29.1	2023-05-11 14:42:49 -04:00
Sylvain Gugger	d2e5aedfb6	Style	2023-05-11 14:41:53 -04:00
Sylvain Gugger	df98769c7a	Fix image segmentation tool test (#23306 )	2023-05-11 14:41:48 -04:00
Freddy Boulton	3652e1665b	Fix typo in gradio-tools docs (#23305 ) Fix typo	2023-05-11 14:41:42 -04:00
Sylvain Gugger	b1189abc99	Fix broken links in the agent docs (#23297 )	2023-05-11 14:41:34 -04:00
Lysandre Debut	10eeb2adf6	Agents extras (#23301 ) * Agents extras * Add to docs	2023-05-11 14:41:27 -04:00
Mishig	6376aa2f2d	Update transformers_agents.mdx (#23289 ) Make `huggingface-tools` to [`huggingface-tools`](https://huggingface.co/huggingface-tools)	2023-05-11 14:41:15 -04:00
Mishig	48722778ea	Update custom_tools.mdx: fix link (#23292 ) Wrong parantheses	2023-05-11 14:41:06 -04:00