fix infer task from model_name if model from sentence transformer #2151
Conversation
@IlyasMoutawwakil @echarlaix could you please take a look?
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
@@ -1770,6 +1770,7 @@ def _infer_task_from_model_name_or_path(
     revision: Optional[str] = None,
     cache_dir: str = HUGGINGFACE_HUB_CACHE,
     token: Optional[Union[bool, str]] = None,
+    library_name: Optional[str] = None,
I see you're adding library_name to the signature, but shouldn't this argument be specified in the method's calls as well?
I just want to get confirmation that this is a reliable approach before doing that.
In the case of optimum itself, library_name also needs to be provided to infer_task_from_model: https://github.com/huggingface/optimum/blob/main/optimum/exporters/onnx/__main__.py#L259
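For context, a hedged sketch of the call-site change being discussed (the exact post-PR signature of infer_task_from_model is an assumption here): the export entry point would forward the user's library choice instead of relying only on auto-detection.

```python
# Hypothetical call-site sketch, not the merged diff: forward library_name
# when inferring the task so auto-detection cannot override the user's choice.
from optimum.exporters import TasksManager

task = TasksManager.infer_task_from_model(
    "BAAI/bge-small-zh-v1.5",
    library_name="transformers",  # the library the user forced via --library
)
```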
Yeah, you can go ahead and add it where needed, it seems reliable.
OK, added.
I'm only not sure about the TFLite exporter: as far as I can see, it uses library_name="transformers" explicitly when getting the export config, which means other libraries are not supported, right? I also added library_name="transformers" there to avoid a model type mismatch (e.g. in the sentence-transformers case described above: if it is not provided, the model is loaded as a sentence-transformers model and an error is raised in some later steps). See the sketch below.
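A minimal sketch of the TFLite pattern being described, assuming optimum's TasksManager.get_exporter_config_constructor API; the helper name build_tflite_export_config and its arguments are placeholders, and the forced library_name is the point.

```python
from optimum.exporters import TasksManager


def build_tflite_export_config(model, task: str):
    """Resolve the TFLite export config, always as a transformers model."""
    constructor = TasksManager.get_exporter_config_constructor(
        exporter="tflite",
        model=model,
        task=task,
        # Forced to "transformers" so a sentence-transformers checkpoint is
        # not loaded as a sentence-transformers model and mismatched later.
        library_name="transformers",
    )
    return constructor(model.config)
```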
The failed test is not related to my changes: ModernBERT is available only on the transformers master branch; there is no released transformers package that contains this model's code yet.
What does this PR do?
The issue was found while attempting to export https://huggingface.co/BAAI/bge-small-zh-v1.5 via optimum-intel without a --task specification.
The root cause is that infer_library_from_model detects the library as sentence-transformers even if --library transformers is selected in the CLI, and there is no additional condition specifying a task for sentence-transformers models, so the method raises an error.
This PR provides a default task for sentence-transformers and adds a mechanism to explicitly set library_name, in case the user forces --library_name in the CLI (additional changes are needed to enable this behaviour in optimum-intel and other integrations that allow selecting library_name). This avoids a mismatch between the auto-detected and the user-selected library: a sentence-transformers model can be exported under either the transformers or the sentence-transformers library name, and the export configuration and the exported model may differ between the two.
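To make the described mechanism concrete, here is a minimal, self-contained sketch of the control flow (not the PR's actual diff): the helper infer_library_from_model and the feature-extraction default are stand-ins based on the description above.

```python
from typing import Optional


def infer_library_from_model(model_name_or_path: str) -> str:
    # Stand-in for optimum's auto-detection; the real implementation inspects
    # the repo's files and metadata. Here we pretend detection always says
    # sentence-transformers, as it does for BAAI/bge-small-zh-v1.5.
    return "sentence_transformers"


def infer_task_from_model_name_or_path(
    model_name_or_path: str,
    library_name: Optional[str] = None,
) -> str:
    if library_name is None:
        # Auto-detect only when the caller did not force a library, so an
        # explicit --library transformers is not silently overridden.
        library_name = infer_library_from_model(model_name_or_path)
    if library_name == "sentence_transformers":
        # Previously there was no branch for sentence-transformers, so task
        # inference failed; fall back to a default task instead.
        return "feature-extraction"
    # For other libraries, task inference would query hub metadata; omitted.
    raise RuntimeError(f"could not infer task for {model_name_or_path}")


print(infer_task_from_model_name_or_path("BAAI/bge-small-zh-v1.5"))
# -> "feature-extraction" instead of the error described in this PR
```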