Outdated Text Embeddings Inference containers #149

@pocman

This is a copy of an issue filed on the Google Issue Tracker.

What you would like to accomplish:

Enable the deployment of Text Embeddings Inference (TEI) containers with versions huggingface-text-embeddings-inference-cu122.1-6-1.ubuntu2204 and huggingface-text-embeddings-inference-cu122.1-7.ubuntu2204 on Vertex AI endpoints. This would allow customers to utilize embedding models that require these specific TEI versions, such as snowflake-arctic-embed-m-v2.0.

How this might work:

The Vertex AI Product Team would build and make available the requested TEI container versions in the Vertex AI Model Registry.

If applicable, reasons why alternative solutions are not sufficient:

Current TEI versions are insufficient: The currently available TEI container versions on Vertex AI endpoints (up to 1.6.0) do not support the requirements of newer embedding models like snowflake-arctic-embed-m-v2.0.
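To make the version gap concrete, here is a minimal Python sketch that reads the TEI version out of a container name and compares it with the 1.6.0 ceiling mentioned above. The naming layout (`<cuda>.<version with dashes>.<os>`) is an assumption inferred only from the two container names quoted in this issue, and `tei_version` and `LATEST_AVAILABLE` are hypothetical helper names, not part of any Google or Hugging Face tooling.

```python
import re

# Assumed layout, inferred from the two names quoted in this issue:
#   huggingface-text-embeddings-inference-<cuda>.<version-with-dashes>.<os>
TAG_PATTERN = re.compile(
    r"^huggingface-text-embeddings-inference-"
    r"(?P<cuda>cu\d+)\.(?P<version>\d+(?:-\d+)*)\.(?P<os>[a-z]+\d+)$"
)

# Newest TEI version currently deployable on Vertex AI, per this issue.
LATEST_AVAILABLE = (1, 6, 0)


def tei_version(tag: str) -> tuple:
    """Return the TEI version encoded in a container name as (major, minor, patch)."""
    match = TAG_PATTERN.match(tag)
    if match is None:
        raise ValueError(f"unrecognized TEI container name: {tag!r}")
    parts = [int(p) for p in match.group("version").split("-")]
    # Pad missing components with zeros so "1-7" compares as 1.7.0.
    parts += [0] * (3 - len(parts))
    return tuple(parts[:3])


REQUESTED = [
    "huggingface-text-embeddings-inference-cu122.1-6-1.ubuntu2204",
    "huggingface-text-embeddings-inference-cu122.1-7.ubuntu2204",
]

for tag in REQUESTED:
    version = tei_version(tag)
    # Tuple comparison is element-wise, so (1, 6, 1) > (1, 6, 0).
    print(tag, version, "newer than available:", version > LATEST_AVAILABLE)
```

Under that assumed layout, the two requested containers parse to TEI 1.6.1 and 1.7.0, both newer than the 1.6.0 ceiling.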

Other information (workarounds you have tried, documentation consulted, etc):

The customer attempted to deploy snowflake-arctic-embed-m-v2.0 using the existing TEI containers on Vertex AI endpoints but encountered compatibility issues due to the older TEI version.
