This is a copy of this issue on Google Issuetracker
What you would like to accomplish:
Enable the deployment of Text Embeddings Inference (TEI) containers with versions huggingface-text-embeddings-inference-cu122.1-6-1.ubuntu2204 and huggingface-text-embeddings-inference-cu122.1-7.ubuntu2204 on Vertex AI endpoints. This would allow customers to utilize embedding models that require these specific TEI versions, such as snowflake-arctic-embed-m-v2.0.
How this might work:
The Vertex AI Product Team would build and make available the requested TEI container versions in the Vertex AI Model Registry.
If applicable, reasons why alternative solutions are not sufficient:
Current TEI versions are insufficient: The currently available TEI container versions on Vertex AI endpoints (up to 1.6.0) do not support the requirements of newer embedding models like snowflake-arctic-embed-m-v2.0.
Other information (workarounds you have tried, documentation consulted, etc):
The customer attempted to deploy snowflake-arctic-embed-m-v2.0 using the existing TEI containers on Vertex AI endpoints but encountered compatibility issues due to the older TEI version.
This is a copy of this issue on Google Issuetracker
What you would like to accomplish:
Enable the deployment of Text Embeddings Inference (TEI) containers with versions huggingface-text-embeddings-inference-cu122.1-6-1.ubuntu2204 and huggingface-text-embeddings-inference-cu122.1-7.ubuntu2204 on Vertex AI endpoints. This would allow customers to utilize embedding models that require these specific TEI versions, such as snowflake-arctic-embed-m-v2.0.
How this might work:
The Vertex AI Product Team would build and make available the requested TEI container versions in the Vertex AI Model Registry.
If applicable, reasons why alternative solutions are not sufficient:
Current TEI versions are insufficient: The currently available TEI container versions on Vertex AI endpoints (up to 1.6.0) do not support the requirements of newer embedding models like snowflake-arctic-embed-m-v2.0.
Other information (workarounds you have tried, documentation consulted, etc):
The customer attempted to deploy snowflake-arctic-embed-m-v2.0 using the existing TEI containers on Vertex AI endpoints but encountered compatibility issues due to the older TEI version.