Qdrant client and Qdrant Cloud now offer cloud inference with Fastembed, allowing users to utilize the same models while offloading embedding computations to the cloud.
Feature request:
- Integrate Qdrant Cloud Inference as an embedding provider in Codeweaver.
- Users should be able to select Qdrant Cloud Inference as an embedding backend, in addition to currently supported local or self-hosted options.
- Ensure configuration options allow switching easily between local inference and cloud inference.
- Provide documentation for setup and usage, including API credentials, supported models, and usage examples.
Benefits:
- Simplifies scaling: Users can leverage Qdrant's managed infrastructure to handle increased embedding workloads.
- Consistency: Enables users to use the same models irrespective of local or cloud execution.
- Reduced local resource consumption: Cloud inference offloads compute from client machines.
References:
- Qdrant client documentation on Cloud Inference
- Fastembed integration details
Acceptance criteria:
- Qdrant Cloud Inference integration is available and easily selectable.
- End-user documentation is updated and tested.
- Includes basic tests to validate provider selection and inference results.
Qdrant client and Qdrant Cloud now offer cloud inference with Fastembed, allowing users to utilize the same models while offloading embedding computations to the cloud.
Feature request:
Benefits:
References:
Acceptance criteria: