From bae4ac1729064dde8ba39f6988f81ec34e66a703 Mon Sep 17 00:00:00 2001 From: sk-portkey Date: Mon, 28 Apr 2025 18:38:52 +0530 Subject: [PATCH 1/3] fix: data service changelog --- changelog/data-service.mdx | 224 +++++++++++++++++++++++++++++++++++++ docs.json | 2 +- 2 files changed, 225 insertions(+), 1 deletion(-) create mode 100644 changelog/data-service.mdx diff --git a/changelog/data-service.mdx b/changelog/data-service.mdx new file mode 100644 index 00000000..c9b0df1c --- /dev/null +++ b/changelog/data-service.mdx @@ -0,0 +1,224 @@ +--- +title: "Data Service" +sidebarTitle: "Data Service [1.1.10]" +--- + + +## v1.1.10 +--- + +### S3 Upload Improvements +- Added support for passing encryption headers while uploading stream data to S3. +- Added support for both file path and direct value from environment variables for secrets like redis connection. + +### Batch and Fine-tune Enhancements +- Added provider_options as a key for batches and fine-tunes. + +### Stream Handling +- Improved stream cleanup for validation processes. + + + +## v1.1.9 +--- + +### File Handling Fixes +- Fixed issue with extra bytes being added to files during processing. + + + +## v1.1.8 +--- + +### File Upload Improvements +- Updated socket timeout for long requests during file uploads to prevent timeouts. + + + +## v1.1.7 +--- + +### Fireworks Fine-tuning Support +- Added support for Fireworks fine-tuning operations using Version2. + +### Batch Processing Improvements +- Included response tokens calculation in provider batch output. +- Fixed file loading in memory issues for better performance. + + + +## v1.1.6 +--- + +### S3 SDK Updates +- Upgraded S3 SDK to latest version for fixing issue with S3 streaming. + + + +## v1.1.5 +--- + +### Batch Processing Enhancements +- Improved provider batch output handling. +- Added support for custom batch output paths. +- Increased maximum lines to 500k and chunk size to 5MB for better performance. + + + +## v1.1.4 +--- + +### Vertex Embeddings Batches Support +- Added support for Vertex batch embeddings. + +### S3 and Storage Improvements +- Added support for custom S3 get operations. +- Back-merged Clickhouse query changes for better performance. + +### Batch Processing Updates +- Included model information in log objects. +- Implemented custom batch processing output generations. + +### HTTPS Support +- Added HTTPS connection support for secure communications. + + + + +## v1.1.3 +--- + +### Infrastructure Updates +- Streamlined uploaded file location for Bedrock operations. + + + +## v1.1.2 +--- + +### Vertex Integration +- Added support for Vertex provider options for batches. + +### Infrastructure Updates +- Fixed missing custom S3 support for file retrieval. +- Implemented cluster mode Redis for queues. +- Updated fine-tune status handling. + + + +## v1.1.1 +--- + +### S3 Enhancements +- Made S3 bucket optional for Bedrock batches. +- Added S3 encryption header support for finetunes and batches. +- Implemented SSE file upload support. + +### Logging Improvements +- Added filtering for log exports. +- Implemented end limit for log export records. + +### Performance Optimizations +- Added optional push gateway support. +- Implemented internal memory cache for better performance. + + + +## v1.1.0 +--- + +### Bull Board Integration +- Added Bull Board for visualizing job queues and their status. + +### Batch Job Retry Support +- Implemented retry functionality for batch jobs to handle failures gracefully. + +### Prometheus Metrics Enhancements +- Added Prometheus metrics for batch jobs and fine-tuning operations. + +### Error Handling Improvements +- Enhanced error handling across the service. + + + + +## v1.0.8 +--- + +### Azure Fine-tuning Support +- Added support for Azure fine-tuning operations. + + + + +## v1.0.7 +--- + +### Fine-tune v2 Implementation +- Implemented version 2 of the fine-tuning functionality. + + + +## v1.0.6 +--- + +### Prompt Slug Filter +- Added support for data exports filtering by PromptSlug. + + + + +## v1.0.5 +--- + +### Batch Exports +- Added support for batch exports functionality. + + + +## v1.0.4 +--- + +### Code Quality Improvements +- Fixed dynamic port retrieval from environment variables. + + + +## v1.0.3 +--- + +### Vision Fine-tuning Support +- Added support for vision fine-tuning validation for OpenAI. +- Implemented S3 bucket support for fine-tunes. + +### AWS Integration Improvements +- Fixed assumed role handling for Bedrock fine-tuning dataset URLs. +- Improved S3 bucket path handling for Bedrock fine-tune operations. +- Achieved parity with Enterprise Gateway for data sources. + + + +## v1.0.2 +--- + +### Fine-tuning Enhancements +- Added support for OpenAI job start and Fireworks upload. +- Improved handling of chunk type failures with JSON. + + + +## v1.0.1 +--- + +### Fireworks Fine-tuning Support +- Added support for Fireworks fine-tuning operations. + + + + +## v1.0.0 +--- + +### Initial Release +- Base version of the Data Service with core functionality. + diff --git a/docs.json b/docs.json index 7fb68469..a4dba282 100644 --- a/docs.json +++ b/docs.json @@ -831,7 +831,7 @@ }, { "group": "Enterprise Releases", - "pages": ["changelog/enterprise", "changelog/helm-chart"] + "pages": ["changelog/enterprise", "changelog/helm-chart", "changelog/data-service"] }, { "group": "Product Releases", From e877eb0fb6ad0b97c42901004fd50bfff5d52a30 Mon Sep 17 00:00:00 2001 From: sk-portkey Date: Wed, 25 Jun 2025 15:47:27 +0530 Subject: [PATCH 2/3] chore: update data service change log to 1.2.4 --- changelog/data-service.mdx | 123 +++++++++++++++++++++++++++++-------- 1 file changed, 98 insertions(+), 25 deletions(-) diff --git a/changelog/data-service.mdx b/changelog/data-service.mdx index c9b0df1c..2a3da398 100644 --- a/changelog/data-service.mdx +++ b/changelog/data-service.mdx @@ -1,8 +1,90 @@ --- title: "Data Service" -sidebarTitle: "Data Service [1.1.10]" +sidebarTitle: "Data Service [1.2.4]" --- + +## v1.2.4 +--- + +### Fixes and Improvements +- Fixed issue with custom batches missing cost calculation for some provider models + + + +## v1.2.3 +--- + +### Fine-tuning and Batch Processing +- Added support for configurable `FINETUNE_STATUS_CHECK_INTERVAL` for provider fine-tuning status check operations. +- Added support for configurable `BATCH_STATUS_CHECK_INTERVAL` for provider batch processing status check operations. +- Both values should be in milliseconds. Minimum value is 10000 milliseconds. +- If not provided, will default to 10 seconds. + + + + +## v1.2.2 +--- + +### Observability +- Added support for below Prometheus Counters + - `batch_count` + - `batch_cost` + - `batch_input_tokens` + - `batch_total_tokens` + - `batch_process_time` + - `batch_success_row_count` + - `batch_failure_row_count` + - `batch_row_count` +- With the below labels + - `provider` + - `type` (provider/custom) + +### Fixes and Improvements +- Fixed issue with attributing incorrect created at time stamp for batch processing +- Including error source as `control plane` for control plane failures + + + +## v1.2.1 +--- + +### Data exports +- Added support for [Data exports](/api-reference/admin-api/data-plane/logs/log-exports-beta/start-a-log-export) for hybrid deployments. + +### Fixes and Improvements +- Fixed issue with custom batches for small batch files + + + +## v1.2.0 +--- + +### Custom S3 Support +- Added support for `s3_custom` log store option for batches and fine-tunes. + +### Fixes and Improvements +- Fixed issue with STS token generation for AWS. + + + +## v1.1.12 +--- + +### Fixes and Improvements +- Fixed issue with cost calculation for custom batches. + + + +## v1.1.11 +--- + +### Fixes and Improvements +- Fixed issue where queue remains stuck in a queued state during file validation. + + + ## v1.1.10 --- @@ -11,9 +93,6 @@ sidebarTitle: "Data Service [1.1.10]" - Added support for passing encryption headers while uploading stream data to S3. - Added support for both file path and direct value from environment variables for secrets like redis connection. -### Batch and Fine-tune Enhancements -- Added provider_options as a key for batches and fine-tunes. - ### Stream Handling - Improved stream cleanup for validation processes. @@ -39,7 +118,7 @@ sidebarTitle: "Data Service [1.1.10]" --- ### Fireworks Fine-tuning Support -- Added support for Fireworks fine-tuning operations using Version2. +- Added support for `Fireworks` fine-tuning operations using Version2. ### Batch Processing Improvements - Included response tokens calculation in provider batch output. @@ -61,7 +140,7 @@ sidebarTitle: "Data Service [1.1.10]" ### Batch Processing Enhancements - Improved provider batch output handling. - Added support for custom batch output paths. -- Increased maximum lines to 500k and chunk size to 5MB for better performance. +- Increased maximum lines for custom batches to 500k and chunk size to 5MB for better performance. @@ -69,18 +148,16 @@ sidebarTitle: "Data Service [1.1.10]" --- ### Vertex Embeddings Batches Support -- Added support for Vertex batch embeddings. - -### S3 and Storage Improvements -- Added support for custom S3 get operations. -- Back-merged Clickhouse query changes for better performance. +- Added support for `Vertex` batch embeddings. ### Batch Processing Updates - Included model information in log objects. - Implemented custom batch processing output generations. -### HTTPS Support -- Added HTTPS connection support for secure communications. +### Internal POD to POD HTTPS Support +- Added support for internal POD to POD HTTPS communication. +- This can be enabled by mounting a volume with certificate and key. +- `TLS_KEY_PATH` and `TLS_CERT_PATH` environment variables will be used to fetch the certificate and key from the volume. @@ -89,7 +166,7 @@ sidebarTitle: "Data Service [1.1.10]" --- ### Infrastructure Updates -- Streamlined uploaded file location for Bedrock operations. +- Streamlined uploaded file location for `Bedrock` operations. @@ -97,10 +174,9 @@ sidebarTitle: "Data Service [1.1.10]" --- ### Vertex Integration -- Added support for Vertex provider options for batches. +- Added support for `Vertex` provider options for batches. ### Infrastructure Updates -- Fixed missing custom S3 support for file retrieval. - Implemented cluster mode Redis for queues. - Updated fine-tune status handling. @@ -119,7 +195,6 @@ sidebarTitle: "Data Service [1.1.10]" - Implemented end limit for log export records. ### Performance Optimizations -- Added optional push gateway support. - Implemented internal memory cache for better performance. @@ -136,9 +211,6 @@ sidebarTitle: "Data Service [1.1.10]" ### Prometheus Metrics Enhancements - Added Prometheus metrics for batch jobs and fine-tuning operations. -### Error Handling Improvements -- Enhanced error handling across the service. - @@ -146,7 +218,7 @@ sidebarTitle: "Data Service [1.1.10]" --- ### Azure Fine-tuning Support -- Added support for Azure fine-tuning operations. +- Added support for `Azure OpeAI` fine-tuning operations. @@ -154,8 +226,8 @@ sidebarTitle: "Data Service [1.1.10]" ## v1.0.7 --- -### Fine-tune v2 Implementation -- Implemented version 2 of the fine-tuning functionality. +### Fine-tune v2 +- Implemented version 2 of the [fine-tuning](/product/ai-gateway/fine-tuning) functionality. @@ -171,8 +243,9 @@ sidebarTitle: "Data Service [1.1.10]" ## v1.0.5 --- -### Batch Exports -- Added support for batch exports functionality. +### Batch Processing +- Added provider and custom [Batch] (/product/ai-gateway/batches) processing functionality. + From ada5a678a62a82838f278695929efe24f89c9845 Mon Sep 17 00:00:00 2001 From: sk-portkey Date: Wed, 25 Jun 2025 16:03:24 +0530 Subject: [PATCH 3/3] chore: add changelog/data-service in missing place --- docs.json | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs.json b/docs.json index 1783384f..d6dcc9df 100644 --- a/docs.json +++ b/docs.json @@ -932,7 +932,7 @@ }, { "group": "Enterprise Releases", - "pages": ["changelog/enterprise", "changelog/helm-chart"] + "pages": ["changelog/enterprise", "changelog/helm-chart", "changelog/data-service"] }, { "group": "Product Releases",