This project provides an auto-scaling, event-driven infrastructure specifically tailored for Data Science teams running sporadic or batch AI inference workloads.
Data Science teams often need substantial compute power (CPU/RAM) to run inference across massive datasets. However, these batch jobs are sporadic. Running a static GKE cluster or a fleet of VMs 24/7 is a massive waste of capital.
This Terraform code deploys a "Scale-to-Zero" event-driven pipeline:
- Google Cloud Storage (GCS): The landing zone where data scientists upload their datasets.
- Pub/Sub Topic: An event bus. When a new batch job needs to run, a message is published here.
- Cloud Run (Serverless Compute): The inference engine. It listens to the Pub/Sub topic via a Push Subscription.
- Scale to Zero: When the queue is empty, instances scale down to 0, costing the business nothing.
- Burst Scaling: When a massive batch job is triggered, GCP auto-scales out to as many as 50 parallel containers to crunch the data simultaneously.
- Least Privilege IAM: The Cloud Run service account is strictly scoped. It can only read from the specific GCS bucket and process messages from the specific Pub/Sub subscription. The service itself is not reachable from the public internet (`INGRESS_TRAFFIC_INTERNAL_ONLY`).
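The wiring described above can be sketched in Terraform roughly as follows. This is a minimal illustration, not the module itself: the resource names, region, and container image path are assumptions, and the referenced service accounts, topic, and bucket are presumed to be defined elsewhere in the configuration.

```hcl
# Sketch only -- names, region, and image are illustrative placeholders.
resource "google_cloud_run_v2_service" "inference" {
  name     = "batch-inference" # assumed name
  location = "us-central1"     # assumed region

  # Block all public internet traffic; only internal sources may call the service.
  ingress = "INGRESS_TRAFFIC_INTERNAL_ONLY"

  template {
    service_account = google_service_account.inference.email

    scaling {
      min_instance_count = 0  # scale to zero when the queue is empty
      max_instance_count = 50 # burst ceiling for large batch jobs
    }

    containers {
      image = "us-docker.pkg.dev/YOUR_PROJECT/inference/worker:latest" # placeholder
    }
  }
}

# Push subscription: Pub/Sub delivers each job message to the Cloud Run
# endpoint, authenticating with an OIDC token from a dedicated invoker SA.
resource "google_pubsub_subscription" "jobs" {
  name  = "inference-jobs"
  topic = google_pubsub_topic.jobs.id

  push_config {
    push_endpoint = google_cloud_run_v2_service.inference.uri
    oidc_token {
      service_account_email = google_service_account.pubsub_invoker.email
    }
  }
}

# Least privilege: read-only access to the one dataset bucket,
# rather than a project-wide storage role.
resource "google_storage_bucket_iam_member" "read_datasets" {
  bucket = google_storage_bucket.datasets.name
  role   = "roles/storage.objectViewer"
  member = "serviceAccount:${google_service_account.inference.email}"
}
```

Setting `min_instance_count = 0` is what makes the pipeline cost nothing while idle; the trade-off is a cold-start delay on the first message of a new batch.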
- Ensure you have authenticated with GCP (`gcloud auth application-default login`).
- Run `terraform init` to download the Google provider.
- Run `terraform apply -var="project_id=YOUR_PROJECT_ID"`.
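Putting the steps above together, a typical first deploy might look like this (the `plan` step is an optional but recommended addition; `YOUR_PROJECT_ID` is a placeholder):

```shell
# Authenticate so Terraform can use Application Default Credentials.
gcloud auth application-default login

# Download the Google provider and initialize local state.
terraform init

# Preview the changes, then create the infrastructure.
terraform plan -var="project_id=YOUR_PROJECT_ID"
terraform apply -var="project_id=YOUR_PROJECT_ID"
```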
This infrastructure is designed to be fully automated and ephemeral, allowing Data Science teams to focus on models rather than managing servers.