Deploying AI models at scale brings real-world costs—especially in terms of energy use and environmental impact. While there are many efforts to make machine learning more efficient, most of them focus on just one layer of the stack: optimizing a model, tweaking the hardware, or shifting workloads to different locations. These isolated fixes often miss the bigger picture.
EAVE (Energy Analytics for Cost-effective and Sustainable Operations) is an open-source framework designed to bring those pieces together. It supports multi-layer optimization across tasks, models, hardware configurations, and deployment strategies. EAVE helps practitioners explore trade-offs between cost, performance, and carbon impact—and make informed decisions at each step.
The framework combines task profiling, model compression techniques (like distillation and quantization), hardware-aware planning, and spatial-temporal deployment analysis. It uses a mix of causal inference, Bayesian modeling, and PUE-based energy analytics to recommend more sustainable and cost-effective AI deployment strategies.
Evaluated on MLPerf workloads, EAVE provides actionable insights for both machine learning engineers and data center operators—without sacrificing model accuracy.
EAVE was evaluated for its predictive accuracy, causal reasoning capabilities, and multi-objective optimization performance.
EAVE estimates Average Treatment Effects (ATE) with Bayesian uncertainty over key energy drivers. Results show that IT energy, cooling energy, and PUE are more sensitive to infrastructure and facility-level attributes (e.g., total energy, weather, cooling systems) than to model or hardware configurations. Placebo tests confirm the robustness of these inferences.
Figure: Average treatment effects estimation results on It energy, cooling energy and PUE.
By leveraging multi-objective optimization, EAVE identifies Pareto-optimal configurations balancing performance, cost, and carbon footprint. For instance, compressing the LLaMA 3.3 70B model to 8B via distillation reduced energy and CO₂ by 37.23% with only a 14.34% drop in accuracy.
Using a Bayesian Neural Network (BNN), EAVE predicts energy metrics with high reliability. On a held-out test set, BNN outperforms classical baselines in MAE, MSE, RMSE, and R². Posterior predictive checks (below) further validate the model’s adequacy, showing strong alignment between simulated and observed distributions.
Figure: a) Visualization of pareto optimal front results, b) Kernel density plot showing the results of posterior predictive check over 1000 simulations
- Install Conda
- Install Python 3.9+
-
Navigate to the backend directory
cd escade-project/EAVE-dashboard/eave-api -
Create and activate a Conda environment
conda create --name eave-backend python=3.9 -y conda activate eave-backend
-
Install dependencies
pip install -r requirements.txt
-
Run the FastAPI server
uvicorn main:app --reload
- Install Node.js
- Install npm (usually comes with Node.js)
-
Navigate to the frontend directory
cd escade-project/EAVE-dashboard/eave-ui -
Install dependencies
npm install
-
Start the development server
npm run dev
Once both the backend and frontend servers are running:
Open your browser and go to: http://localhost:5173
- Ensure that both the frontend and backend are running simultaneously for full functionality.
- Use
conda deactivateto exit the backend environment when done. - If any packages are missing, install them by
pip install [missing-package]
EAVE-dashboard/
├── eave-api/ # FastAPI back-end
│ ├── app/
│ │ ├── main.py # entry point for SPA
│ │ │ ├── measure/ # energy + causal analysis
│ │ │ ├── predict/ # CausalBNN inference
│ │ │ └── optimize/ # compression + Pareto
│ ├── data/ # benchmark input
│ └── requirements.txt # back-end dependencies
└── eave-ui/ # React front-end
