feat: Add first docker-compose deployment for stability #92
Changes from all commits
513c0cf
78f3cca
369787b
2e0bd7b
8582c9c
c226a51
| Diff line change |
|---|
| @@ -0,0 +1,43 @@ |
| services: |
| sglang-router: |
| # https://hub.docker.com/r/lmsysorg/sglang-router/tags |
| image: deepauto/sglang-router:v1.2.9-sglang |
| # Host Network to use Tailscale |
| network_mode: host |
| command: |
| - --host |
| - "0.0.0.0" |
| - --port |
| - "10090" |
| # Worker Configuration |
| - --worker-urls |
| - http://h100-80-1:10080 |
| # - http://h100-80-2:10080 |
Comment on lines +14 to +15
| - http://h100-80-1:10080 | |
| # - http://h100-80-2:10080 | |
| - http://${WORKER_HOSTNAME:-h100-80-1}:10080 # Set WORKER_HOSTNAME env var to override | |
| # - http://${WORKER_HOSTNAME_2:-h100-80-2}:10080 # Example for a second worker |
Copilot
AI
Oct 10, 2025
The port mapping is incomplete. Prometheus port 10091 is configured in the command but not listed in the ports section, which will prevent external access to the monitoring metrics.
Copilot
AI
Oct 10, 2025
The ports configuration is incomplete. The router is configured to listen on port 10090 and also expose Prometheus metrics on port 10091, but only port 10090 is exposed. Add port mapping for 10091:10091 to make Prometheus metrics accessible.
Copilot
AI
Oct 10, 2025
Port mappings are redundant when using 'network_mode: host'. The host network mode makes all container ports directly accessible on the host, so explicit port mappings are unnecessary and could cause confusion.
| ports: | |
| # Router port | |
| - 10090:10090 | |
| # Monitoring port | |
| - 10091:10091 |
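
To illustrate the host-networking comment above, here is a minimal sketch of the router service with no `ports` section. It reuses only values visible in this diff. The mention of port 10091 follows the review comments, since the flag that sets it is not shown in this excerpt. Depending on the Docker Compose version, a `ports` list combined with `network_mode: host` may be ignored or rejected, so this is worth checking against the version in use.

```yaml
services:
  sglang-router:
    image: deepauto/sglang-router:v1.2.9-sglang
    # Host networking: the router binds directly to the host's ports,
    # so 10090 (and, per the review, 10091 for metrics) need no mapping.
    network_mode: host
    command:
      - --host
      - "0.0.0.0"
      - --port
      - "10090"
      - --worker-urls
      - http://h100-80-1:10080
    # Intentionally no `ports:` section here.
```

If the service were moved off host networking, both mappings from the suggestion (`10090:10090` and `10091:10091`) would presumably be needed instead.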
| Diff line change |
|---|
| @@ -0,0 +1,106 @@ |
| services: |
| sglang-server: |
| # https://hub.docker.com/r/deepauto/hip-attention/tags |
| image: deepauto/hip-attention:v1.2.9-sglang |
| ipc: host |
| deploy: |
| resources: |
| reservations: |
| devices: |
| - driver: nvidia |
| device_ids: ['0', '1', '2', '3', '4', '5', '6', '7'] |
| capabilities: [gpu] |
| volumes: |
| - type: volume |
| source: sglang-cache |
| target: /root/.cache |
| - type: bind |
| source: ${HF_HOME:?error} |
| source: ${HF_HOME:?error} | |
| source: ${HF_HOME:?HF_HOME environment variable is required but not set} |
Copilot
AI
Oct 10, 2025
The error message 'error' is not descriptive. Consider using a more helpful message like 'HF_HOME environment variable is required'.
| source: ${HF_HOME:?error} | |
| source: ${HF_HOME:?HF_HOME environment variable is required} |
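
The required-variable form suggested above can be tried in isolation. The sketch below uses a hypothetical `interpolation-demo` service and the `busybox` image, neither of which is part of this PR. It assumes Compose's documented `${VAR:?message}` interpolation, which stops with the given message when the variable is unset or empty.

```yaml
services:
  # Hypothetical service, used only to illustrate interpolation.
  interpolation-demo:
    image: busybox
    environment:
      # Compose aborts with this message if HF_HOME is unset or empty.
      HF_HOME: ${HF_HOME:?HF_HOME environment variable is required}
```

Running `docker compose config` against such a file with `HF_HOME` unset should surface the message before any container starts.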
Hardcoded hostname 'h100-80-1' makes this configuration environment-specific and less portable. Consider using environment variables or Docker service names for worker URLs.
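
One way to address the portability concern above is the `${VAR:-default}` form from the earlier suggestion in this review. The fragment below is a sketch only: `WORKER_HOSTNAME` is the variable name proposed in that suggestion, not something the project already defines, and the other values are copied from the diff.

```yaml
services:
  sglang-router:
    image: deepauto/sglang-router:v1.2.9-sglang
    network_mode: host
    command:
      - --worker-urls
      # Falls back to h100-80-1 when WORKER_HOSTNAME is unset or empty.
      - http://${WORKER_HOSTNAME:-h100-80-1}:10080
```

For example, `WORKER_HOSTNAME=h100-80-2 docker compose up` would point the router at a different worker without editing the file. Docker service names are likely a poorer fit here, because a container on host networking does not join the Compose network where those names resolve.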