
Commit e0b37b4

new post on local ai server - 2025-09-06 22:00:46
1 parent 372bfc9 commit e0b37b4

5 files changed

Lines changed: 119 additions & 59 deletions

File tree

  • _freeze/posts/buildingapowerfulenoughlocalaiserver/index/execute-results
  • drafts
    • buildingapowerfulenoughlocalaiserver
    • programaticallyaddingtabsetsinquarto
  • posts/buildingapowerfulenoughlocalaiserver

_freeze/posts/buildingapowerfulenoughlocalaiserver/index/execute-results/html.json

Lines changed: 2 additions & 2 deletions
Original file line number | Diff line number | Diff line change
@@ -1,8 +1,8 @@
11
{
2-
"hash": "7292ea7890b433f95c7ed85a563373b3",
2+
"hash": "2910e3d5e25105865252e02476a5f3c0",
33
"result": {
44
"engine": "jupyter",
5-
"markdown": "---\ntitle: Building a local AI server\nformat: html\ntoc: true\nlang: en\njupyter: python3\nipynb-shell-interactivity: all\nexecute:\n echo: false\ndate: 2025-06-03 06:50 +0200\ncategories: [\"ai\"]\ntags: [\"ollama\", \"aihardware\"]\ncomments:\n giscus:\n repo: jeev20/jeev20.github.io\n---\n\n\n## Requirements\n\n### Hardware\nMy requirements were quite basic:\n\n* A minimum of 16GB VRAM (preferably Nvidia)\n* A minimum of 16GB RAM\n* A minimum of 6 cores / 12 threads Ryzen CPU\n* A motherboard with 2 PCI GPU slots (does not need to be 16 lanes PCI)\n* A minimum of 600 watt power supply\n* A wi-fi smart-plug with scheduling capabilities\n* A motherboard which support power-on after power restoration\n\n\n### Software\n* Linux OS with a long-term support and used by a lot of users\n* Linux OS with Tailscale native support (systemd)\n* Linux OS with easy installation of Docker, Nvidia Container Toolkit and Cuda Toolkit\n* Linux OS with OpenSSH server to manage server remotely\n* Linux OS with crontab to schedule running of scripts\n \n\n## Server \nI ended up purchasing a second-hand PC with all the above requirements and then purchased two RTX 3060 GPUs with 12 GB Vram each. The RAM capacity is somewhat low for this use-case and I plan to upgrade it in the future. \n\n\n## Scheduling\n\n## Updates\nAs everything runs on an Ubuntu LTS 24.04. \n\n## Docker\n\n## Ollama and OpenWebUi Bundle\n\n## Docker exec\n\n## Power automation\n\n",
5+
    "markdown": "---\ntitle: Building a local AI server\nformat: html\ntoc: true\nlang: en\njupyter: python3\nipynb-shell-interactivity: all\nexecute:\n echo: false\ndate: 2025-05-15 06:50 +0200\ncategories: [\"ai server\"]\ntags: [\"ollama\", \"aihardware\", \"machine learning hardware\"]\ncomments:\n giscus:\n repo: jeev20/jeev20.github.io\n---\n\nI have been an avid tinkerer with home PCs. I enjoy building and configuring them. Naturally, as LLMs become a developer necessity, I wanted to build an economical yet performant LLM server for my home lab.\n\n## Requirements\n\n### Hardware\nMy requirements were quite basic:\n\n* A minimum of 16GB VRAM (preferably Nvidia)\n* A minimum of 16GB RAM\n* A minimum of 6 cores / 12 threads Ryzen CPU\n* A motherboard with 2 PCIe GPU slots (does not need to be 16 lanes)\n* A minimum of 600 watt power supply\n* A Wi-Fi smart plug with scheduling capabilities\n* A motherboard which supports power-on after power restoration\n\n\n### Software\n* Linux OS with long-term support and a large user base\n* Linux OS with Tailscale native support (systemd)\n* Linux OS with easy installation of Docker, Nvidia Container Toolkit and CUDA Toolkit\n* Linux OS with an OpenSSH server to manage the server remotely\n* Linux OS with crontab to schedule scripts\n \n\n## Server \nI ended up purchasing a second-hand PC meeting all the above requirements and then bought two used RTX 3060 GPUs with 12 GB of VRAM each. The combined 24 GB of VRAM is plenty for my use cases. In the future, this may be upgraded to 32 GB or more. \n\nThe RAM capacity (16 GB) is somewhat low for this use case and I plan to upgrade it in the future. \n\nThe PC cost me 1500 NOK and the two GPUs 5200 NOK. The total cost was 6700 NOK, which is equivalent to about 666 USD. I am sure it is a fantastic price for the performance it offers.\n\nTo ensure stable internet connectivity, I chose to connect the server directly to a UniFi router, which is part of a mesh network, via an Ethernet cable. \n\n\n![LLM Server - a sleeper build](images/LLMServer.jpg)\n\nAlthough it would have been great to connect to the main router, it is placed up in the attic, which can get quite cold in the winters. Also, I do not wish to go up there whenever I have to access this server. \n\n### Scheduling\nI have a cron job set up to shut down the server at around 01:00 every night. Simultaneously, a smart Wi-Fi plug switches off the power socket. To wake the server in the morning, the smart plug powers the socket back on, and in the server BIOS I have enabled the \"wake on power up\" setting. \n\nThis ensures that when the smart plug is in the on position, the server starts up. \n\nThis method of optimizing power usage has been working without any hiccups for several months now, and I recommend it to anyone looking to get into home-labbing. \n\n### Updates\nAs everything runs on Ubuntu 24.04 LTS, I have very few updates to make manually. \n\n### Combining Ollama and Open WebUI\n\nFrom the Open WebUI repository, I chose to run this configuration: \n\n```{.bash}\nsudo docker run -d -p 3000:8080 -p 11434:11434 --gpus=all -v ollama:/root/.ollama -v open-webui:/app/backend/data --name open-webui --restart always -e OLLAMA_HOST=0.0.0.0 ghcr.io/open-webui/open-webui:ollama\n```\n\nTo keep this installation up to date, I use `watchtower`:\n```{.bash}\nsudo docker run --rm --volume /var/run/docker.sock:/var/run/docker.sock containrrr/watchtower --run-once open-webui\n```\n### Docker exec\nOf course, an LLM server should be able to fetch the newest open-source models. I do this with a simple Ollama command. \n```{.bash}\nsudo docker exec open-webui ollama pull gpt-oss:latest\n```\n\n\n### Remote access\nTo access my LLM server from outside my home network, I use [Tailscale](https://tailscale.com/). Tailscale is an amazing technology which creates encrypted tunnels across devices. \n\nMy phone is also part of this `tailnet`, which allows it to connect to my LLM server from any public network. Tailscale automatically updates to the latest version when I run `apt update && apt upgrade`, ensuring minimal update overhead. \n\n## Usage and experience\n\nOverall, I am quite happy with how this server performs. I use it for both machine learning tasks and LLM experiments. Since GitHub Copilot has an agent mode which can be configured with Ollama, this server serves the models to VS Code when I need it. \n\nI have also configured this server as the LLM provider in my [Marimo](https://marimo.io/) notebooks. \n\nI am looking forward to many wonderful evenings of exploring LLMs and machine learning on this nifty little server. \n\n",
66
"supporting": [
77
"index_files"
88
],

drafts/buildingapowerfulenoughlocalaiserver/index.qmd

Lines changed: 0 additions & 57 deletions
This file was deleted.
Lines changed: 20 additions & 0 deletions
Original file line number | Diff line number | Diff line change
@@ -0,0 +1,20 @@
1+
---
2+
title: Programmatically adding tabsets in Quarto
3+
description: A walkthrough of using the embed option to create dashboards
4+
author: "Jeevith Hegde"
5+
date: 2025-08-25 15:33 +0200
6+
format: html
7+
toc: true
8+
lang: en
9+
jupyter: python3
10+
ipynb-shell-interactivity: all
11+
execute:
12+
echo: false
13+
categories: ["quarto"]
14+
tags: ["tutorial"]
15+
comments:
16+
giscus:
17+
repo: jeev20/jeev20.github.io
18+
---
19+
20+
Test
Lines changed: 97 additions & 0 deletions
Original file line number | Diff line number | Diff line change
@@ -0,0 +1,97 @@
1+
---
2+
title: Building a local AI server
3+
format: html
4+
toc: true
5+
lang: en
6+
jupyter: python3
7+
ipynb-shell-interactivity: all
8+
execute:
9+
echo: false
10+
date: 2025-05-15 06:50 +0200
11+
categories: ["ai server"]
12+
tags: ["ollama", "aihardware", "machine learning hardware"]
13+
comments:
14+
giscus:
15+
repo: jeev20/jeev20.github.io
16+
---
17+
18+
I have been an avid tinkerer with home PCs. I enjoy building and configuring them. Naturally, as LLMs become a developer necessity, I wanted to build an economical yet performant LLM server for my home lab.
19+
20+
## Requirements
21+
22+
### Hardware
23+
My requirements were quite basic:
24+
25+
* A minimum of 16GB VRAM (preferably Nvidia)
26+
* A minimum of 16GB RAM
27+
* A minimum of 6 cores / 12 threads Ryzen CPU
28+
* A motherboard with 2 PCIe GPU slots (does not need to be 16 lanes)
29+
* A minimum of 600 watt power supply
30+
* A Wi-Fi smart plug with scheduling capabilities
31+
* A motherboard which supports power-on after power restoration
32+
33+
34+
### Software
35+
* Linux OS with long-term support and a large user base
36+
* Linux OS with Tailscale native support (systemd)
37+
* Linux OS with easy installation of Docker, Nvidia Container Toolkit and CUDA Toolkit
38+
* Linux OS with an OpenSSH server to manage the server remotely
39+
* Linux OS with crontab to schedule scripts
40+
41+
42+
## Server
43+
I ended up purchasing a second-hand PC meeting all the above requirements and then bought two used RTX 3060 GPUs with 12 GB of VRAM each. The combined 24 GB of VRAM is plenty for my use cases. In the future, this may be upgraded to 32 GB or more.
44+
45+
The RAM capacity (16 GB) is somewhat low for this use case and I plan to upgrade it in the future.
46+
47+
The PC cost me 1500 NOK and the two GPUs 5200 NOK. The total cost was 6700 NOK, which is equivalent to about 666 USD. I am sure it is a fantastic price for the performance it offers.
48+
49+
To ensure stable internet connectivity, I chose to connect the server directly to a UniFi router, which is part of a mesh network, via an Ethernet cable.
50+
51+
52+
![LLM Server - a sleeper build](images/LLMServer.jpg)
53+
54+
Although it would have been great to connect to the main router, it is placed up in the attic, which can get quite cold in the winters. Also, I do not wish to go up there whenever I have to access this server.
55+
56+
### Scheduling
57+
I have a cron job set up to shut down the server at around 01:00 every night. Simultaneously, a smart Wi-Fi plug switches off the power socket. To wake the server in the morning, the smart plug powers the socket back on, and in the server BIOS I have enabled the "wake on power up" setting.
58+
59+
This ensures that when the smart plug is in the on position, the server starts up.
60+
61+
This method of optimizing power usage has been working without any hiccups for several months now, and I recommend it to anyone looking to get into home-labbing.
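As a sketch of this nightly shutdown, a cron entry in root's crontab could look like the following (the exact schedule and shutdown path are assumptions, not taken from the post):

```{.bash}
# Edit root's crontab with: sudo crontab -e
# m h dom mon dow  command  -- halt the machine at 01:00 every night
0 1 * * * /sbin/shutdown -h now
```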
62+
63+
### Updates
64+
As everything runs on Ubuntu 24.04 LTS, I have very few updates to make manually.
65+
66+
### Combining Ollama and Open WebUI
67+
68+
From the Open WebUI repository, I chose to run this configuration:
69+
70+
```{.bash}
71+
sudo docker run -d -p 3000:8080 -p 11434:11434 --gpus=all -v ollama:/root/.ollama -v open-webui:/app/backend/data --name open-webui --restart always -e OLLAMA_HOST=0.0.0.0 ghcr.io/open-webui/open-webui:ollama
72+
```
73+
74+
To keep this installation up to date, I use `watchtower`:
75+
```{.bash}
76+
sudo docker run --rm --volume /var/run/docker.sock:/var/run/docker.sock containrrr/watchtower --run-once open-webui
77+
78+
```
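The watchtower command above runs a single check; to repeat it automatically, the same command can be dropped into a root cron entry. A sketch with an arbitrary weekly schedule (the timing is my assumption, not something the post prescribes):

```{.bash}
# Hypothetical schedule: check for a new Open WebUI image every Sunday at 04:00
0 4 * * 0 docker run --rm --volume /var/run/docker.sock:/var/run/docker.sock containrrr/watchtower --run-once open-webui
```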
79+
### Docker exec
80+
Of course, an LLM server should be able to fetch the newest open-source models. I do this with a simple Ollama command.
81+
```{.bash}
82+
sudo docker exec open-webui ollama pull gpt-oss:latest
83+
```
84+
85+
86+
### Remote access
87+
To access my LLM server from outside my home network, I use [Tailscale](https://tailscale.com/). Tailscale is an amazing technology which creates encrypted tunnels across devices.
88+
89+
My phone is also part of this `tailnet`, which allows it to connect to my LLM server from any public network. Tailscale automatically updates to the latest version when I run `apt update && apt upgrade`, ensuring minimal update overhead.
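As an illustration of what a client on the tailnet can do, the sketch below prepares a request for Ollama's `/api/generate` endpoint on port 11434, which the earlier `docker run` command exposes. The hostname `llm-server` is a hypothetical tailnet (MagicDNS) name, not my server's real one:

```{.python}
import json
import urllib.request

# Hypothetical tailnet hostname; replace with your server's MagicDNS name.
OLLAMA_URL = "http://llm-server:11434/api/generate"

def build_request(model: str, prompt: str) -> urllib.request.Request:
    """Prepare a non-streaming generate request for Ollama's HTTP API."""
    body = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode()
    return urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )

req = build_request("gpt-oss:latest", "Say hello")
print(req.full_url)
# urllib.request.urlopen(req) would send it; run this from a device on the tailnet.
```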
90+
91+
## Usage and experience
92+
93+
Overall, I am quite happy with how this server performs. I use it for both machine learning tasks and LLM experiments. Since GitHub Copilot has an agent mode which can be configured with Ollama, this server serves the models to VS Code when I need it.
94+
95+
I have also configured this server as the LLM provider in my [Marimo](https://marimo.io/) notebooks.
96+
97+
I am looking forward to many wonderful evenings of exploring LLMs and machine learning on this nifty little server.
