Commit Graph

154 Commits

Author SHA1 Message Date
Lumpiasty af4a7fee48 go back to official llama-swap image
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-05-31 16:23:50 +02:00
Lumpiasty 6546676dd6 add llama-swap optimizations recommended by claude
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-31 05:18:47 +02:00
Lumpiasty 353155f7ad Enable DMA transfer queue on llama-swap
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-31 04:25:08 +02:00
Lumpiasty 172fbb1ded Test updated base-image llama-swap
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-05-31 03:15:44 +02:00
Renovate fa85180736 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-29 2026-05-30 02:01:05 +00:00
Renovate eb579d2632 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-28 2026-05-29 02:01:16 +00:00
Lumpiasty cd0e92379f Merge pull request 'Update ghcr.io/remsky/kokoro-fastapi-cpu Docker tag to v0.4.0' (#306) from renovate/ghcr.io-remsky-kokoro-fastapi-cpu-0.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline was canceled
Reviewed-on: #306
2026-05-28 17:19:47 +00:00
Renovate f68f2e1d38 Update ghcr.io/remsky/kokoro-fastapi-cpu Docker tag to v0.4.0 2026-05-26 02:02:30 +00:00
Renovate a2d193e87d Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-25 2026-05-26 02:00:45 +00:00
Lumpiasty fc58a6507b disable mlock
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-05-24 19:16:24 +02:00
Renovate 1d6a94b5b4 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-24 2026-05-24 17:06:03 +00:00
Lumpiasty 6096b7019d fix path to llama-server binary
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-05-23 09:32:11 +02:00
Renovate 37d42a8dd8 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-23 2026-05-23 07:25:07 +00:00
Lumpiasty c161da3657 add mlock and disable mmap in llama-server
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 23:18:05 +02:00
Lumpiasty fc2c15d154 move whisper to gpu
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 22:02:34 +02:00
Lumpiasty 02b3ec13b4 switch kokoro to remsky/Kokoro-FastAPI 2026-05-21 21:55:34 +02:00
Lumpiasty 989732e1b5 move kokoro to separate deployment
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 21:34:33 +02:00
Lumpiasty ab438be629 fix tts model path
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 21:11:29 +02:00
Lumpiasty 4556ca3c08 add ffmpeg for whisper
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 20:58:56 +02:00
Lumpiasty 611f9f3886 add tts and sst to llama-swap and openwebui
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 20:43:54 +02:00
Renovate 92bf792320 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-21 2026-05-21 17:40:41 +00:00
Lumpiasty cfa3df6d1a increase llama models PVC from 300Gi to 400Gi
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-20 19:49:25 +02:00
Lumpiasty c82f60e90a switch text encoder to ponpoke/flux2-klein-4b-uncensored-text-encoder Q4_K_M
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-20 17:50:06 +02:00
Lumpiasty b41342be01 switch image model to FLUX.2-klein-4B (Apache 2.0, 4-step, unified gen+edit)
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-20 16:49:30 +02:00
Lumpiasty d3434a4102 remove unused qwen nothink chat template 2026-05-20 01:38:24 +02:00
Lumpiasty de2822fee1 switch llama-swap to unified-vulkan image with FLUX.1-dev image generation
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
- Update deployment to unified-vulkan-2026-05-19 (includes llama-server,
  sd-server, whisper-server in one image)
- Fix binary paths: /app/llama-server -> llama-server (now on PATH)
- Migrate groups -> matrix to allow FLUX to evict the always-on 0.8B model
  when image generation is requested
- Add FLUX.1-dev Q4_K_S model via sd-server
- Configure OpenWebUI image generation to use llama-swap sd-server
- Update renovate versioning regex to treat all unified-vulkan date tags as
  patch updates for automerge
2026-05-20 01:11:57 +02:00
Lumpiasty 55ac337a63 enable MTP on MTP models
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-05-18 20:31:57 +02:00
Renovate cd8768de67 Update ghcr.io/mostlygeek/llama-swap Docker tag to v216 2026-05-18 17:59:16 +00:00
Lumpiasty 5397749a73 Merge pull request 'Update ghcr.io/mostlygeek/llama-swap Docker tag to v214-vulkan-b9174' (#288) from renovate/ghcr.io-mostlygeek-llama-swap-214.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
Reviewed-on: #288
2026-05-18 17:57:20 +00:00
Lumpiasty f3ad488bc8 add MTP version of Qwen3.6-35B
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-18 19:42:48 +02:00
Renovate 8f51671c35 Update ghcr.io/mostlygeek/llama-swap Docker tag to v214-vulkan-b9174 2026-05-17 12:46:14 +00:00
Renovate 38ad78e69b Update ghcr.io/mostlygeek/llama-swap Docker tag to v214 2026-05-17 02:01:24 +00:00
Renovate 2edcc0f4aa Merge pull request 'Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9115' (#277) from renovate/ghcr.io-mostlygeek-llama-swap-211.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline was canceled
2026-05-13 02:05:28 +00:00
Renovate 2af6065421 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9115 2026-05-13 02:05:24 +00:00
Renovate b30d77436f Update caddy Docker tag to v2.11.3 2026-05-13 02:05:22 +00:00
Renovate e1ed09f938 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9102 2026-05-12 21:01:42 +00:00
Renovate 8269533c45 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9093 2026-05-12 02:05:31 +00:00
Renovate cad7dab839 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9085 2026-05-11 02:01:10 +00:00
Renovate 5d18a56d3b Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9070 2026-05-10 02:01:10 +00:00
Renovate eee1cd6a66 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9049 2026-05-09 02:01:14 +00:00
Renovate 1d7fa75b70 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9038 2026-05-08 02:01:09 +00:00
Renovate 028f4b1560 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9014 2026-05-06 02:01:06 +00:00
Renovate 2d1ac75c7c Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9010 2026-05-05 02:01:38 +00:00
Lumpiasty 0062ed03d2 Merge pull request 'Update ghcr.io/mostlygeek/llama-swap Docker tag to v211' (#255) from renovate/ghcr.io-mostlygeek-llama-swap-211.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline failed
Reviewed-on: #255
2026-05-04 17:43:05 +00:00
Lumpiasty 8d0c9f7a0d increase llama ingress max request size
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-04 17:53:07 +02:00
Renovate 9fdb7e6f7b Update ghcr.io/mostlygeek/llama-swap Docker tag to v211 2026-05-04 02:01:17 +00:00
Renovate 1d1f782c56 Update ghcr.io/mostlygeek/llama-swap Docker tag to v210-vulkan-b8994 2026-05-03 02:04:18 +00:00
Renovate 3b55957687 Update ghcr.io/mostlygeek/llama-swap Docker tag to v210 2026-05-02 02:01:00 +00:00
Lumpiasty 8cbe2ef794 add abliterated versions of Qwen3.6-35B
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-04-30 18:35:32 +02:00
Renovate 284c97acc8 Update ghcr.io/mostlygeek/llama-swap Docker tag to v208-vulkan-b8953 2026-04-30 02:00:56 +00:00