Commit Graph

169 Commits

Author SHA1 Message Date
Lumpiasty 374ee146fe update llama-swap image
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-06-12 18:17:52 +02:00
Lumpiasty 2380cd16e4 add more gemma 4 26b variants
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-06-12 18:01:27 +02:00
Lumpiasty aab4bc279c Merge pull request 'Update ghcr.io/remsky/kokoro-fastapi-cpu Docker tag to v0.5.0' (#324) from renovate/ghcr.io-remsky-kokoro-fastapi-cpu-0.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline was canceled
Reviewed-on: #324
2026-06-11 21:46:39 +00:00
Lumpiasty 7f6439d64a switch gemma 4 quant, add mtp and nothink variants
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-06-11 21:06:12 +02:00
Lumpiasty 8860f6782e add converting proxy to parakeet
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-06-10 20:46:17 +02:00
Lumpiasty f863a0a496 use parakeet.cpp instead of whisper
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-06-10 20:08:14 +02:00
Renovate fb4fa9b0e7 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-06-09
2026-06-09 15:40:20 +00:00
Renovate 13a87e5b00 Update ghcr.io/remsky/kokoro-fastapi-cpu Docker tag to v0.5.0
2026-06-07 02:02:49 +00:00
Renovate 87c56a9ca1 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-06-06
2026-06-07 02:01:15 +00:00
Renovate 43c2036642 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-06-05
2026-06-06 02:01:10 +00:00
Renovate fe607d3fb8 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-06-04
2026-06-05 02:01:18 +00:00
Renovate 0dd86e3321 Merge pull request 'Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-06-03' (#318) from renovate/ghcr.io-mostlygeek-llama-swap-2026.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-06-04 02:02:56 +00:00
Renovate 9dd0c7eb0a Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-06-03
2026-06-04 02:01:07 +00:00
Renovate 68b480299d Update caddy Docker tag to v2.11.4
2026-06-04 02:01:05 +00:00
Renovate 149e85762f Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-06-02
2026-06-03 02:01:32 +00:00
Lumpiasty af4a7fee48 go back to official llama-swap image
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-05-31 16:23:50 +02:00
Lumpiasty 6546676dd6 add llama-swap optimizations recommended by claude
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-31 05:18:47 +02:00
Lumpiasty 353155f7ad Enable DMA transfer queue on llama-swap
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-31 04:25:08 +02:00
Lumpiasty 172fbb1ded Test updated base-image llama-swap
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-05-31 03:15:44 +02:00
Renovate fa85180736 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-29
2026-05-30 02:01:05 +00:00
Renovate eb579d2632 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-28
2026-05-29 02:01:16 +00:00
Lumpiasty cd0e92379f Merge pull request 'Update ghcr.io/remsky/kokoro-fastapi-cpu Docker tag to v0.4.0' (#306) from renovate/ghcr.io-remsky-kokoro-fastapi-cpu-0.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline was canceled
Reviewed-on: #306
2026-05-28 17:19:47 +00:00
Renovate f68f2e1d38 Update ghcr.io/remsky/kokoro-fastapi-cpu Docker tag to v0.4.0
2026-05-26 02:02:30 +00:00
Renovate a2d193e87d Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-25
2026-05-26 02:00:45 +00:00
Lumpiasty fc58a6507b disable mlock
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-05-24 19:16:24 +02:00
Renovate 1d6a94b5b4 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-24
2026-05-24 17:06:03 +00:00
Lumpiasty 6096b7019d fix path to llama-server binary
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-05-23 09:32:11 +02:00
Renovate 37d42a8dd8 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-23
2026-05-23 07:25:07 +00:00
Lumpiasty c161da3657 add mlock and disable mmap in llama-server
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 23:18:05 +02:00
Lumpiasty fc2c15d154 move whisper to gpu
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 22:02:34 +02:00
Lumpiasty 02b3ec13b4 switch kokoro to remsky/Kokoro-FastAPI
2026-05-21 21:55:34 +02:00
Lumpiasty 989732e1b5 move kokoro to separate deployment
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 21:34:33 +02:00
Lumpiasty ab438be629 fix tts model path
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 21:11:29 +02:00
Lumpiasty 4556ca3c08 add ffmpeg for whisper
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 20:58:56 +02:00
Lumpiasty 611f9f3886 add tts and stt to llama-swap and openwebui
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-21 20:43:54 +02:00
Renovate 92bf792320 Update ghcr.io/mostlygeek/llama-swap Docker tag to unified-vulkan-2026-05-21
2026-05-21 17:40:41 +00:00
Lumpiasty cfa3df6d1a increase llama models PVC from 300Gi to 400Gi
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-20 19:49:25 +02:00
Lumpiasty c82f60e90a switch text encoder to ponpoke/flux2-klein-4b-uncensored-text-encoder Q4_K_M
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-20 17:50:06 +02:00
Lumpiasty b41342be01 switch image model to FLUX.2-klein-4B (Apache 2.0, 4-step, unified gen+edit)
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-20 16:49:30 +02:00
Lumpiasty d3434a4102 remove unused qwen nothink chat template
2026-05-20 01:38:24 +02:00
Lumpiasty de2822fee1 switch llama-swap to unified-vulkan image with FLUX.1-dev image generation
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
- Update deployment to unified-vulkan-2026-05-19 (includes llama-server,
  sd-server, whisper-server in one image)
- Fix binary paths: /app/llama-server -> llama-server (now on PATH)
- Migrate groups -> matrix to allow FLUX to evict the always-on 0.8B model
  when image generation is requested
- Add FLUX.1-dev Q4_K_S model via sd-server
- Configure OpenWebUI image generation to use llama-swap sd-server
- Update renovate versioning regex to treat all unified-vulkan date tags as
  patch updates for automerge
2026-05-20 01:11:57 +02:00
Lumpiasty 55ac337a63 enable MTP on MTP models
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-05-18 20:31:57 +02:00
Renovate cd8768de67 Update ghcr.io/mostlygeek/llama-swap Docker tag to v216
2026-05-18 17:59:16 +00:00
Lumpiasty 5397749a73 Merge pull request 'Update ghcr.io/mostlygeek/llama-swap Docker tag to v214-vulkan-b9174' (#288) from renovate/ghcr.io-mostlygeek-llama-swap-214.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
Reviewed-on: #288
2026-05-18 17:57:20 +00:00
Lumpiasty f3ad488bc8 add MTP version of Qwen3.6-35B
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-18 19:42:48 +02:00
Renovate 8f51671c35 Update ghcr.io/mostlygeek/llama-swap Docker tag to v214-vulkan-b9174
2026-05-17 12:46:14 +00:00
Renovate 38ad78e69b Update ghcr.io/mostlygeek/llama-swap Docker tag to v214
2026-05-17 02:01:24 +00:00
Renovate 2edcc0f4aa Merge pull request 'Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9115' (#277) from renovate/ghcr.io-mostlygeek-llama-swap-211.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline was canceled
2026-05-13 02:05:28 +00:00
Renovate 2af6065421 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9115
2026-05-13 02:05:24 +00:00
Renovate b30d77436f Update caddy Docker tag to v2.11.3
2026-05-13 02:05:22 +00:00