Commit Graph

127 Commits

Author SHA1 Message Date
Renovate cd8768de67 Update ghcr.io/mostlygeek/llama-swap Docker tag to v216 2026-05-18 17:59:16 +00:00
Lumpiasty 5397749a73 Merge pull request 'Update ghcr.io/mostlygeek/llama-swap Docker tag to v214-vulkan-b9174' (#288) from renovate/ghcr.io-mostlygeek-llama-swap-214.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
Reviewed-on: #288
2026-05-18 17:57:20 +00:00
Lumpiasty f3ad488bc8 add MTP version of Qwen3.6-35B
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-18 19:42:48 +02:00
Renovate 8f51671c35 Update ghcr.io/mostlygeek/llama-swap Docker tag to v214-vulkan-b9174 2026-05-17 12:46:14 +00:00
Renovate 38ad78e69b Update ghcr.io/mostlygeek/llama-swap Docker tag to v214 2026-05-17 02:01:24 +00:00
Renovate 2edcc0f4aa Merge pull request 'Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9115' (#277) from renovate/ghcr.io-mostlygeek-llama-swap-211.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline was canceled
2026-05-13 02:05:28 +00:00
Renovate 2af6065421 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9115 2026-05-13 02:05:24 +00:00
Renovate b30d77436f Update caddy Docker tag to v2.11.3 2026-05-13 02:05:22 +00:00
Renovate e1ed09f938 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9102 2026-05-12 21:01:42 +00:00
Renovate 8269533c45 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9093 2026-05-12 02:05:31 +00:00
Renovate cad7dab839 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9085 2026-05-11 02:01:10 +00:00
Renovate 5d18a56d3b Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9070 2026-05-10 02:01:10 +00:00
Renovate eee1cd6a66 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9049 2026-05-09 02:01:14 +00:00
Renovate 1d7fa75b70 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9038 2026-05-08 02:01:09 +00:00
Renovate 028f4b1560 Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9014 2026-05-06 02:01:06 +00:00
Renovate 2d1ac75c7c Update ghcr.io/mostlygeek/llama-swap Docker tag to v211-vulkan-b9010 2026-05-05 02:01:38 +00:00
Lumpiasty 0062ed03d2 Merge pull request 'Update ghcr.io/mostlygeek/llama-swap Docker tag to v211' (#255) from renovate/ghcr.io-mostlygeek-llama-swap-211.x into fresh-start
ci/woodpecker/push/flux-reconcile-source Pipeline failed
Reviewed-on: #255
2026-05-04 17:43:05 +00:00
Lumpiasty 8d0c9f7a0d increase llama ingress max request size
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-05-04 17:53:07 +02:00
Renovate 9fdb7e6f7b Update ghcr.io/mostlygeek/llama-swap Docker tag to v211 2026-05-04 02:01:17 +00:00
Renovate 1d1f782c56 Update ghcr.io/mostlygeek/llama-swap Docker tag to v210-vulkan-b8994 2026-05-03 02:04:18 +00:00
Renovate 3b55957687 Update ghcr.io/mostlygeek/llama-swap Docker tag to v210 2026-05-02 02:01:00 +00:00
Lumpiasty 8cbe2ef794 add abliterated versions of Qwen3.6-35B
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-04-30 18:35:32 +02:00
Renovate 284c97acc8 Update ghcr.io/mostlygeek/llama-swap Docker tag to v208-vulkan-b8953 2026-04-30 02:00:56 +00:00
Renovate 4b1f047d9c Update ghcr.io/mostlygeek/llama-swap Docker tag to v208-vulkan-b8943 2026-04-29 02:04:52 +00:00
Renovate b1c98fbb2b Update ghcr.io/mostlygeek/llama-swap Docker tag to v208 2026-04-28 02:00:58 +00:00
Renovate f9367c3ea8 Update ghcr.io/mostlygeek/llama-swap Docker tag to v204-vulkan-b8864 2026-04-23 02:05:32 +00:00
Lumpiasty b0f20de80b qwen3.6 and cleanup of llama-swap config
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
- Deleting unused models
- Cleaned up, unified and fixed qwen3.5 sampling params to thinking and non-thinking params, no further differentiation
- kv cache quant q4_0 everywhere
2026-04-23 00:43:37 +02:00
Renovate 0eae56bc4e Update ghcr.io/mostlygeek/llama-swap Docker tag to v204 2026-04-22 02:01:04 +00:00
Lumpiasty 328b14ded7 add cpu version of Qwen3.5-35B-A3B
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-04-18 23:21:52 +02:00
Lumpiasty bc7eb5f0c5 Revert "switch llama from vulkan to rocm"
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
This reverts commit 0e398706ab.
2026-04-18 16:42:02 +02:00
Lumpiasty 0e398706ab switch llama from vulkan to rocm
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-04-18 15:49:17 +02:00
Lumpiasty 480fb1c6d6 set bonsai model ctx to 64k 2026-04-18 15:48:30 +02:00
Renovate b7a82eb9af Update ghcr.io/mostlygeek/llama-swap Docker tag to v202 2026-04-18 02:01:09 +00:00
Lumpiasty ae7f53c395 add bonsai model
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-04-14 18:31:18 +02:00
Renovate 98c58e857c Update ghcr.io/mostlygeek/llama-swap Docker tag to v201 2026-04-14 16:05:21 +00:00
Renovate c2f2832f6b Update ghcr.io/mostlygeek/llama-swap Docker tag to v199-vulkan-b8720 2026-04-14 16:00:34 +00:00
Renovate 08734503bc Update ghcr.io/mostlygeek/llama-swap Docker tag to v199-vulkan-b8684 2026-04-08 13:28:25 +00:00
Renovate 083b8571bf Update ghcr.io/mostlygeek/llama-swap Docker tag to v199-vulkan-b8672 2026-04-07 15:54:15 +00:00
Renovate 69497a35e3 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8667 2026-04-07 00:00:32 +00:00
Renovate fe0d090ebc chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8660 2026-04-06 00:00:41 +00:00
Renovate 817cdd2ec7 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8643 2026-04-05 00:00:38 +00:00
Lumpiasty a0814e76ee increase pvc for llama to 300 Gi
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-04-04 22:49:26 +02:00
Lumpiasty 8160a52176 add gemma 4 models
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-04-04 02:48:02 +02:00
Lumpiasty ad3b2229c2 get rid of openrouter proxying via llama-swap
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-04-04 02:39:26 +02:00
Renovate e923fc3c30 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8637 2026-04-04 00:00:54 +00:00
Renovate 4e30c9b94d chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8606 2026-04-03 00:00:32 +00:00
Renovate 3d53b4b10b chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8589 2026-04-02 00:00:30 +00:00
Lumpiasty 054df42d8b update qwen3.5 4b ctx size to 128k 2026-03-30 21:05:00 +02:00
Renovate e485a4fc7f chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8576 2026-03-30 00:00:49 +00:00
Lumpiasty 9e74ed6a19 increase --fit-target to 1.5GB 2026-03-29 23:50:45 +02:00