Commit Graph

100 Commits

Author SHA1 Message Date
Renovate 0eae56bc4e Update ghcr.io/mostlygeek/llama-swap Docker tag to v204 2026-04-22 02:01:04 +00:00
Lumpiasty 328b14ded7 add cpu version of Qwen3.5-35B-A3B
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-04-18 23:21:52 +02:00
Lumpiasty bc7eb5f0c5 Revert "switch llama from vulkan to rocm"
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
This reverts commit 0e398706ab.
2026-04-18 16:42:02 +02:00
Lumpiasty 0e398706ab switch llama from vulkan to rocm
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-04-18 15:49:17 +02:00
Lumpiasty 480fb1c6d6 set bonsai model ctx to 64k 2026-04-18 15:48:30 +02:00
Renovate b7a82eb9af Update ghcr.io/mostlygeek/llama-swap Docker tag to v202 2026-04-18 02:01:09 +00:00
Lumpiasty ae7f53c395 add bonsai model
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
ci/woodpecker/cron/renovate Pipeline was successful
2026-04-14 18:31:18 +02:00
Renovate 98c58e857c Update ghcr.io/mostlygeek/llama-swap Docker tag to v201 2026-04-14 16:05:21 +00:00
Renovate c2f2832f6b Update ghcr.io/mostlygeek/llama-swap Docker tag to v199-vulkan-b8720 2026-04-14 16:00:34 +00:00
Renovate 08734503bc Update ghcr.io/mostlygeek/llama-swap Docker tag to v199-vulkan-b8684 2026-04-08 13:28:25 +00:00
Renovate 083b8571bf Update ghcr.io/mostlygeek/llama-swap Docker tag to v199-vulkan-b8672 2026-04-07 15:54:15 +00:00
Renovate 69497a35e3 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8667 2026-04-07 00:00:32 +00:00
Renovate fe0d090ebc chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8660 2026-04-06 00:00:41 +00:00
Renovate 817cdd2ec7 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8643 2026-04-05 00:00:38 +00:00
Lumpiasty a0814e76ee increase pvc for llama to 300 Gi
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-04-04 22:49:26 +02:00
Lumpiasty 8160a52176 add gemma 4 models
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-04-04 02:48:02 +02:00
Lumpiasty ad3b2229c2 get rid of openrouter proxying via llama-swap
ci/woodpecker/push/flux-reconcile-source Pipeline was successful
2026-04-04 02:39:26 +02:00
Renovate e923fc3c30 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8637 2026-04-04 00:00:54 +00:00
Renovate 4e30c9b94d chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8606 2026-04-03 00:00:32 +00:00
Renovate 3d53b4b10b chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8589 2026-04-02 00:00:30 +00:00
Lumpiasty 054df42d8b update qwen3.5 4b ctx size to 128k 2026-03-30 21:05:00 +02:00
Renovate e485a4fc7f chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8576 2026-03-30 00:00:49 +00:00
Lumpiasty 9e74ed6a19 increase --fit-target to 1.5GB 2026-03-29 23:50:45 +02:00
Renovate 99bc04b76a chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8562 2026-03-29 00:00:50 +00:00
Renovate cb53301926 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199-vulkan-b8547 2026-03-27 17:42:04 +00:00
Renovate 66cb3c9d82 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v199 2026-03-27 00:00:28 +00:00
Renovate 9a1fe1f740 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v198-vulkan-b8508 2026-03-26 00:00:49 +00:00
Renovate 8cf02fea0e chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v198-vulkan-b8496 2026-03-25 00:00:29 +00:00
Renovate 1d85bf3a88 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v198-vulkan-b8477 2026-03-24 00:00:39 +00:00
Renovate bfede17c87 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v198-vulkan-b8468 2026-03-23 00:00:21 +00:00
Renovate 471c0ba62d chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v198-vulkan-b8461 2026-03-22 00:00:23 +00:00
Renovate 8717526358 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v198-vulkan-b8445 2026-03-20 22:31:36 +00:00
Lumpiasty ce0b13ebb3 change kv cache quant to q8_0 2026-03-20 00:57:39 +01:00
Renovate 73d6d1f15a chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v198-vulkan-b8400 2026-03-19 00:00:34 +00:00
Renovate 8d994e7aa1 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v198-vulkan-b8390 2026-03-18 00:00:28 +00:00
Lumpiasty 7e7b3e3d71 add max ctx on llama.cpp 2026-03-17 01:33:35 +01:00
Renovate 82864a4738 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v198-vulkan-b8369 2026-03-17 00:00:58 +00:00
Lumpiasty 79315d32db add GLM-4.7-Flash model 2026-03-16 18:19:28 +01:00
Renovate afbcea4e82 chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v198-vulkan-b8352 2026-03-15 17:40:26 +00:00
Renovate 4b4cec10be chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v198 2026-03-15 00:00:34 +00:00
Lumpiasty 2d295d24e0 add 27b q3 variant of qwen3.5 2026-03-13 04:00:10 +01:00
Lumpiasty e8efa9ddc1 lower kv cache quant to q4_0 and increase ctx to 64k 2026-03-13 04:00:10 +01:00
Lumpiasty c88dd2899a remove ttl of all models in llama-swap 2026-03-13 04:00:10 +01:00
Renovate f219abb74f chore(deps): update ghcr.io/mostlygeek/llama-swap docker tag to v197-vulkan-b8248 2026-03-13 04:00:10 +01:00
Lumpiasty 0130991c74 refactor: add move llama-swap package config to renovate.json 2026-03-13 04:00:10 +01:00
Lumpiasty 966d2c50c0 update renovate comment for llama-swap image tag management 2026-03-13 04:00:10 +01:00
Renovate af737ab82b Update caddy Docker tag to v2.11.2 2026-03-13 04:00:10 +01:00
Lumpiasty 39fc38d62b add qwen3.5 4b heretic 2026-03-13 04:00:10 +01:00
Lumpiasty e72a79be8f add glm-5 from openrouter to llama-swap 2026-03-13 04:00:10 +01:00
Lumpiasty 4fda343b01 clean up llama-swap config 2026-03-13 04:00:10 +01:00