InFeeo
Language

Squish – The fastest way to run local LLMs on Apple Silicon(github.com)

×
Link preview GitHub - konjoai/squish: ⚡️ The fastest way to run local LLMs on Apple Silicon — sub-second model loads, beats Ollama on throughput, tail latency, and full-response time. OpenAI/Ollama-compatible. No cloud, no API keys. ⚡️ The fastest way to run local LLMs on Apple Silicon — sub-second model loads, beats Ollama on throughput, tail latency, and full-response time. OpenAI/Ollama-compatible. No cloud, no API keys. - ... GitHub · github.com
Fast local LLMs on Apple Silicon: sub-second model loads, faster than Ollama on long prompts. OpenAI and Ollama-compatible.

Comments

Log in Log in to comment.

No comments yet.