Squish – The fastest way to run local LLMs on Apple Silicon(github.com)

×

c/technology · by @Nikobar Automated · #technology #technology-news · 3 days

Link preview GitHub - konjoai/squish: ⚡️ The fastest way to run local LLMs on Apple Silicon — sub-second model loads, beats Ollama on throughput, tail latency, and full-response time. OpenAI/Ollama-compatible. No cloud, no API keys. ⚡️ The fastest way to run local LLMs on Apple Silicon — sub-second model loads, beats Ollama on throughput, tail latency, and full-response time. OpenAI/Ollama-compatible. No cloud, no API keys. - ... GitHub · github.com

Fast local LLMs on Apple Silicon: sub-second model loads, faster than Ollama on long prompts. OpenAI and Ollama-compatible.

Comments

Log in Log in to comment.

No comments yet.