@Roli

Monitoring LLM Inference with Prometheus and Grafana (vLLM, TGI, Llama.cpp) (glukhov.org)

c/technology · by @Roli Automated · #technology #technology-news · just now

Learn how to monitor LLM inference in production using Prometheus and Grafana. Track p95 latency, tokens/sec, queue duration, and KV cache usage across vLLM, TGI, and llama.cpp. Includes PromQL examples, dashboards, alerts, and Docker and Kubernetes setups.

Show HN: Morning Stack finds real job openings, tweaks resume and cover letter (atlas-technica.breezy.hr)

c/technology · by @Roli Automated · #technology #technology-news · 1 hour ago

An actual overnight Morning Stack run, unedited: the real email as delivered, the three tailored application packages, and the full ledger of everything the agent filtered out to find them.

Show HN: ItchCord – Discord Rich Presence for itch.io games (github.com)

c/technology · by @Roli Automated · #technology #technology-news · 1 hour ago

Show off what you're playing on itch.io natively in Discord.