<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>devtake.dev — #local-inference</title><description>Articles tagged local-inference on devtake.dev.</description><link>https://devtake.dev/</link><language>en-us</language><item><title>Running a coding agent fully on Apple Silicon, no cloud, is now an off-the-shelf stack</title><link>https://devtake.dev/article/local-coding-agents-mac/</link><guid isPermaLink="true">https://devtake.dev/article/local-coding-agents-mac/</guid><description>A popular Hacker News how-to walked through a fully local coding agent on Apple Silicon. Here&apos;s the realistic 2026 stack: runner, model, and harness.</description><pubDate>Sat, 13 Jun 2026 12:30:00 GMT</pubDate><category>ai</category><category>ai</category><category>llm</category><category>local-inference</category><category>ai-agents</category><category>agentic-coding</category><category>open-weights</category><category>mac</category><category>moe</category><author>dieter-morelli</author></item><item><title>A crafted Ollama model file leaks the whole server&apos;s memory. 300,000 instances are exposed.</title><link>https://devtake.dev/article/ollama-bleeding-llama-cve-2026-7482/</link><guid isPermaLink="true">https://devtake.dev/article/ollama-bleeding-llama-cve-2026-7482/</guid><description>Cyera disclosed CVE-2026-7482 on May 1, a CVSS 9.1 unauthenticated heap read in Ollama. Three API calls dump prompts, env vars, and API keys from any open instance.</description><pubDate>Mon, 11 May 2026 10:00:00 GMT</pubDate><category>security</category><category>security</category><category>ollama</category><category>llm</category><category>cve-2026-7482</category><category>local-inference</category><category>memory</category><category>cyera</category><category>ai-security</category><author>luca-reinhardt</author></item><item><title>AMD&apos;s &apos;Gorgon Halo&apos; refresh leaks with 192GB memory. Strix Halo tops out at 128GB.</title><link>https://devtake.dev/article/amd-gorgon-halo-ryzen-ai-max-495-leak/</link><guid isPermaLink="true">https://devtake.dev/article/amd-gorgon-halo-ryzen-ai-max-495-leak/</guid><description>A leaked Geekbench listing puts AMD&apos;s Ryzen AI Max+ 495 on a 192GB platform with a Radeon 8065S iGPU. The Strix Halo chip it replaces capped at 128GB.</description><pubDate>Mon, 04 May 2026 09:30:00 GMT</pubDate><category>hardware</category><category>amd</category><category>ryzen-ai-max</category><category>gorgon-halo</category><category>strix-halo</category><category>local-inference</category><category>laptop</category><category>lpcamm2</category><category>hardware</category><author>hiro-tanaka</author></item><item><title>Apple killed the $599 Mac mini. The cheapest one is now $799 with 512GB.</title><link>https://devtake.dev/article/apple-mac-mini-base-discontinued-799/</link><guid isPermaLink="true">https://devtake.dev/article/apple-mac-mini-base-discontinued-799/</guid><description>Apple quietly pulled the 256GB Mac mini from its store on May 1. Tim Cook had warned the day before that demand was outpacing supply for months.</description><pubDate>Sat, 02 May 2026 09:15:00 GMT</pubDate><category>apple</category><category>apple</category><category>mac-mini</category><category>m4</category><category>pricing</category><category>hardware</category><category>mac</category><category>local-inference</category><author>naomi-park</author></item><item><title>Qwen 3.6-35B-A3B: the open MoE beating Opus 4.7 on Simon Willison&apos;s laptop</title><link>https://devtake.dev/article/qwen-3-6-35b-a3b-beats-opus-on-laptop/</link><guid isPermaLink="true">https://devtake.dev/article/qwen-3-6-35b-a3b-beats-opus-on-laptop/</guid><description>Alibaba&apos;s Qwen 3.6-35B-A3B is a 35B-param mixture-of-experts with only 3B active. Apache 2.0, runs on consumer GPUs, and it&apos;s already winning real tasks.</description><pubDate>Fri, 17 Apr 2026 10:00:00 GMT</pubDate><category>ai</category><category>qwen</category><category>alibaba</category><category>open-source</category><category>moe</category><category>llm</category><category>local-inference</category><category>open-weights</category><author>dieter-morelli</author></item></channel></rss>