<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>devtake.dev — #on-device-ai</title><description>Articles tagged on-device-ai on devtake.dev.</description><link>https://devtake.dev/</link><language>en-us</language><item><title>Cactus Compute distilled Gemini into a 26M tool-calling model. The trick: no feed-forward layers.</title><link>https://devtake.dev/article/needle-cactus-compute-tool-calling/</link><guid isPermaLink="true">https://devtake.dev/article/needle-cactus-compute-tool-calling/</guid><description>Needle is a 26M-parameter function caller distilled from Gemini 3.1 Flash-Lite. The Simple Attention Network drops MLPs and runs at 6,000 tok/s prefill on edge silicon.</description><pubDate>Wed, 13 May 2026 10:00:00 GMT</pubDate><category>ai</category><category>ai-models</category><category>gemini</category><category>open-weights</category><category>function-calling</category><category>tool-calling</category><category>edge-ai</category><category>on-device-ai</category><category>small-models</category><author>dieter-morelli</author></item><item><title>Anker built its own AI chip. It runs neural nets inside flash memory cells.</title><link>https://devtake.dev/article/anker-thus-on-device-ai-chip/</link><guid isPermaLink="true">https://devtake.dev/article/anker-thus-on-device-ai-chip/</guid><description>Anker&apos;s Thus chip embeds compute inside NOR flash, claims 150x more on-device AI for noise cancellation, and ships in Soundcore earbuds on May 21. Here&apos;s why it matters.</description><pubDate>Mon, 27 Apr 2026 08:00:00 GMT</pubDate><category>hardware</category><category>anker</category><category>ai-chips</category><category>custom-silicon</category><category>on-device-ai</category><category>compute-in-memory</category><category>soundcore</category><category>semiconductor</category><author>hiro-tanaka</author></item></channel></rss>