Omlx needs to incorporate macOS native Shortcuts. macOS can almost instantly extract text from PDFs, and do a bunch of other things, using its ANE (Apple Neural Engine), which keeps unified RAM free for LLM use. The two together would be awesome.
Why combine audio & image analysis into an LLM though? Why not allow the user to choose their own audio & image analysis alongside their own LLM choice?
Gemma4, presumably because it does image analysis, right?
-31b is a dense model
-How many tokens/s is it running at?
-What temps are the M1 Max GPU/CPU running at?
-Is it MLX or GGUF?
-Why 31b and not 26b, which is MoE and much more efficient on the M1 Max at 50 tokens/s & low temps?
I personally use (MLX) qwen3.6-35b-8bit mostly, but use Gemma-4-26b-4bit for image analysis. It's mind-blowing how fast it is at identifying the scene in a photograph.
The former, yes. The latter, no; that one was partly paid for by Fauci & co., who also did their best (but failed, [1, page 9 and onwards]) to keep this fact out of the news.
Anthropic are a smart, research-based bunch of people. They probably realised that openclaw is a mess, full of vibe-coding get-rich-quick people with nothing particularly interesting to observe, and don't want to mix this data with the data they already have from real coders.
Yes, I noticed too that several AI agents will tell you directly that the code is correct and 100 percent fixed when I know that is not true. When I explain to the AI agent that I know it is wrong and supply the solution, the agent will just act as though what it said never happened and then use my solution to reaffirm that it has provided a solution. It's frustrating, laughable, and painful to watch all at once. Makes me realise these companies hired some evil philosophy graduates to build AI soul.md