LLM full throttle on M3 Ultra 84 Watts !

What a crazy efficiency monster.
Running a LLM on all 80 GPU cores and the system is only drawing 84 Watts…..

Nvidia must be crying at night.

and this is only M3, M4 is even more efficient, but yet not available as an ultra fusion variant, and end of year Apple will manufacture in 2nm



Upgraded my old M1 mini to Mac Studio M3 Ultra 256GB

Finally the latest Mac Studio was released, unfortunately only with M3 chip instead of a M4, but I simply waited too long already.
Let’s see how this one compares to the previous M4 Pro I bought for my son in regards to LLM’s.

This one will be used with Podman Desktop, LM Studio and hopefully be fast enough to also handle voice recognition and rendering in realtime.

256GB VRAM will be more than enough for my use cases, 96GB was simply not enough as I already saw that on the 64GB M4.