For some weird reason, in my country it’s easier to order a Beelink or a Framework than an HP. They will sell everything else, except what you want to buy.
I ordered a Beelink GTR9 Pro which should hopefully arrive next month.
Really excited to play around with it; the 24 GB of VRAM in my 7900 XTX just don’t cut it for local LLMs.
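To see why 24 GB runs out fast, here's a rough back-of-envelope estimate of weight memory for quantized models. The bits-per-weight figures are illustrative assumptions, not exact GGUF file sizes (real files carry extra overhead, and the KV cache grows with context length):

```python
# Rough VRAM estimate for quantized LLM weights (sketch, not exact).
# bits_per_weight is an assumed average; real quant formats vary.

def est_weights_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate weight memory in GB for a quantized model."""
    return params_billions * 1e9 * bits_per_weight / 8 / 1e9

# A ~70B model at ~4.5 bits/weight needs roughly 39 GB of weights alone:
# too big for a 24 GB card, but comfortable in 96+ GB of unified memory.
print(round(est_weights_gb(70, 4.5), 1))  # ~39.4
print(round(est_weights_gb(8, 4.5), 1))   # ~4.5
```

That's the appeal of the 395's large unified memory: models that simply don't fit on a single consumer GPU.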
There are a lot of benchmarks for the 395 processor here: https://kyuz0.github.io/amd-strix-halo-toolboxes/
They are leaving a lot of performance (and VRAM) on the table by doing this on Windows.
Seems pretty decent, but I wonder how it compares to an AI-optimized desktop build with the same $2000 budget.
It will probably kick the ass of that desktop. $2000 won’t get you far with a conventional build.
Well, that’s why I said “AI optimized”.
Even my 5-year-old $900 rig can output around 4 tps.
There is nothing “optimized” at $2000 that will get you better inference performance on medium/large models.
With what model? GPT oss or something else?
Llama 3 8B Instruct: 25 tps
DeepSeek R1 Distill Qwen 14B: 3.2 tps
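For context, those tps figures are typically just generated tokens divided by wall-clock time. A minimal sketch of that measurement, with `generate` as a hypothetical stand-in for whatever inference call you're timing:

```python
import time

def tokens_per_second(generate, n_tokens: int) -> float:
    """Time a generation callable and return tokens/sec.
    `generate` is a placeholder for your actual inference call."""
    start = time.perf_counter()
    generate(n_tokens)
    elapsed = time.perf_counter() - start
    return n_tokens / elapsed

# At 3.2 tps, a 500-token answer takes about 156 seconds,
# which is why low-single-digit tps feels unusable interactively.
print(round(500 / 3.2))  # ~156
```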
To be fair: I bought the motherboard, CPU, and RAM 6 years ago along with an Nvidia 1660, then bought the Radeon RX 6600 XT on release in 2021, so 4 years ago. But it’s a generic gaming rig.
I would be surprised if $2000 worth of modern hardware, picked for this specific task, would be worse than that mini PC.
I promise. It’s not possible. But things change quickly of course.
(Unless you’re lucky/pro and get your hands on some super cheap used high end hardware…)
To be honest that is pretty good. Thanks!