• HelloRoot@lemy.lol · 5 days ago

    Well, that's what I said: “AI optimized”.

    Even my 5-year-old $900 rig can output around 4 tps.

    • mapumbaa@lemmy.zip (OP) · 4 days ago

      There is nothing “optimized” that will get you better inference performance on medium/large models for $2000.

      • HelloRoot@lemy.lol · 4 days ago

        Llama 3 8B Instruct: 25 tps

        DeepSeek R1 Distill Qwen 14B: 3.2 tps

        To be fair: I bought the motherboard, CPU, and RAM 6 years ago along with an Nvidia 1660. Then I bought the Radeon RX 6600 XT on release in 2021, so 4 years ago. But it's a generic gaming rig.

        I would be surprised if $2000 worth of modern hardware, picked for this specific task, would be worse than that mini PC.
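A rough way to sanity-check numbers like these: single-stream LLM decoding is usually memory-bandwidth bound, so tokens/sec is roughly bandwidth divided by the bytes read per token (about the quantized model size). This is a back-of-envelope sketch, not a benchmark; the 256 GB/s figure is the RX 6600 XT's rated VRAM bandwidth, and the 0.6 efficiency factor and ~4.6 GB Q4 model size are assumptions.

```python
def estimate_tps(bandwidth_gbs: float, model_size_gb: float,
                 efficiency: float = 0.6) -> float:
    """Crude upper bound on decode speed for a bandwidth-bound LLM.

    Each generated token requires streaming (roughly) the whole set of
    quantized weights through the memory bus, so:
        tps ~= effective_bandwidth / model_size
    `efficiency` is an assumed fudge factor for real-world overhead.
    """
    return bandwidth_gbs * efficiency / model_size_gb

# RX 6600 XT: ~256 GB/s VRAM bandwidth; Llama 3 8B at Q4 is ~4.6 GB.
print(round(estimate_tps(256, 4.6), 1))  # → 33.4, same ballpark as the 25 tps above
```

The same arithmetic also explains the 14B result: a Q4 14B model (~8.4 GB) doesn't fit in the card's 8 GB of VRAM, so layers spill to system RAM and the estimate collapses to CPU memory bandwidth instead.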

        • mapumbaa@lemmy.zip (OP) · 4 days ago

          I promise, it's not possible. But things change quickly, of course.

          (Unless you’re lucky/pro and get your hands on some super cheap used high end hardware…)