Qwen3-32b: Windows95 starfield screensaver web app with warp drive on click

xodoh74984@lemmy.world · edit-2 2 months ago

Qwen3-32b: Windows95 starfield screensaver web app with warp drive on click

SmokeyDope@lemmy.world · 2 months ago

If you were running amd GPU theres some versions of llama.cpp engine you can compile with rocm compat. If your ever tempted to run a huge model with partial offloaded CPU/ram inferencing you can set the program to run with highest program niceness priority which believe it or not pushes up the token speed slightly