Do I need industry-grade GPUs, or can I scrape by and get decent TPS with a consumer-level GPU?

    • breakingcups@lemmy.world · English · 2 upvotes · 4 days ago

      I still don’t understand why you can’t distribute a large LLM over many different processors, each holding a section of the parameters in memory.

      Because every output of a layer feeds into every unit of the next layer, the processors have to exchange their partial results at every layer. That makes the bandwidth requirements enormous, and regular networking solutions are insufficient for that.
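To make that concrete, here is a minimal sketch in plain Python, not how any real inference framework does it. It assumes a toy model of bias-free dense layers and a naive scheme where each layer's output columns are split evenly across devices. The helper names (`dense`, `split_columns`, `sharded_forward`) are made up for illustration. It counts how many values have to cross between devices so that every device holds the full activation before the next layer starts.

```python
def dense(x, w):
    """One dense layer without bias or activation: y[j] = sum_i x[i] * w[i][j]."""
    return [sum(x[i] * w[i][j] for i in range(len(x))) for j in range(len(w[0]))]


def split_columns(w, n_devices):
    """Give each device a contiguous slice of the layer's output columns.

    Assumption: the column count divides evenly by n_devices.
    """
    step = len(w[0]) // n_devices
    return [[row[d * step:(d + 1) * step] for row in w] for d in range(n_devices)]


def sharded_forward(x, layers, n_devices):
    """Run the layers with each one split across devices; count values sent."""
    values_sent = 0
    for w in layers:
        shards = split_columns(w, n_devices)
        # Each device computes its own slice of the output independently.
        partials = [dense(x, shard) for shard in shards]
        # Every device needs the full output before the next layer, so each
        # slice is sent to the other (n_devices - 1) devices.
        values_sent += sum(len(p) for p in partials) * (n_devices - 1)
        # Concatenate the slices back into the full activation vector.
        x = [v for p in partials for v in p]
    return x, values_sent
```

Under this naive scheme, a hypothetical model with 8,192-wide layers split over 4 devices would move 8192 * 3 values per layer for every single token. Real tensor-parallel setups use smarter collective operations, but the devices still have to synchronize at every layer, which is why the interconnect matters so much.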

    • tpWinthropeIII@lemmy.world · English · 3 upvotes · 6 days ago

      Not exactly. Digits still uses a Blackwell GPU, only it uses unified RAM as virtual VRAM instead of actual VRAM. The GPU is probably a downclocked Blackwell. Speculation I’ve seen is that these are defective, repurposed Blackwells, which is good for us. By defective I mean they can’t run at full speed, are projected to have the die-cracking problem, etc.