I still don't understand why you can't distribute a large LLM over many different processors, each holding a section of the parameters in memory.
Because in a dense layer every activation feeds into every unit of the next layer. So if you shard the parameters across machines, the devices have to exchange full activation vectors at every layer boundary, synchronously, for every single token. The bandwidth and especially the latency requirements for that are enormous, and regular networking is orders of magnitude too slow compared to the interconnects (NVLink, InfiniBand) that GPU clusters actually use for this.
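To get a feel for the numbers, here's a rough back-of-the-envelope sketch. All the figures in it are illustrative assumptions (roughly a 70B-parameter model shape: 80 layers, hidden size 8192, fp16 activations, and two all-reduces per layer as in Megatron-style tensor parallelism; the round-trip times are ballpark guesses, not measurements):

```python
# Back-of-the-envelope cost of tensor-parallel LLM inference over a network.
# All figures are illustrative assumptions (roughly a 70B-parameter model),
# not measurements of any specific system.

layers = 80                 # transformer blocks
hidden = 8192               # activation (hidden) dimension
bytes_per_act = 2           # fp16
sync_points = 2 * layers    # Megatron-style: one all-reduce after attention,
                            # one after the MLP, in every layer

# Each sync point all-reduces one activation vector across all shards.
bytes_per_token = sync_points * hidden * bytes_per_act
print(f"activation traffic per token: {bytes_per_token / 1e6:.1f} MB")  # ~2.6 MB

# The raw bandwidth is survivable, but every sync point is a blocking
# round trip, so the network latency gets multiplied by 160 per token:
for name, rtt_s in [("NVLink (~2 us RTT)", 2e-6),
                    ("datacenter Ethernet (~100 us RTT)", 100e-6),
                    ("internet peers (~50 ms RTT)", 50e-3)]:
    print(f"{name}: {sync_points * rtt_s * 1e3:.1f} ms of latency per token")
```

Under those assumptions the traffic per token is only a few MB, but the 160 synchronous round trips per token mean ~0.3 ms of latency overhead on NVLink, ~16 ms on ordinary Ethernet, and ~8 seconds per token between internet peers. That's why "just spread the weights over many machines" only works with datacenter-grade interconnects.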