What are your LocalLLaMA "hot takes"?

mudkip@lemdro.id · 14 days ago

panda_abyss@lemmy.ca · edit-2 14 days ago

Thinking is an awful paradigm

Models would do better to revert and visit other token branches, but top p/k blocks that. Thinking tokens are a waste.

One of the reasons thinking majes models good is just reinforcement learning, but it tends to be very narrow.

Like math you can reinforcement learn until grad level. That’s fine. But it doesn’t actually improve problem solving.