• 0 Posts
  • 9 Comments
Joined 2 years ago
cake
Cake day: July 9th, 2023

help-circle
  • Among the tested models, GPT-4 Turbo ranked highest with 46% accuracy, while Llama-3.1-8B scored the lowest at 33.6%.

    “The main takeaway from this study is that LLMs, while impressive, still lack the depth of understanding required for advanced history,” said del Rio-Chanona. “They’re great for basic facts, but when it comes to more nuanced, PhD-level historical inquiry, they’re not yet up to the task.”

    I’m sorry, you fucking what? How about you test the world’s population in PhD level history and see if you get a 46%? Are you fucking kidding me? You’re telling me this machine is half accurate on PhD history and you’re tryna act like that doesn’t just make your entire history department fucking useless? At most, you have 5 years until it’s better at the job than actual humans trained for it, because it’s already better than the public at large.


  • “Why did you pull me over?”

    “Sir, we’re here because your house was robbed.”

    Fake af. When your house is robbed, you can go fuck yourself - your shit is gone unless you’ve got GPS trackers in it. Here’s a more likely scenario:

    “Why did you pull me over?”

    “4 years ago you filed a report that your house was robbed. This is now becoming a problem, as people have noticed we do nothing for society, and your report is adding to that statistic. Would you like to close it, or shall we go ahead and process that broken taillight?”

    “what broken taillight?”

    “down on the ground! I said down!” <sounds of gunshots hitting car, sounds of body hitting steering wheel, sounds of prolonged honking, sounds of thin blue line erections, sounds of coke being sprinkled, sounds of policeman breaking taillight>