@nednobbins

nednobbins@lemm.ee · 21 days ago

I wouldn’t either but that’s exactly what lmsys.org found.

That blog post had ratings between 858 and 1169. Those are slightly higher than the average rating of human users on popular chess sites. Their latest leaderboard shows them doing even better.

https://lmarena.ai/leaderboard has one of the Gemini models with a rating of 1470. That’s pretty good.

nednobbins@lemm.ee · 21 days ago

I imagine the “author” did something like, “Search http://google.scholar.com/ find a publication where AI failed at something and write a paragraph about it.”

It’s not even as bad as the article claims.

Atari isn’t great at chess. https://chess.stackexchange.com/questions/24952/how-strong-is-each-level-of-atari-2600s-video-chess
Random LLMs were nearly as good 2 years ago. https://lmsys.org/blog/2023-05-03-arena/
LLMs that are actually trained for chess have done much better. https://arxiv.org/abs/2501.17186

nednobbins@lemm.ee · 21 days ago

Like humans are way better at answering stuff when it’s a collaboration of more than one person. I suspect the same is true of LLMs.

It is.

It’s really common for non-language implementations of neural networks. If you have an NN that’s right some percentage of the time, you can often run it through a bunch of copies of the NNs and take the average and that average is correct a higher percentage of the time.

Aider is an open source AI coding assistant that lets you use one model to plan the coding and a second one to do the actual coding. It works better than doing it in a single pass, even if you assign the the same model to planing and coding.

nednobbins@lemm.ee · 21 days ago

Sometimes it seems like most of these AI articles are written by AIs with bad prompts.

Human journalists would hopefully do a little research. A quick search would reveal that researches have been publishing about this for over a year so there’s no need to sensationalize it. Perhaps the human journalist could have spent a little time talking about why LLMs are bad at chess and how researchers are approaching the problem.

LLMs on the other hand, are very good at producing clickbait articles with low information content.

nednobbins@lemm.ee · 1 month ago

It’s 27T Pro. I like it better than the iPhone it replaced.

The only downsides I’ve seen so far are that it requires a separate app for wifi calling and it has fewer zoom options for the camera. I’d like to figure out how to get the IR blaster to read signals (so I can easily clone my remotes).

nednobbins@lemm.ee · 1 month ago

Yeah. I’m typing this on a $300 Chinese phone with 10600mAH battery, reverse wireless charging, a thermal imaging camera, and it’s waterproof and shock resistant.

nednobbins@lemm.ee · 1 month ago

You can add lots of things. I tend to throw in liberal amounts of smashed garlic and some mustard. Depending on what’s available in the garden, I may throw in some fresh herbs. Sometimes I toss in a little lemon zest or a finely smashed caper.

But none of that is needed in a simple vinaigrette.

nednobbins@lemm.ee · 1 month ago

Mustard will make it better but you don’t need it.
You won’t get as good an emulsion and it will separate faster. Once you pour it on some salad it will be pretty hard to notice that.

If someone is at the point in their cooking journey where they’re asking how to make vinaigrette, I keep it as simple as possible. TBH even the pepper isn’t strictly necessary. Many people don’t have pepper grinders and preground pepper doesn’t add much flavor.

Salt is the only one that I’d say is absolutely necessary.

nednobbins@lemm.ee · 1 month ago

It’s a bit complicated for a “simple vinaigrette”.

Pour about equal amounts of oil and vinegar in a jar; add a little salt and pepper.
Screw on the lid and shake that shit.
Pour it on stuff.

Sauces can get arbitrarily complicated. If someone wants a simple recipe, keep it simple.

nednobbins@lemm.ee · 1 month ago

Germany tried to create laws to prevent a repeat of the Holocaust. It’s a laudable effort but they’re failing at it.

The problem is that they were so specific about preventing “The Holocaust” that they ignored many other kinds of bigotry and racism. They thought that if they forbid a few key phrases and symbols, hatred would wither on the vine. Instead they just cleared the way for other aspects of racism to flourish.

nednobbins@lemm.ee · 3 months ago

Ikea really managed to pull off a magnificent marketing stunt.

They have the same furniture quality that you can get off of Amazon, Wayfair, or Walmart but you have to go get it out of their warehouse and deal with the logistics of getting it to your house. But they hand out some meatballs and give everything funky Swedish names; so people get the impression that it’s a fancy European experience.

China is the single largest manufacturer of Ikea products. I found that when I went directly to the source, I could get the same item cheaper or a better item at the same price. That deal is likely to die with the new tariff regime but that same regime will have similar impacts on Ikea.

nednobbins@lemm.ee · 4 months ago

Exactly. The real debate is on which parts should be off limits.

Most people can think of some speech that they consider so horrible that nobody should be allowed to say it.

People often try to hedge that position by arguing that they’re not even really infringing on anyone’s speech because their form of restriction doesn’t meet a sufficient threshold of censorship.

nednobbins@lemm.ee · 4 months ago

Does anyone?

The closest I can think of to “real free speech absolutists” is the old-school doctrinal libertarians. Even they have limits on what they believe should be allowed and specifically state that contracts should be legally enforceable.

nednobbins@lemm.ee · 4 months ago

That sounds like a much more modest proposal.

nednobbins@lemm.ee · 4 months ago

There is already a foolproof method that is immune to any abuse of trust by admins; create an alt account.

nednobbins@lemm.ee · 5 months ago

Yeah. And the fix for that has nothing to do with “de-duping” as a database operation either.

The main components would probably be:

Decide on a new scheme (with more digits)
Create a mapping from the old scheme to the new scheme. (that’s where existing duplicates would get removed)
Let people use both during some transition period, after which the old one isn’t valid any more.
Decide when you’re going to stop issuing old SSNs and only issue new ones to people born after some date.

There’s a lot of complication in each of those steps but none of them are particularly dependant on “de-duped” databases.

nednobbins@lemm.ee · edit-2 5 months ago

It’s so basic that documentation is completely unnecessary.

“De-duping” could mean multiple things, depending on what you mean by “duplicate”.

It could mean that the entire row of some table is the same. But that has nothing to do with the kind of fraud he’s talking about. Two people with the same SSN but different names wouldn’t be duplicates by that definition, so “de-duping” wouldn’t remove it.

It can also mean that a certain value shows up more than once (eg just the SSN). But that’s something you often want in database systems. A transaction log of SSN contributions would likely have that SSN repeated hundreds of times. It has nothing to do with fraud, it’s just how you record that the same account has multiple contributions.

A database system as large as the SSA has needs to deal with all kinds of variations in data (misspellings, abbreviations, moves, siblings, common names, etc). Something as simplistic as “no dupes anywhere” would break immediately.

nednobbins@lemm.ee · 5 months ago

Nobody builds cars under slave like conditions. It’s just not possible. Modern car factories are highly automated plants that require skilled operators. In the case of the VW Xinjiang, that was QC inspectors. There’s no way a hole in the wall car factory using outdated labor practices can come close to competing against modern production.

nednobbins@lemm.ee · 7 months ago

Your post completely ignore my first and main sentence.

It’s the timing that makes you an asshole, not your sentiment.

Israel is currently engaging in genocide. I know it. The UN knows it. Dogs know it.

nednobbins@lemm.ee · 7 months ago

Timing matters. I have family in Austria. I like a lot of things about Austria and I also don’t approve of a lot of things the government does and did.

If someone were to have voiced that sentiment loudly in 1942, they’d probably be an asshole.