• 0 Posts
  • 120 Comments
Joined 2 years ago
cake
Cake day: June 25th, 2023

help-circle

  • And then fuck it up by pointing Linux at your windows EFI partition, end up with neither system bootable and make things worse as you panic and try to rush a fix without understanding what you’re doing.

    If you’re new to how it all works and having a working machine is important, best to keep it simple and as separated as you can.

    I’m also not convinced that “Windows doesn’t know about the other partitions”, that sounds like the kind of thing that’s true until it isn’t and it overwrites your Linux bootloader.


  • This is certainly an odd suggestion, and not what you’re really asking for, but makes me think of Space Station 13. It’s a janky round-based multiplayer roleplay/social intrigue game. It’s free, and the game is opensource (though not the engine), which also leads to there being many servers with unique variations. It’s cheating to suggest a multiplayer game when talking about single player natural language processing games, but using actual players is probably the easiest way to pull it off.

    The reason it reminds me is because on a roleplay server, you’ve got something like 20 people, each with their job to do, talking to each other, talking on common radio, etc. - and if you’re lucky, a player playing as the station AI, complete with a (modifiable) lawset they have to follow, Asimov’s laws style. And of course, a few antagonists that have objectives to do.

    If you’re curious, I personally recommend BeeStation, though there are a lot of fine choices for the server, just maybe stay away from the 18+ ones.



  • I would argue that memorization is important, but what you memorize and how you arrive at that is very personal. Forcing kids to memorize very specific things, and trying to enforce memorization (as opposed to the ability to arrive at the solution) seems like a bad idea to me.

    I still don’t have the 10x10 multiplication table memorized, and I took physics in high school and work as a programmer. I have a use for knowing number multiples, and have domain-specific numbers memorized (2^8=8*8=256, 256*256=65536), but what I don’t remember off the top of my head I can figure out from the things I do know, from certain tricks, and from brute force mental math juggling numbers.

    And the important thing to me is, I learned what I know not because somebody told me this is how I should do things, but because I picked them up as needed, a mix of memorizing common multiplications and figuring out tricks (like multiples of 9*N for N<11 being the digits N-1 and 10-N)





  • That actually sounds like a fun SCP - a word that doesn’t seem to contain a letter, but when testing for the presence of that letter using an algorithm that exclusively checks for that presence, it reports the letter is indeed present. Any attempt to check where in the word the letter is, or to get a list of all letters in that word, spuriously fail. Containment could be fun, probably involving amnestics and widespread societal influence, I also wonder if they could create an algorithm for checking letter presence that can be performed by hand without leaking any other information to the person performing it, reproducing the anomaly without computers.




  • If the password is securely hashed, and the attack only includes data exfiltration, then there’s theoretically no risk of breaking into users’ accounts anyways. However, the issue is that if somebody can log into your Plex account, that means they got your password somehow - and if they did get that password, they can use it elsewhere. So if there’s any reason to change your password on Plex, then there’s just as much reason to change that same password elsewhere.





  • I’m confused. It’s based on arch but not really? Is it arch based or not? Does it use any arch package manager? The post raises a number of new questions

    The answers to that seem pretty obvious to me: yes, it is based on arch. No, it does not come with a package manager. Presumably, they use Arch packaging tools and package definitions behind the scenes in some way, but the end result is a premade immutable system image.


  • I really hope not, that feels like crypto all over again, with inconsistent payouts and varying electricity prices… And on top of that probably awful service since people tend to have the weirdest internet connections.

    Though if you remove the part where it’s used to stream games to other players, that sounds too niche to be viable, but could be cool. If going in that direction, I’d imagine it more likely to be gaming servers for businesses, like VR gaming spots, where they have multiple gaming computers hooked up to headsets.



  • Apertus was developed with due consideration to Swiss data protection laws, Swiss copyright laws, and the transparency obligations under the EU AI Act. Particular attention has been paid to data integrity and ethical standards: the training corpus builds only on data which is publicly available. It is filtered to respect machine-readable opt-out requests from websites, even retroactively, and to remove personal data, and other undesired content before training begins.

    We probably won’t get better, but sounds like it’s still being trained on scraped data unless you explicitly opt out, including anything that may be getting mirrored by third parties that don’t opt out. Also, they can remove data from the training material retroactively… But presumably won’t be retraining the model from scratch, which means it will still have that in their weights, and the official weights will still have a potential advantage on models trained later on their training data.

    From the license:

    SNAI will regularly provide a file with hash values for download which you can apply as an output filter to your use of our Apertus LLM. The file reflects data protection deletion requests which have been addressed to SNAI as the developer of the Apertus LLM. It allows you to remove Personal Data contained in the model output.

    Oof, so they’re basically passing on data protection deletion requests to the users and telling them all to respectfully account for them.

    They also claim “open data”, but I’m having trouble finding the actual training data, only the “Training data reconstruction scripts”…