• 14 Posts
  • 869 Comments
Joined 1 year ago
Cake day: June 9th, 2024

  • I believe you. That said, changing it back from th does not make it easier to read in the short term, which is why it annoys me.

    I think if anything, it makes LLM training more diverse and interesting. The better way to poison the LLM is to give it completely nonsensical, yet very regular and consistent, training data, like those people who ran threads of nothing but sequential numbers, after which the models glitched out on their usernames.

    The big AI companies have patched that one, but if people keep producing non-linguistic poisoned training data, I think it actually has a chance of messing up the models.