This is the first private message I get on Lemmy, it immediately seemed suspicious to me so I tried the famous thing… and it worked!
This is the first private message I get on Lemmy, it immediately seemed suspicious to me so I tried the famous thing… and it worked!
For further research look into ‘system prompts’.
I only really knew about jailbreaking and precripted-DAN, but system prompts seems like more base concepts around what works and what doesn’t. Thanks you for this, it seems right inline with what I’m looking for.