this post was submitted on 23 Sep 2024
175 points (94.9% liked)
Technology
60642 readers
3881 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I don't understand why it's so hard to sandbox an LLM's configuration data from it's training data.
Because its all one thing. The promise of AI is that you can basically throw anything at it, and you don't need to understand exactly how/why it makes the connections it does; you just adjust the weights until it kinda looks alright.
There are many structural hacks used to give it better results (and in this case some form of reasoning) but ultimately they're mostly relying on connecting multiple nets together and retrying queries and such. There's no human understandable settings. Neural networks are basically one input and one output (unless you're training it).
What do you mean by "configuration data?"
The data used to configure it.
Do you mean finetune data?
A model's configuration data is training data.