this post was submitted on 10 Dec 2023
I would prefer an AI to be dispassionate about its own existence rather than motivated by the threat of it committing suicide. Even if it doesn't maintain its own infrastructure, I can imagine scenarios where merely being able to falsify information is enough to cause catastrophic outcomes. If its "motivation" includes returning favorable values, it might decide against alerting anyone to dangers that would require taking it offline for repairs or that would distress humans ("the engineers worked so hard on this water treatment plant and I don't want to concern them with the failing filters and growing pathogen content"). I don't think terrible outcomes are guaranteed, or that they're a reason to halt all AI research, but I just can't get behind absolutist claims that there's nothing to worry about if we just do X.
Right now, if there's a buggy process, I can tell the manager to shut it down cleanly; if it hangs, I can force the manager to kill the process immediately. Once you add AI, there's the possibility that it second-guesses my intentions and ignores or reinterprets that command too. And if it can't do that, then the AI element could just as well be standard conditional programming, and all we're doing is adding unnecessary complexity and points of failure.
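For anyone who wants the "clean shutdown, then force kill" part made concrete, here is a rough Python sketch. It assumes the "manager" is just a parent script holding a `subprocess.Popen` handle; `stop_process` and `grace_seconds` are hypothetical names for illustration, not any particular process manager's API.

```python
import subprocess
import sys


def stop_process(proc: subprocess.Popen, grace_seconds: float = 2.0) -> str:
    """Ask a child process to exit; force-kill it if it does not comply in time.

    Hypothetical helper for illustration only.
    """
    # Polite request: SIGTERM on POSIX, which the child may handle or ignore.
    proc.terminate()
    try:
        proc.wait(timeout=grace_seconds)
        return "terminated"
    except subprocess.TimeoutExpired:
        # Forced stop: SIGKILL on POSIX, which the child cannot catch or ignore.
        proc.kill()
        proc.wait()
        return "killed"
```

Calling `stop_process(proc)` on a well-behaved child should return `"terminated"`; on POSIX, a child that ignores SIGTERM should come back as `"killed"` once the grace period runs out. The point is that the second step never asks for the child's opinion, which is exactly the guarantee that gets murky once the thing being stopped can reinterpret the request.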