ChatGPT

Unofficial ChatGPT community to discuss anything ChatGPT

founded 1 year ago

ChatGPT would be so much more useful and trustworthy if it were able to accept that it doesn't know an answer. (programming.dev)

submitted 4 months ago* (last edited 4 months ago) by [email protected] to c/[email protected]

Small rant: basically, the title. If it said it doesn't know the answer instead of answering every question, it would be trustworthy.

[email protected] 1 point 4 months ago

It's right in the research I was mentioning:

https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html

Find the section on the model's representation of self and then the ranked feature activations.

I slightly misremembered the top feature, which was: responding "I'm fine" or giving a positive but insincere response when asked how they are doing.