this post was submitted on 20 Aug 2024
717 points (98.8% liked)

Programmer Humor

[–] [email protected] 1 points 2 months ago

Yep.

I'd still call that memory. It's not the present; arguably, for a (post-training) LLM the present consists entirely of choosing probabilities for the next token, and there is no notion of a future. That's really just a choice of interpretation, though.

During training they definitely can learn and remember things (or at least "learn" and "remember"). Sometimes they do it despite our best efforts, since we don't really want them to know a real, non-celebrity person's information. Training ends before the consumer uses the thing, though, and after that it's kind of like we're running a coma patient.
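
If it helps, here's a toy sketch of that split. It is not how a real LLM works: `ToyBigramModel` is a made-up stand-in where bigram counts play the role of weights, and the method names are my own. The only point is that `train` changes the "weights", while `next_token_probs` after freezing just reads them and picks probabilities for the next token.

```python
from collections import Counter, defaultdict


class ToyBigramModel:
    """Hypothetical stand-in for an LLM; bigram counts act as the 'weights'."""

    def __init__(self):
        self.counts = defaultdict(Counter)
        self.frozen = False

    def train(self, tokens):
        # Training is the only place the weights change.
        if self.frozen:
            raise RuntimeError("training is over; weights are frozen")
        for prev, nxt in zip(tokens, tokens[1:]):
            self.counts[prev][nxt] += 1

    def freeze(self):
        # After this, the model is the "coma patient": it only gets read.
        self.frozen = True

    def next_token_probs(self, context):
        # Inference depends only on (weights, context) and mutates nothing.
        if not context:
            return {}
        following = self.counts.get(context[-1])
        if not following:
            return {}
        total = sum(following.values())
        return {tok: n / total for tok, n in following.items()}


model = ToyBigramModel()
model.train("the cat sat on the mat".split())
model.freeze()

snapshot = {k: dict(v) for k, v in model.counts.items()}
probs = model.next_token_probs(["on", "the"])       # "the" was followed by "cat" and "mat"
unseen = model.next_token_probs(["my", "name", "is", "Bob"])  # never trained on "Bob"
unchanged = snapshot == {k: dict(v) for k, v in model.counts.items()}
```

Telling the frozen model that your name is Bob leaves the counts exactly as they were; anything it "knows" about the conversation has to be in the context you pass in each time.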