this post was submitted on 27 Jan 2025
882 points (98.1% liked)

Technology

61227 readers
4864 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

cross-posted from: https://lemm.ee/post/53805638

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 51 points 3 days ago* (last edited 3 days ago) (1 children)

Hm even with DeepSeek being more efficient, wouldn’t that just mean the rich corps throw the same amount of hardware at it to achieve a better result?

Only up to the point where the AI models yield value (which is already heavily speculative). If nothing else, DeepSeek makes Altman's plan for $1T in new data-centers look like overkill.

The revelation that you can get 100x gains by optimizing your code rather than throwing endless compute at your model means the value of graphics cards goes down relative to the value of PhD-tier developers. Why burn through a hundred warehouses full of cards to do what a university mathematics department can deliver in half the time?

[–] [email protected] 8 points 2 days ago* (last edited 2 days ago) (1 children)

you can get 100x gains by optimizing your code rather than throwing endless compute at your model

woah, that sounds dangerously close to saying this is all just developing computer software. Don't you know we're trying to build God???

[–] [email protected] 2 points 2 days ago

Altman insisting that once the model is good enough, it will program itself was the moment I wrote the whole thing off as a flop.