Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Didn’t we just see big pretraining gains from Google and likely Anthropic?

I like Dario’s view on this, we’ve seen this story before with deep learning. Then we progressively got better regularization, initialization, and activations.

I’m sure this will follow the same suit, the graph of improvement is still linear up and to the right



The gains were on benchmarks. Ilya describes why this is a red herring here: https://youtu.be/aR20FWCCjAs?t=286


Gemini 3 is a huge jump. I can't imagine how anyone who uses the models all the time wouldn't feel this.


What does it do that Opus doesn't do?


I like Ilya's points but its also clearly progress, and we can't just write it off because we like another narrative




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: