There’s some truth in that, but isn’t making a radically cheaper version also a ... | Hacker News


actuallyalys 10 months ago | on: OpenAI says it has evidence DeepSeek used its mode...

There’s some truth in that, but isn’t making a radically cheaper version also a new idea, one that DeepSeek couldn’t know in advance would work? I mean, there was already research into distillation, but there was also already research into some of (most of?) OpenAI’s ideas.

janalsncm 10 months ago

Yes, anyone who looks into the research DeepSeek released will find a good number of novelties that enabled much cheaper R&D, for example improvements to Mixture of Experts modules and Multi-head Latent Attention. If you have infinite money, you don’t need to innovate there, but DeepSeek didn’t have infinite money.
