Ben Thompson / Stratechery:
An in-depth look at DeepSeek: DeepSeekMoE and DeepSeekMLA, cheap V3 training, the US chip ban, “distillation” from other models, Nvidia impact, AGI, and more — It’s Monday, January 27. Why haven’t you written about DeepSeek yet? — I did! I wrote about R1 last Tuesday.