https://stratechery.com/2025/deepseek-faq/
That has a great overview - this is a new model, but also a distillation. They used new techniques to make it really cheap (comparatively).