31/01/2025
Wes Roth gives an overview of Berkeley's reproduction of DeepSeek R1's core technologies, applied to the Countdown game
https://www.youtube.com/watch?v=E_h8xt0X1Kg
Links to original sources:
GitHub:
https://github.com/Jiayi-Pan/TinyZero
X post by the author:
https://x.com/jiayi_pirate/status/1882839370505621655
Berkeley Researchers Replicate DeepSeek R1's Core Tech for Just $30: A Small Model RL Revolution
https://xyzlabs.substack.com/p/berkeley-researchers-replicate-deepseek
A Berkeley AI Research team led by PhD candidate Jiayi Pan has achieved what many thought impossible: reproducing DeepSeek R1-Zero's key technologies for less than the cost of a dinner for two.