Culture Magazine

DeepSeek Works Through Distillation [Schmidhuber]

By Bbenzon @bbenzon

DeepSeek [1] uses elements of the 2015 reinforcement learning prompt engineer [2] and its 2018 refinement [3] which collapses the RL machine and world model of [2] into a single net through the neural net distillation procedure of 1991 [4]: a distilled chain of thought system.… pic.twitter.com/wpk2tFCoUx
— Jürgen Schmidhuber (@SchmidhuberAI) January 31, 2025
