Life in the Singularity

START: Self-taught Reasoner with Tools

Matt McDonagh
Mar 08, 2025

The last few months in AI have been incredible.

The reinforcement learning renaissance is underway. Then we worked out how to give models near-perfect (and near-infinite) memory. All of this led to the breakthrough of giving models a mechanism to self-reward:

Self-Rewarding Reasoning Large Language Models (SR-LLMs)

"There are decades where nothing happens; and there are weeks where decades happen"

All of that in the span of 30 days.

Now, we are giving these reasoning models a large and powerful tool…
