StepSearch
1 article
SLATE Teaches Search Models Where They Went Wrong
SLATE targets the hardest part of RL search training: teaching a model which exact step helped or harmed the final answer.
27 Apr 2026