TECH & SPACE
PROHR
Space Tracker
Meta tag

retrieval-augmented reasoning

1 article

SLATE Teaches Search Models Where They Went Wrong
AIRewritten
db#3486

SLATE Teaches Search Models Where They Went Wrong

SLATE targets the hardest part of RL search training: teaching a model which exact step helped or harmed the final answer.

27 Apr 2026
⊞ Foto Review