Agentic Abstention: Do Agents Know When to Stop Instead of Act? Paper • 2606.28733 • Published 7 days ago • 141
OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning Paper • 2606.26790 • Published 9 days ago • 54
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published Jun 1 • 237
view article Article Beyond LoRA: Can you beat the most popular fine-tuning technique? +2 BenjaminB, sayakpaul, hubnemo, kashif • 16 days ago • 71
SkillOpt: Executive Strategy for Self-Evolving Agent Skills Paper • 2605.23904 • Published May 22 • 250
view article Article Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI nvidia • 30 days ago • 12