← Today/2026-03-15

📅 Sunday, March 15, 2026

Top ML papers scored in PaperBrief digests on this day.

📅

Top Papers — Mar 15

The highest-scoring arXiv ML papers from Mar 15, ranked by LLM relevance.

1📄 Notable

Security Considerations for Artificial Intelligence Agents

Ninghui Li +3 · 2026-03-12

5.0

This article, a lightly adapted version of Perplexity's response to NIST/CAISI Request for Information 2025-0035, details our observations and recommendations concerning the security of frontier AI ag…

arXiv →PDF →PaperBrief →

2📄 Notable

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Guanyu Jiang +4 · 2026-03-12

5.0

Multimodal agents can now tackle complex reasoning tasks with diverse tools, yet they still suffer from inefficient tool use and inflexible orchestration in open-ended settings. A central challenge is…

arXiv →PDF →PaperBrief →

3📄 Notable

Mock Worlds, Real Skills: Building Small Agentic Language Models with Synthetic Tasks, Simulated Environments, and Rubric-Based Rewards

Yuanjie Lyu +4 · 2026-01-30

5.0

Small LLMs often struggle to match the agentic capabilities of large, costly models. While reinforcement learning can help, progress has been limited by two structural bottlenecks: existing open-sourc…

arXiv →PDF →PaperBrief →

4📄 Notable

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Łukasz Borchmann +14 · 2026-03-12

4.0

Multimodal agents offer a promising path to automating complex document-intensive workflows. Yet, a critical question remains: do these agents demonstrate genuine strategic reasoning, or merely stocha…

arXiv →PDF →PaperBrief →

5📄 Notable

Can Small Language Models Use What They Retrieve? An Empirical Study of Retrieval Utilization Across Model Scale

Sanchit Pandey · 2026-03-12

4.0

Retrieval augmented generation RAG is widely deployed to improve factual accuracy in language models yet it remains unclear whether smaller models of size 7B parameters or less can effectively utilize…

arXiv →PDF →PaperBrief →

6📄 Notable

Try, Check and Retry: A Divide-and-Conquer Framework for Boosting Long-context Tool-Calling Performance of LLMs

Kunfeng Chen +4 · 2026-03-12

4.0

Tool-calling empowers Large Language Models (LLMs) to interact with external environments. However, current methods often struggle to handle massive and noisy candidate tools in long-context tool-call…

arXiv →PDF →PaperBrief →

7📄 Notable

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Weixun Wang +89 · 2025-12-31

4.0

Agentic crafting requires LLMs to operate in real-world environments over multiple turns by taking actions, observing outcomes, and iteratively refining artifacts. Despite its importance, the open-sou…

arXiv →PDF →PaperBrief →

8📄 Notable

Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation

Yuxiang Zhou +4 · 2025-11-15

4.0

Mobile agents show immense potential, yet current state-of-the-art (SoTA) agents exhibit inadequate success rates on real-world, long-horizon, cross-application tasks. We attribute this bottleneck to …

arXiv →PDF →PaperBrief →

9📄 Notable

TURA: Tool-Augmented Unified Retrieval Agent for AI Search

Zhejun Zhao +10 · 2025-08-06

4.0

The advent of Large Language Models (LLMs) is transforming search engines into conversational AI search products, primarily using Retrieval-Augmented Generation (RAG) on web corpora. However, this par…

arXiv →PDF →PaperBrief →

10📄 Notable

CARROT: A Learned Cost-Constrained Retrieval Optimization System for RAG

Ziting Wang +4 · 2024-11-01

4.0

Large Language Models (LLMs) have demonstrated impressive ability in generation and reasoning tasks but struggle with handling up-to-date knowledge, leading to inaccuracies or hallucinations. Retrieva…

arXiv →PDF →PaperBrief →

Want papers like these in your inbox?

PaperBrief sends you a personalised daily digest of the arXiv papers that actually matter for your research track.

Get your personalised digest →