Abstracted
A weekly digest of the most commercially relevant arXiv papers for operators, PMs, investors, and non-research engineers.
Library sitemap
All weeks and briefs
Crawlable links to every public weekly edition page and every individual brief page.
Week of Mar 16, 2026
Memento-Skills: Let Agents Design Agents
Governed Memory: A Production Architecture for Multi-Agent Workflows
Lightweight Adaptation for LLM-based Technical Service Agent: Latent Logic Augmentation and Robust Noise Reduction
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild
Evaluating Agentic Optimization on Large Codebases
MAC: Multi-Agent Constitution Learning
CUBE: A Standard for Unifying Agent Benchmarks
The PokeAgent Challenge: Competitive and Long-Context Learning at Scale
Intelligent Co-Design: An Interactive LLM Framework for Interior Spatial Design via Multi-Modal Agents
AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed Multi-Agent Systems
Week of Mar 9, 2026
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Automatic Generation of High-Performance RL Environments
XSkill: Continual Learning from Experience and Skills in Multimodal Agents
Slow-Fast Inference: Training-Free Inference Acceleration via Within-Sentence Support Stability
Think While Watching: Online Streaming Segment-Level Memory for Multi-Turn Video Reasoning in Multimodal Large Language Models
CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges
When OpenClaw Meets Hospital: Toward an Agentic Operating System for Dynamic Clinical Workflows
OSCBench: Benchmarking Object State Change in Text-to-Video Generation
RoboClaw: An Agentic Framework for Scalable Long-Horizon Robotic Tasks
One Supervisor, Many Modalities: Adaptive Tool Orchestration for Autonomous Queries
COMIC: Agentic Sketch Comedy Generation
LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation
Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization
Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning
Resource-constrained Amazons chess decision framework integrating large language models and graph attention
OpenClaw-RL: Train Any Agent Simply by Talking
Context Engineering: From Prompts to Corporate Multi-Agent Architecture
Latent World Models for Automated Driving: A Unified Taxonomy, Evaluation Framework, and Open Challenges
From Days to Minutes: An Autonomous AI Agent Achieves Reliable Clinical Triage in Remote Patient Monitoring
Meissa: Multi-modal Medical Agentic Intelligence
Tool Receipts, Not Zero-Knowledge Proofs: Practical Hallucination Detection for AI Agents
PostTrainBench: Can LLM Agents Automate LLM Post-Training?
SplitAgent: A Privacy-Preserving Distributed Architecture for Enterprise-Cloud Agent Collaboration
Ares: Adaptive Reasoning Effort Selection for Efficient LLM Agents
Week of Mar 2, 2026
HLER: Human-in-the-Loop Economic Research via Multi-Agent Pipelines for Empirical Discovery
SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration