Mar 8, 2026

HLER: Human-in-the-Loop Economic Research via Multi-Agent Pipelines for Empirical Discovery

Chen Zhu, Xiaolu Wang/arXiv abstract

Why this is worth your attention

This paper matters because it makes a specific part of “AI can automate research” look more operationally real: not autonomous genius, but a cheap, structured workflow that turns a dataset into a draft empirical paper with humans approving the key decisions. The headline change is less about model brilliance than about reducing wasted cycles from bad questions—HLER’s dataset-aware setup cut infeasible hypotheses sharply and completed most runs end to end in 20–25 minutes at very low API cost. If that pattern holds outside this small test, economics, policy, market research, and internal analytics teams could industrialize parts of empirical analysis faster than most current research workflows assume. The catch is readiness: evidence is still from just 14 runs on three datasets, and some quality claims rely on the same LLM family grading its own output.

Thank you to arXiv for use of its open access interoperability. This product was not reviewed or approved by, nor does it necessarily express or reflect the policies or opinions of, arXiv.
LightDark