Can AI Truly Discover New Worlds?

The Stargazer framework facilitates iterative agent development by generating tasks from both synthetic physics and real-world data, then evaluating submissions based on [latex]\Delta\Delta BIC[/latex], RMS error, matching criteria, and count statistics to provide per-criterion feedback.

A new benchmark environment, Stargazer, challenges artificial intelligence to move beyond statistical analysis and demonstrate genuine physical reasoning in the search for exoplanets.