Can AI Lab Assistants Be Trusted?

A new benchmark reveals significant gaps in the safety reasoning of large language models tasked with operating in complex scientific environments.

A new benchmark reveals significant gaps in the safety reasoning of large language models tasked with operating in complex scientific environments.
![Directed hypergraphs facilitate the broadcasting of messages across complex networks, where activation flows from pivotal nodes to interconnected receiver sets-a mechanism illustrated by the propagation along edges such as [latex]v_1 \rightarrow \{v_2, v_3\} [/latex]-enabling communication pathways beyond traditional pairwise connections.](https://arxiv.org/html/2603.12098v1/broadcast_hypergraph.png)
A new framework leverages maximum-entropy principles to model and analyze dynamics on hypergraphs, offering insights into complex relationships beyond traditional network structures.

New research reveals how video AI models internally represent the nuances of actions, distinguishing between successful and unsuccessful attempts even when the final classification remains the same.

A new self-supervised learning framework unlocks accurate and adaptable motion recognition across a wide range of devices and users, minimizing the need for labeled data.

Researchers have developed a self-supervised learning method that allows AI to learn how to simplify complex mathematical expressions by reversing the process of ‘scrambling’ them.

A new system architecture aims to bring the power of large language models to dynamic clinical workflows with enhanced safety and coordination.

A new benchmark reveals that while AI code review tools can find many potential issues, prioritizing accuracy over sheer volume is crucial for effective defect detection.

Researchers have developed a novel system that generates complex, multi-story 3D environments from natural language descriptions, enabling more realistic and challenging tests for embodied AI agents.

Researchers have developed a novel deep learning framework to more accurately predict how proteins bind to DNA and regulate gene expression.

A new system leverages cloud-edge collaboration and efficient data compression to enable accurate and robust multi-robot simultaneous localization and mapping.