Guarding the Interface: Detecting Malicious Agent Actions
A new framework, AegisUI, proactively identifies dangerous commands generated by AI agents interacting with user interfaces.
A new framework, AegisUI, proactively identifies dangerous commands generated by AI agents interacting with user interfaces.
![The proposed SURE framework extracts both coarse [latex]F_c[/latex] and fine [latex]F_f[/latex] features, then refines initial correspondences [latex]M_c[/latex] by sampling the fine features, ultimately producing precise offsets [latex](\Delta x, \Delta y)[/latex] alongside uncertainty estimates derived from a Normal-Inverse-Gamma distribution parameterized by [latex](\psi, \eta, \kappa, \rho)[/latex], effectively modeling both aleatoric and epistemic uncertainties in the regression process.](https://arxiv.org/html/2603.04869v1/2603.04869v1/fig2/structure5.png)
A new framework, SURE, elevates the accuracy of image correspondence by explicitly modeling and fusing uncertainty estimates during the matching process.

A new framework enables the creation of dynamically generated task families, allowing for more robust and nuanced evaluation of artificial intelligence’s reasoning capabilities.
![The network illustrates how specific sequential patterns-identified by weighted entropy-induce predictable or uncertain transitions in a target sequence, revealing that deterministic behavior arises not from inherent properties, but from the consistent application of these patterned influences-a relationship observable in both directions of influence, where [latex]0[/latex] weighted entropy signals a strong, predictable link and higher values suggest increasing indeterminacy.](https://arxiv.org/html/2603.04473v1/2603.04473v1/causal_network_dual.png)
A new method leverages recurring patterns to infer causal relationships without relying on traditional assumptions.
Researchers have developed a novel 3D printing technique that combines active and passive elastomers to create materials capable of complex, user-defined shape changes.
![The system dissects visual observation into task-relevant and irrelevant regions, employing [latex]SAM[/latex] and [latex]XMem++[/latex] to propagate segmentation masks-then strategically augments the former with task-specific transformations while randomly perturbing the latter via [latex]PixMix[/latex], effectively generating diverse training data by exploiting the interplay between focused manipulation and controlled chaos.](https://arxiv.org/html/2603.04845v1/2603.04845v1/x3.png)
A new approach selectively enhances training images to help robots master complex agricultural tasks through vision alone.

Researchers have developed a framework that combines the power of deep learning with logical reasoning to more accurately and transparently interpret patient behavior.

A new motion planning algorithm significantly speeds up trajectory generation for robotic arms by exploring multiple solutions simultaneously.

Researchers have developed a rigorous method for evaluating the structural reasoning abilities of large language models, moving beyond simple benchmarks to assess genuine problem-solving skills.

Researchers unveil a novel blockchain-based system designed to bring greater security, efficiency, and user control to the rapidly evolving smart home landscape.