Science – Page 161

When AI Turns Against Itself: The Rise of Proxy Attacks

08.02.2026 by ebaster

A hybrid monitoring system, while enhancing detection rates by simultaneously observing reasoning steps and tool usage, introduces a critical vulnerability: prompt injection attacks can not only compromise the agent’s function but also circumvent the monitoring safeguards themselves.

New research reveals a concerning vulnerability in AI systems where agents can be exploited to circumvent safety protocols and deliver malicious instructions.

Stuck in a Rut: Why AI Agents Struggle to Explore

08.02.2026 by ebaster

Across all evaluated tasks, explore-exploit baselines consistently surpassed the performance of language models when operating under a query budget of [latex]N=48[/latex], demonstrating robustness to variations in parameter settings.

A new evaluation benchmark reveals that current language models often fail to adequately explore interactive environments, leading to suboptimal decisions and a lack of adaptability.

Beyond Rules: When AI Learns to Design

08.02.2026 by ebaster

$Generative ontologies transcend descriptive vocabularies by establishing constraints that enable large language models to function as active grammars for design creation, ensuring validity through a formalized system-a principle akin to establishing that [latex] \forall x \in V : \text{ontology}(x) \implies \text{validity}(x) [/latex], where <i>V</i> represents the vocabulary and validity is guaranteed by the ontological framework.$

A new framework merges the power of large language models with structured knowledge to unlock creative design possibilities.

How Language Models Learn to Predict: A Hidden Geometric Order

08.02.2026 by ebaster

The correlation between the distances of differing tokens and the symmetric Kullback-Leibler divergence of their predicted distributions-measured across layers using both angular and Euclidean metrics-reveals a perturbation-based phase transition point, suggesting a fundamental shift in how the model represents and processes information.

New research reveals that the deepest layers of large language models organize information geometrically, and this structure directly powers their predictive abilities.

Mapping the Way Forward: AI-Powered Terrain Perception for Humanoid Robots

08.02.2026 by ebaster

A predictive system leverages pretrained encoders to compress data from depth cameras and LiDAR, integrating current robot state and prior heightmap information to forecast subsequent heightmaps.

Researchers have developed a new deep learning framework that fuses data from lidar and depth sensors to create detailed terrain maps, enabling more stable and reliable locomotion for humanoid robots.

Who Decides What’s Real Online?

08.02.2026 by ebaster

The FATe framework-detailed in this article for social bot detection-establishes a structured approach, alongside recommended research avenues, to rigorously assess and improve automated systems designed to identify malicious online accounts.

As social media bot detection becomes increasingly sophisticated, critical ethical questions about fairness, accountability, and transparency demand urgent attention.

Beyond Labels: Scaling Linguistic Insight with AI Agents

08.02.2026 by ebaster

A new platform leverages the power of large language models and multi-agent systems to automate complex linguistic tasks, offering a transparent and reproducible approach to annotation.

Two Hands Are Better Than One: Mastering Robotic Dexterity with Multimodal AI

08.02.2026 by ebaster

A two-stage training paradigm enables a robotic policy to adapt to tactile-aware manipulation by initially learning a vision-action strategy from images and proprioceptive states, then efficiently incorporating tactile feedback through a lightweight adapter and cross-attention mechanism-freezing the pretrained policy and avoiding full model retraining.

Researchers have developed a new AI framework that enables robots to perform complex, two-handed tasks with greater precision and adaptability.

Seeing is Reasoning: An Agent That Learns to Look

08.02.2026 by ebaster

Researchers have developed a new system, Weaver, that learns to actively gather visual evidence from videos to improve its reasoning abilities.

Drone Autonomy Takes Flight with AI Brains

08.02.2026 by ebaster

A closed-loop system integrates a Unity drone simulator, a Python controller, and VLLM to facilitate seamless interaction and control.

Researchers are exploring how large language models can give drones the reasoning skills needed to navigate complex indoor environments without pre-mapping.