Science – Page 42

Listening In: A New Approach to Understanding Audio

15.02.2026 by ebaster

Echo’s audio-interleaved reasoning demonstrates a patterned coverage of audio segments relative to position, suggesting the model prioritizes certain temporal features when processing audio for multimodal machine reasoning tasks.

Researchers are developing language models that can continuously process and reason about audio, moving beyond simple transcription to true comprehension.

The Human Factor in Open Source

15.02.2026 by ebaster

New research reveals that the success of open-source projects hinges less on code quality and more on the subtle dynamics of team collaboration and individual motivations.

Crystals Imagined: Generating New Materials with AI

15.02.2026 by ebaster

The architecture dissects input tokens into auxiliary, global, and Fourier components, then encodes them via a depth-aligned ladder-a process allowing the decoder to reconstruct both structural information and Fourier coefficients by strategically injecting auxiliary data at corresponding depths, effectively demonstrating a novel approach to information distillation and reconstruction.

Researchers are leveraging the power of diffusion models and Fourier transforms to design and create novel crystalline structures with unprecedented efficiency.

Securing Autonomous Agents: A New Era of Trustworthy AI

15.02.2026 by ebaster

As AI agents become increasingly powerful, ensuring their security and reliability is paramount, and new approaches are needed to defend against emerging threats like prompt injection.

Dividing the Labor: How We Assign Tasks to AI

15.02.2026 by ebaster

Archetypal frameworks offer a versatile foundation for human-LLM collaborative decision-making, enabling combinations, layering, and integration to address complex challenges.

A new framework categorizes the diverse roles humans and large language models play in collaborative decision-making.

Decoding the Museum Experience: A New Dataset for Visitor Understanding

15.02.2026 by ebaster

Researchers have released a comprehensive dataset combining visitor movement, gaze patterns, and demographic information to unlock deeper insights into how people interact with museum exhibits.

The Self-Improving Eye: AI Learns to See by Exploring and Teaching Itself

15.02.2026 by ebaster

Unlike passive approaches constrained by predefined datasets, this work introduces Active-Zero, a system that proactively explores open-world environments to enhance visual language model reasoning through a self-reinforcing cycle of image retrieval and problem-solving, effectively scaling its capabilities beyond the limitations of fixed data boundaries.

Researchers have developed a novel framework that allows vision-language models to actively explore environments and generate their own training data, leading to more robust visual understanding.

Teaching Robots to Handle Anything You Throw At Them

15.02.2026 by ebaster

The system constructs a long-horizon household planning dataset through task synthesis and annotation, then refines a policy via supervised fine-tuning before employing a reinforcement learning loop-integrating external correction of reasoning traces and constrained sampling-to optimize subgoal generation, task decomposition, and ultimately, robust planning performance.

Researchers have developed a new AI planning system that allows robots to understand and execute complex, everyday tasks in dynamic home environments.

Beyond Physics: Rewriting the Rules of Life

15.02.2026 by ebaster

A new perspective argues that understanding living systems, particularly neuronal function, demands a departure from conventional physics and the embrace of ‘non-ordinary’ laws.

Seeing is Understanding: A New Path for Multimodal AI

15.02.2026 by ebaster

Researchers have developed a unified model that excels at both understanding and generating images and text, bridging the gap between visual and language intelligence.