Science – Page 144

Seeing the Big Picture: Visual Memory for Smarter AI Agents

02.02.2026 by ebaster

Despite extreme constraints on memory-limited to just 16 tokens-a system employing adaptive layout maintained crucial textual evidence-specifically, the name “Gene MacLellan”-through down-sampling, enabling accurate reasoning where standard truncation and uniform text rendering failed.

Researchers have developed a new approach to equipping AI agents with long-term memory, allowing them to reason more effectively over extended periods.

Learning Physics with Noise: A New Approach to System Modeling

02.02.2026 by ebaster

Researchers are leveraging the power of generative models to create more accurate and data-efficient simulations of complex physical systems.

AI Code Generation: The Hidden Cost of Redundancy

02.02.2026 by ebaster

The process captures modifications originating from pull requests, integrating iterative improvements and collaborative contributions into a cohesive and evolving system-a testament to the principle that all structures are subject to change, and adaptation is the measure of their longevity.

A new study reveals that code produced by AI agents often contains significantly more duplicated code than human-written software, creating a potential maintenance burden.

Beyond Sets: Learning Continuous Representations for Discrete Data

02.02.2026 by ebaster

$The study transforms a two-dimensional function into a density field [latex]\rho(x,y)[/latex] by treating local peaks and valleys as individual entities and encoding their relationships using the CORDS method, effectively mapping a continuous surface onto a discrete representation.$

A new framework, CORDS, offers a powerful way to represent variable-size collections of objects using continuous fields, bridging the gap between discrete and continuous learning.

Giving AI Characters a Mind of Their Own

02.02.2026 by ebaster

Researchers are pushing the boundaries of artificial intelligence role-playing by equipping language models with more sophisticated reasoning and reward systems.

Can AI Become a true Coding Partner?

02.02.2026 by ebaster

$The system employs a rigorous execution pipeline wherein each task is encapsulated within a Docker container, enabling the automated launch of agents-be they large language models or oracle baselines equipped with integrated development environment tools-and the comprehensive capture of all interactions, followed by automated grading via test suite execution and precise code change extraction using [latex]git\ diff[/latex] for comparison against a definitive golden solution.$

A new benchmark assesses how well artificial intelligence agents handle realistic software engineering challenges, moving beyond simple code completion.

Web Agents Learn to Dream: A New Approach to Online Automation

02.02.2026 by ebaster

DynaWeb cultivates web-navigating agents through a process of simulated experience, leveraging a learned world model to generate imagined trajectories mixed with limited real-world examples, and optimizing policy through sequence-level reinforcement learning to navigate complex web tasks despite sparse rewards-a system designed not by construction, but by fostering an internal, predictive ecology.

Researchers have developed a system that allows web-navigating agents to learn more efficiently by simulating online experiences, reducing the need for constant real-world interaction.

Debating Data: A New Framework for Transparent AI Decision-Making

02.02.2026 by ebaster

The framework models agentic interactions through a structured, seven-turn protocol-encompassing private strategic deliberation alongside public debate-with all exchanges and evolving beliefs meticulously recorded to ensure complete transparency and facilitate rigorous analysis of emergent dynamics.

Researchers have developed a multi-agent simulation of a courtroom debate to improve the clarity and reliability of artificial intelligence systems when analyzing complex, tabular data.

Building Blocks for Better Molecules

01.02.2026 by ebaster

SoftMol cultivates molecular designs through a block-diffusion transformer, iteratively refining candidate structures via semi-autoregressive denoising and a gated Monte Carlo tree search-where exploration balances pharmacological feasibility with docking success, and failed candidates incur penalties, ultimately shaping a generative ecosystem rather than a deterministic solution.

A new framework leverages diffusion models and intelligent search to design novel compounds with enhanced properties and targeted activity.

Can Language Models Truly Reason About Cause and Effect?

01.02.2026 by ebaster

The system rigorously verifies the semantic equivalence of causal expressions generated by a model to ground truth, employing do-calculus and probabilistic reasoning to explore all valid derivations within a given directed acyclic graph-a method that surpasses simple string matching in its capacity for formal validation and ensuring logically sound inferences.

New research reveals a method for rigorously evaluating whether large language models’ causal statements align with underlying causal relationships.