Skip to content

BBG NEWS

  • Science
  • Who is Denis Avetisyan?

Science

Seeing the Big Picture: Visual Memory for Smarter AI Agents

02.02.2026 by ebaster

Despite extreme constraints on memory-limited to just 16 tokens-a system employing adaptive layout maintained crucial textual evidence-specifically, the name “Gene MacLellan”-through down-sampling, enabling accurate reasoning where standard truncation and uniform text rendering failed.

Researchers have developed a new approach to equipping AI agents with long-term memory, allowing them to reason more effectively over extended periods.

Categories Science

Learning Physics with Noise: A New Approach to System Modeling

02.02.2026 by ebaster

Conditional Denoising Models (CDMs) demonstrate superior predictive accuracy and physical consistency-measured by reduced constraint violations-compared to physics-consistent baselines, with performance scaling predictably with model complexity and maintaining robustness even with limited training data, as evidenced by consistently lower Root Mean Squared Error (RMSE) across multiple independent training runs and data splits.

Researchers are leveraging the power of generative models to create more accurate and data-efficient simulations of complex physical systems.

Categories Science

AI Code Generation: The Hidden Cost of Redundancy

02.02.2026 by ebaster

The process captures modifications originating from pull requests, integrating iterative improvements and collaborative contributions into a cohesive and evolving system-a testament to the principle that all structures are subject to change, and adaptation is the measure of their longevity.

A new study reveals that code produced by AI agents often contains significantly more duplicated code than human-written software, creating a potential maintenance burden.

Categories Science

Beyond Sets: Learning Continuous Representations for Discrete Data

02.02.2026 by ebaster

The study transforms a two-dimensional function into a density field [latex]\rho(x,y)[/latex] by treating local peaks and valleys as individual entities and encoding their relationships using the CORDS method, effectively mapping a continuous surface onto a discrete representation.

A new framework, CORDS, offers a powerful way to represent variable-size collections of objects using continuous fields, bridging the gap between discrete and continuous learning.

Categories Science

Giving AI Characters a Mind of Their Own

02.02.2026 by ebaster

Researchers are pushing the boundaries of artificial intelligence role-playing by equipping language models with more sophisticated reasoning and reward systems.

Categories Science

Can AI Become a true Coding Partner?

02.02.2026 by ebaster

The system employs a rigorous execution pipeline wherein each task is encapsulated within a Docker container, enabling the automated launch of agents-be they large language models or oracle baselines equipped with integrated development environment tools-and the comprehensive capture of all interactions, followed by automated grading via test suite execution and precise code change extraction using [latex]git\ diff[/latex] for comparison against a definitive golden solution.

A new benchmark assesses how well artificial intelligence agents handle realistic software engineering challenges, moving beyond simple code completion.

Categories Science

Web Agents Learn to Dream: A New Approach to Online Automation

02.02.2026 by ebaster

DynaWeb cultivates web-navigating agents through a process of simulated experience, leveraging a learned world model to generate imagined trajectories mixed with limited real-world examples, and optimizing policy through sequence-level reinforcement learning to navigate complex web tasks despite sparse rewards-a system designed not by construction, but by fostering an internal, predictive ecology.

Researchers have developed a system that allows web-navigating agents to learn more efficiently by simulating online experiences, reducing the need for constant real-world interaction.

Categories Science

Debating Data: A New Framework for Transparent AI Decision-Making

02.02.2026 by ebaster

The framework models agentic interactions through a structured, seven-turn protocol-encompassing private strategic deliberation alongside public debate-with all exchanges and evolving beliefs meticulously recorded to ensure complete transparency and facilitate rigorous analysis of emergent dynamics.

Researchers have developed a multi-agent simulation of a courtroom debate to improve the clarity and reliability of artificial intelligence systems when analyzing complex, tabular data.

Categories Science

Building Blocks for Better Molecules

01.02.2026 by ebaster

SoftMol cultivates molecular designs through a block-diffusion transformer, iteratively refining candidate structures via semi-autoregressive denoising and a gated Monte Carlo tree search-where exploration balances pharmacological feasibility with docking success, and failed candidates incur penalties, ultimately shaping a generative ecosystem rather than a deterministic solution.

A new framework leverages diffusion models and intelligent search to design novel compounds with enhanced properties and targeted activity.

Categories Science

Can Language Models Truly Reason About Cause and Effect?

01.02.2026 by ebaster

The system rigorously verifies the semantic equivalence of causal expressions generated by a model to ground truth, employing do-calculus and probabilistic reasoning to explore all valid derivations within a given directed acyclic graph-a method that surpasses simple string matching in its capacity for formal validation and ensuring logically sound inferences.

New research reveals a method for rigorously evaluating whether large language models’ causal statements align with underlying causal relationships.

Categories Science
Older posts
Newer posts
← Previous Page1 … Page143 Page144 Page145 … Page174 Next →
© 2026 BBG NEWS • Built with GeneratePress