Smart Factories on Rails: The Rise of Infrastructure-Driven Robotics

Author: Denis Avetisyan


A new architecture integrating cloud computing and advanced sensing is poised to unlock the full potential of autonomous mobile robots in industrial logistics.

Autonomous transport robots operate within the factory environment, demonstrating a scalable solution for internal logistics.

This review details a reference architecture for infrastructure-based Autonomous Mobile Robots, addressing challenges and outlining future research directions in localization, perception, and human-robot interaction for scalable industrial automation.

While autonomous mobile robots (AMRs) are increasingly deployed for internal logistics, current solutions largely prioritize onboard intelligence, overlooking the potential of augmenting them with external infrastructure. This paper, ‘Infrastructure-based Autonomous Mobile Robots for Internal Logistics — Challenges and Future Perspectives’, presents a reference architecture integrating cloud computing and dedicated sensing to address this gap. We demonstrate how this approach enhances localization, perception, and planning capabilities, leading to more scalable and robust AMR systems in complex industrial environments. Could this infrastructure-centric paradigm unlock truly collaborative and adaptable automation for the future of manufacturing and beyond?


The Limits of Autonomy: A Bottleneck of Centralization

Conventional autonomous mobile robots often function as isolated computational units, performing all perception, planning, and control tasks with onboard processors. While seemingly self-contained, this architecture introduces significant bottlenecks when operating in complex or dynamic environments. The robot’s ability to react to unforeseen obstacles, reroute around congested areas, or adapt to changing conditions is directly limited by the processing power available within the robot itself. As environmental complexity increases – think crowded warehouses or bustling city streets – the computational demands quickly overwhelm the onboard system, leading to delays in response time, reduced efficiency, and ultimately, a compromised ability to navigate effectively. This reliance on centralized processing restricts scalability; adding more robots doesn’t necessarily improve performance, and even minor adjustments to the operating environment can necessitate extensive reprogramming and recalibration.

The reliance on fully onboard processing within traditional autonomous mobile robots presents significant constraints on scalability and adaptability, ultimately impeding the realization of truly flexible automation. Because all sensing, planning, and control occur locally, each robot operates as an isolated unit, requiring substantial computational resources to navigate even moderately complex environments. This creates a bottleneck; adding more robots doesn’t inherently increase system capacity, and expanding the operational scope demands increasingly powerful – and expensive – hardware upgrades for each individual unit. Consequently, deploying these robots in dynamic, real-world settings, where layouts change and unexpected obstacles appear, necessitates extensive and time-consuming reprogramming. The architecture struggles to generalize beyond its initial parameters, hindering its ability to seamlessly adjust to novel situations and limiting its practical application in environments demanding genuine operational agility.

Existing autonomous systems often falter when confronted with real-world dynamism, exhibiting a notorious inflexibility that demands substantial re-programming for even seemingly trivial alterations to their operating environment. A robot navigating a warehouse, for example, may require complete recalibration if a single pallet is moved or a new obstacle appears – a process that is both time-consuming and expensive. This reliance on pre-programmed responses, rather than adaptive learning, creates a significant bottleneck in deployment and scalability; the robots are effectively ‘blind’ to changes they haven’t been explicitly prepared for. Consequently, current automation solutions struggle to deliver true resilience and frequently necessitate human intervention, undermining the promise of fully independent operation and limiting their effectiveness in unpredictable settings.

The increasing complexity of modern automation challenges the traditional, centralized intelligence of autonomous systems. A move towards distributed intelligence is becoming essential, where processing power isn’t solely confined to the robot itself, but shared across the environment – encompassing edge computing, cloud connectivity, and collaborative multi-agent systems. This paradigm shift allows for real-time adaptation to unforeseen circumstances, enhanced resilience through redundancy, and the ability to handle significantly more complex tasks than previously possible. By offloading computational burdens and leveraging collective knowledge, automation can evolve from rigid, pre-programmed sequences to dynamic, learning systems capable of navigating unpredictable environments and collaborating seamlessly with both humans and other machines, ultimately unlocking a new era of flexible and robust industrial processes.

RAIL defines a reference architecture for infrastructure-based Autonomous Mobile Robotics (AMR) systems used in internal logistics.

Beyond Onboard Processing: Infrastructure as Intelligence

Infrastructure-based Autonomous Mobile Robot (AMR) systems differentiate themselves from traditional AMRs by shifting computational workload from onboard processors to external resources, such as cloud servers or edge computing devices. This architecture allows AMRs to access greater processing power and storage capacity than would be feasible with fully self-contained units. By utilizing external computation, these systems can perform complex tasks – including Simultaneous Localization and Mapping (SLAM) and advanced path planning – without being limited by the physical constraints of the robot itself. This decoupling of computation and robotics enables greater flexibility, scalability, and the potential for deploying more sophisticated algorithms and functionalities.

Offloading computationally demanding tasks such as Simultaneous Localization and Mapping (SLAM) and path planning to cloud infrastructure enables Autonomous Mobile Robots (AMRs) to achieve increased efficiency and scalability. Processing these functions remotely reduces the requirements for onboard processing power, weight, and energy consumption, allowing for the deployment of smaller, more agile robots and extending operational durations. Furthermore, cloud-based processing facilitates centralized fleet management, allowing for dynamic task allocation, real-time monitoring, and over-the-air software updates for a larger number of AMRs without impacting individual robot performance. This architecture supports the scaling of robotic deployments beyond the limitations imposed by the computational resources available on each physical unit.
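Centralized fleet management of this kind can be illustrated with a deliberately simple dispatch rule: assign each new transport task to the nearest idle robot. The function and data layout below are illustrative assumptions for the sketch, not the paper's dispatcher:

```python
import math

def assign_tasks(robots, tasks):
    """Greedy allocation: each task goes to the nearest idle robot.
    `robots` maps robot id -> (x, y) position; `tasks` is a list of
    (task_id, (x, y)) pickup locations. Returns {task_id: robot_id}."""
    available = dict(robots)          # robots still unassigned this round
    assignment = {}
    for task_id, (tx, ty) in tasks:
        if not available:
            break                     # more tasks than idle robots
        best = min(available,
                   key=lambda r: math.hypot(available[r][0] - tx,
                                            available[r][1] - ty))
        assignment[task_id] = best
        del available[best]
    return assignment

fleet = {"amr1": (0.0, 0.0), "amr2": (10.0, 0.0)}
jobs = [("t1", (9.0, 1.0)), ("t2", (1.0, 1.0))]
print(assign_tasks(fleet, jobs))     # {'t1': 'amr2', 't2': 'amr1'}
```

A production dispatcher would also weigh battery level, task priority, and congestion, but the cloud-side pattern – global state in, assignments out – is the same.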

The implementation of infrastructure-based autonomous mobile robots (AMRs) facilitates the deployment of computationally demanding algorithms, specifically Simultaneous Localization and Mapping (SLAM), without requiring substantial onboard processing power. Traditional SLAM implementations necessitate significant computational resources for real-time map creation and localization; by offloading these calculations to external servers via cloud connectivity, however, AMRs can utilize more complex and accurate SLAM techniques. This reduces the hardware requirements – and associated costs, weight, and power consumption – of the robot itself, while simultaneously improving the quality and robustness of the environmental map and the precision of the robot’s localization within it. The distribution of processing load also allows for the use of algorithms of higher computational complexity without compromising real-time performance.
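The division of labor can be sketched as follows: the robot integrates cheap dead-reckoning odometry every control cycle, and periodically blends in a pose returned by the off-board SLAM optimizer. The fusion rule and the alpha weight are assumptions made for this sketch; the paper only states that the heavy SLAM computation runs off the robot.

```python
def integrate_odometry(pose, dx, dy, dtheta):
    """Onboard dead reckoning: cheap, runs every control cycle, drifts."""
    x, y, theta = pose
    return (x + dx, y + dy, theta + dtheta)

def fuse_pose(local, cloud, alpha=0.8):
    """Blend the drifting local estimate with a cloud-optimized pose;
    alpha weights the cloud result (illustrative, not from the paper)."""
    return tuple(alpha * c + (1 - alpha) * l for l, c in zip(local, cloud))

pose = (0.0, 0.0, 0.0)
for _ in range(5):                       # five forward steps of 1 m
    pose = integrate_odometry(pose, 1.0, 0.0, 0.0)
print(pose)                              # (5.0, 0.0, 0.0) -- uncorrected
# Correction as if returned by the cloud SLAM backend:
pose = fuse_pose(pose, (4.8, 0.1, 0.02))
print(tuple(round(v, 3) for v in pose))
```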

Infrastructure-based Autonomous Mobile Robots (AMRs) enhance environmental awareness by integrating data from external sources – including real-time sensor networks, building information models (BIM), and historical operational data – with cloud-based processing. This fusion allows AMRs to move beyond onboard sensor limitations and construct a more comprehensive and dynamic representation of their surroundings. Cloud processing enables the analysis of large datasets, identifying patterns and predicting potential obstacles or changes in the environment. Consequently, decision-making regarding navigation, task allocation, and obstacle avoidance is optimized, resulting in improved operational efficiency and adaptability compared to systems reliant solely on onboard processing and limited sensor input.
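One minimal form of this fusion is merging the robot's own occupancy grid with one built from infrastructure sensors, such as a ceiling camera that sees a pallet the robot cannot. The per-cell maximum used here is a conservative illustrative rule; the paper does not mandate a specific fusion scheme.

```python
def fuse_occupancy(robot_map, infra_map):
    """Combine onboard and infrastructure occupancy estimates by taking,
    per cell, the higher occupancy probability (conservative: a cell is
    treated as occupied if either source believes it is)."""
    return [[max(r, i) for r, i in zip(rrow, irow)]
            for rrow, irow in zip(robot_map, infra_map)]

robot = [[0.1, 0.9], [0.0, 0.2]]   # what the robot's lidar sees
infra = [[0.7, 0.1], [0.0, 0.8]]   # e.g. a ceiling camera sees a pallet
print(fuse_occupancy(robot, infra))  # [[0.7, 0.9], [0.0, 0.8]]
```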

A Reference Architecture: Orchestrating Intelligence

The proposed Reference Architecture integrates onboard sensors – including LiDAR, radar, and cameras – with edge computing devices for initial data processing and real-time decision-making. This distributed approach reduces latency and bandwidth requirements by filtering and pre-processing data locally before transmitting relevant information to cloud resources. Cloud connectivity enables access to larger datasets for model training, fleet management, and long-term data storage, creating a scalable system capable of supporting a growing number of automated systems and complex operational scenarios. The tiered processing structure, combining the responsiveness of edge computing with the capacity of cloud infrastructure, enhances both system reliability and overall performance.
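The edge-side filtering step can be as simple as dropping invalid or out-of-range lidar returns and decimating the rest before upload. The thresholds and decimation factor below are illustrative defaults, not values from the paper:

```python
def prefilter_scan(points, max_range=25.0, keep_every=4):
    """Edge-side reduction before cloud upload: discard returns outside
    the sensor's trustworthy range, then keep every k-th survivor.
    Cuts bandwidth at the cost of angular resolution."""
    valid = [p for p in points if 0.05 < p <= max_range]
    return valid[::keep_every]

raw = [0.0, 1.2, 3.4, 80.0, 5.6, 2.2, 9.9, 0.01, 4.4]  # metres
print(prefilter_scan(raw, keep_every=2))  # [1.2, 5.6, 9.9]
```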

The proposed architecture integrates several advanced planning and decision-making techniques to enable autonomous operation. Motion Planning algorithms determine feasible trajectories for robots, avoiding obstacles and adhering to kinematic constraints. Route Planning, frequently employing algorithms such as A-star, calculates optimal paths between defined locations, considering factors like distance, time, and energy consumption. Behavioral Decision Making then utilizes these planned routes and trajectories to select appropriate actions based on the current environment, task objectives, and predefined behavioral rules, allowing for dynamic adaptation and response to unforeseen circumstances. These techniques operate in concert to provide a comprehensive planning and execution framework.
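The route-planning step can be illustrated with a textbook A* search over a small occupancy grid. This is a generic sketch in Python, not the system's actual planner:

```python
import heapq

def astar(grid, start, goal):
    """A* on a 4-connected grid; cells with 1 are obstacles. Returns the
    path length in steps, or None if the goal is unreachable."""
    rows, cols = len(grid), len(grid[0])
    h = lambda p: abs(p[0] - goal[0]) + abs(p[1] - goal[1])  # Manhattan
    open_set = [(h(start), 0, start)]          # (f = g + h, g, node)
    best_g = {start: 0}
    while open_set:
        f, g, node = heapq.heappop(open_set)
        if node == goal:
            return g
        if g > best_g.get(node, float("inf")):
            continue                           # stale queue entry
        r, c = node
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                ng = g + 1
                if ng < best_g.get((nr, nc), float("inf")):
                    best_g[(nr, nc)] = ng
                    heapq.heappush(open_set, (ng + h((nr, nc)), ng, (nr, nc)))
    return None

grid = [[0, 0, 0],
        [1, 1, 0],
        [0, 0, 0]]
print(astar(grid, (0, 0), (2, 0)))  # 6 -- detours around the wall
```

In practice the cost function would fold in time, energy, and congestion rather than raw step count, as the paragraph above notes.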

Intelligent automation systems utilize computer vision to interpret data from onboard sensors – such as cameras and LiDAR – and external data feeds. Object Detection identifies and classifies distinct objects within a scene, providing bounding box coordinates and confidence levels for each detected instance. Complementing this, Semantic Segmentation assigns a class label to every pixel in an image, enabling a detailed, pixel-level understanding of the environment. Both techniques rely heavily on Deep Learning models, specifically Convolutional Neural Networks (CNNs), trained on large datasets to achieve high accuracy and robustness in varied conditions. The output of these processes provides critical environmental awareness for subsequent planning and decision-making stages.
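One concrete fragment of such a detection pipeline is the standard non-maximum suppression step, which prunes overlapping boxes that a CNN detector emits for the same object. This is a generic implementation, not code from the reviewed system:

```python
def iou(a, b):
    """Intersection-over-union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area = lambda r: (r[2] - r[0]) * (r[3] - r[1])
    union = area(a) + area(b) - inter
    return inter / union if union else 0.0

def nms(detections, thresh=0.5):
    """Non-maximum suppression over (box, score) pairs: keep the
    highest-scoring box, drop lower-scoring boxes that overlap it."""
    dets = sorted(detections, key=lambda d: d[1], reverse=True)
    kept = []
    for box, score in dets:
        if all(iou(box, kb) < thresh for kb, _ in kept):
            kept.append((box, score))
    return kept

dets = [((0, 0, 10, 10), 0.9),    # two detections of the same pallet...
        ((1, 1, 11, 11), 0.8),
        ((50, 50, 60, 60), 0.7)]  # ...and one distinct object
print(len(nms(dets)))             # 2 boxes survive
```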

Effective data exchange within an intelligent automation system relies on a combination of communication technologies. 5G Communication provides high bandwidth and low latency for real-time data transmission between robots and the cloud, supporting computationally intensive tasks like remote control and data analytics. Wi-Fi 7 offers increased throughput and reduced latency compared to previous Wi-Fi standards, enabling robust local communication within facilities and supporting high-resolution sensor data transfer. V2X (Vehicle-to-Everything) Communication extends connectivity beyond the robot itself, allowing data exchange with infrastructure elements like traffic signals, other vehicles, and roadside units, facilitating collaborative operations and enhanced situational awareness; this includes direct communication modes as well as cloud-mediated data sharing.

In a factory setting, the system perceives its surroundings by segmenting static obstacles (red), detecting and tracking vehicles (blue), and predicting their future movements with multiple hypotheses (red arrows).

Beyond Efficiency: A Collaborative Future

Modern factory automation increasingly relies on infrastructure-based Autonomous Mobile Robot (AMR) systems to achieve production line flexibility and adaptability. Unlike traditional fixed automation, these systems utilize a network of robots that navigate using pre-defined maps and communicate with a central control system, enabling dynamic rerouting and task allocation. This approach allows manufacturers to quickly respond to changing demands, introduce new products, or reconfigure production layouts without costly and time-consuming physical modifications. By intelligently distributing tasks and optimizing routes, AMRs minimize bottlenecks, reduce lead times, and enhance overall operational efficiency, effectively transforming rigid assembly lines into responsive, scalable production ecosystems.

The convergence of the Internet of Robotic Things and federated learning presents a pathway to significantly enhanced robotic intelligence. Rather than each robot operating in isolation, this architecture allows for collaborative learning without the need to centralize training data – preserving data privacy and reducing communication bandwidth. Robots share insights gleaned from their individual experiences – identifying optimal paths, refining object recognition, or predicting potential obstructions – and contribute these learnings to a shared, global model. This distributed learning process enables the entire robotic network to adapt more quickly to changing environments and unforeseen circumstances, improving overall system resilience and efficiency. The result is a collective intelligence that surpasses the capabilities of any single robot, fostering a dynamic and responsive automation ecosystem.
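The aggregation step at the heart of this scheme can be sketched with federated averaging (FedAvg): each robot trains locally, and the server combines parameters weighted by local dataset size. This is a minimal sketch of the aggregation rule only, not the paper's training setup:

```python
def federated_average(client_weights, client_sizes):
    """One FedAvg round: average each parameter across clients, weighting
    every client's model by its local sample count. Raw data never
    leaves the robots -- only the parameter vectors are shared."""
    total = sum(client_sizes)
    n_params = len(client_weights[0])
    return [sum(w[i] * s for w, s in zip(client_weights, client_sizes)) / total
            for i in range(n_params)]

# Three robots with locally trained parameter vectors and dataset sizes.
weights = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
sizes = [100, 100, 200]                  # robot 3 saw twice as much data
print(federated_average(weights, sizes))  # [3.5, 4.5]
```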

Advancements in robotic systems are increasingly focused on seamless and safe interaction with human workers, achieved through heightened environmental awareness and predictive capabilities. These robots utilize sophisticated sensor suites – including vision systems, lidar, and ultrasonic sensors – to build detailed maps of their surroundings and dynamically adjust to changes in real-time. Crucially, these systems move beyond simple obstacle avoidance; they leverage machine learning algorithms to predict human movement and intent. By anticipating actions, robots can proactively adjust their paths, slow down, or even pause, fostering a collaborative workspace where humans and robots operate safely and efficiently alongside each other. This predictive capability minimizes the need for explicit programming of every possible scenario, enabling robots to adapt to the inherent unpredictability of human behavior and creating a more intuitive and natural working environment.
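The simplest predictor of this kind is a constant-velocity extrapolation of a tracked person's position, a common baseline that learned models are measured against. The paper does not specify its predictor; the function and parameters below are illustrative:

```python
def predict_positions(track, horizon, dt=0.5):
    """Extrapolate a tracked person's future (x, y) positions with a
    constant-velocity model. `track` holds the last two observed
    positions, dt seconds apart; returns `horizon` future points."""
    (x0, y0), (x1, y1) = track
    vx, vy = (x1 - x0) / dt, (y1 - y0) / dt
    return [(x1 + vx * dt * k, y1 + vy * dt * k)
            for k in range(1, horizon + 1)]

# A worker observed at (0, 0) then (0.5, 0) half a second later:
print(predict_positions([(0.0, 0.0), (0.5, 0.0)], horizon=3))
# [(1.0, 0.0), (1.5, 0.0), (2.0, 0.0)]
```

A robot would then check these predicted points against its own planned trajectory and slow down or reroute on a predicted conflict.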

A practical demonstration of this robotic architecture unfolded within an actual factory environment, showcasing a fleet of six autonomous mobile robots (AMRs) successfully managing logistics. These robots collectively executed 130 transport operations daily, indicating a substantial capacity for material handling within a dynamic industrial setting. Importantly, each transport operation was completed within a seven-minute cycle, highlighting the system’s efficiency and responsiveness. This deployment underscores the feasibility of scaling collaborative robotics for real-world applications, moving beyond theoretical potential to tangible improvements in factory workflows and productivity.
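A quick back-of-envelope check of the reported figures: 130 transports at 7 minutes each is 910 robot-minutes of transport work per day, roughly 15 robot-hours, or about two and a half hours of transport duty per robot across the six-robot fleet.

```python
ops_per_day = 130      # transport operations per day, from the deployment
cycle_min = 7          # minutes per transport operation
fleet_size = 6         # AMRs in the fleet

busy_minutes = ops_per_day * cycle_min            # 910 robot-minutes/day
fleet_hours = busy_minutes / 60                   # ~15.2 robot-hours/day
hours_per_robot = fleet_hours / fleet_size        # ~2.5 h per robot
print(round(fleet_hours, 1), round(hours_per_robot, 2))
```

The modest per-robot duty suggests headroom for charging, queuing, and peak-load demand on top of the steady-state throughput.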

The system architecture deployed at Volvo integrates various components to facilitate industrial implementation.

The pursuit of robust autonomous systems, as detailed in the presented architecture, benefits from a relentless focus on simplification. The article champions infrastructure-based sensing and cloud integration to address the complexities of industrial logistics, but these advancements are most effective when built upon a foundation of elegant design. As Edsger W. Dijkstra once stated, “It’s not enough to make things work; they must also be understandable.” This principle underscores the necessity of clear, concise system architecture – one where each component’s function is readily apparent, promoting maintainability and scalability. The emphasis on localization, perception, and human-robot interaction, while sophisticated, must never obscure the fundamental goal of creating a system that is both powerful and intuitively comprehensible.

What’s Next?

The presented architecture, while offering a cohesive framework, merely clarifies the contours of existing dependency. The reliance on pervasive infrastructure, though demonstrably effective for localization and perception, introduces a single point of systemic fragility. Future iterations must address this. The question is not simply how to enhance sensor fidelity, but whether absolute fidelity is even necessary – or merely a comforting illusion. A shift towards probabilistic inference, accepting inherent uncertainty, may prove more robust than chasing an unattainable perfection.

Scalability, predictably, remains a constraint. Cloud Robotics offers computational relief, yet introduces latency. The minimization of this latency isn’t a technical challenge alone; it’s a fundamental negotiation with the laws of physics. Further research should prioritize edge computing paradigms, not as a replacement for the cloud, but as a distributed intelligence layer – a nervous system for the factory floor. Unnecessary centralization is violence against responsiveness.

Finally, the human-robot interaction component, while acknowledged, remains largely symbolic. True collaboration demands not just safe co-existence, but genuine symbiosis. The AMR must transition from a tool, directed by human command, to a partner, anticipating needs and offering proactive assistance. Density of meaning in this interaction, not merely volume of data, will define the next generation of industrial automation.


Original article: https://arxiv.org/pdf/2512.15215.pdf

Contact the author: https://www.linkedin.com/in/avetisyan/


2025-12-18 19:54