Scalable Multi-Agent Reinforcement Learning Framework for Multi-Machine Tending

  • Abdalwhab Bakheet Mohamed Abdalwhab
  • , Giovanni Beltrame
  • , David St-Onge

Research output: Contribution to journalJournal Articlepeer-review

Abstract

Robotic manipulators hold significant untapped potential for manufacturing industries, particularly when deployed in multi-robot configurations that can enhance resource utilization, increase throughput, and reduce costs. However, industrial manipulators typically operate in isolated one-robot, one-machine setups, limiting both utilization and scalability. Even mobile robot implementations generally rely on centralized architectures, creating vulnerability to single points of failure and requiring robust communication infrastructure. This paper introduces SMAPPO (Scalable Multi-Agent Proximal Policy Optimization), a scalable input-size invariant multi-agent reinforcement learning model for decentralized multi-robot management in industrial environments. MAPPO (Multi-Agent Proximal Policy Optimization) represents the current state-of-the-art approach. We optimized an existing simulator to handle complex multi-agent reinforcement learning scenarios and designed a new multi-machine tending scenario for evaluation. Our novel observation encoder enables SMAPPO to handle varying numbers of agents, machines, and storage areas with minimal or no retraining. Results demonstrate SMAPPO’s superior performance compared to the state-of-the-art MAPPO across multiple conditions: full retraining (up to 61% improvement), curriculum learning (up to 45% increased productivity and up to 49% fewer collisions), zero-shot generalization to significantly different scale scenarios (up to 272% better performance without retraining), and adaptability under extremely low initial training (up to 100% increase in parts delivery).

Original languageEnglish
Pages (from-to)3135-3142
Number of pages8
JournalIEEE Robotics and Automation Letters
Volume11
Issue number3
DOIs
Publication statusPublished - 2026

!!!Keywords

  • AI and machine learning in manufacturing and logistics systems
  • Reinforcement learning
  • collaborative robots in manufacturing
  • integrated planning and control
  • path planning for multiple mobile robots or agents

Fingerprint

Dive into the research topics of 'Scalable Multi-Agent Reinforcement Learning Framework for Multi-Machine Tending'. These topics are generated from the title and abstract of the publication. Together, they form a unique fingerprint.

Cite this