Multi-Agent Reinforcement Learning

Summary

  • Extends RL to environments with multiple interacting agents.
  • Challenges:
    • Non-stationarity: Agents’ policies change over time.
    • Scalability: Large state-action spaces.
  • Approaches:
    • Centralized Training, Decentralized Execution: Train a joint policy but allow decentralized decision-making.
    • Multi-Agent Policy Gradients: Extend policy gradients to handle multiple agents.