Researchers from Peking University and King’s College London developed a decentralized policy optimization framework for multi-agent systems, improving scalability and decision-making efficiency in large-scale AI systems by reducing communication and system complexity. The framework uses model learning to enhance policy
Sort: