Mixture of Agents (MoA) is a method that enables multiple LLMs to collaborate in a layered architecture to produce better responses than any single model. Each layer passes outputs from multiple LLMs to the next, with a final aggregator model synthesizing all intermediate responses using a special prompt. The approach achieved state-of-the-art results on AlpacaEval 2, surpassing GPT-4 by a significant margin. Results show that most models benefit from seeing responses from other models, though the degree of improvement varies.

3m watch time

Sort: