Multi-Agent Debate for LLM Judges with Adaptive Stability Detection
Published in The Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025
Introduces a multi-agent debate framework for LLM judges with adaptive stability detection to improve evaluation reliability.
Recommended citation: T. Hu, Z. Tan, S. Wang, H. Qu, and T. Chen. Multi-Agent Debate for LLM Judges with Adaptive Stability Detection. The Thirty-ninth Annual Conference on Neural Information Processing Systems, 2025.
