Tag: publication
All the articles with the tag "publication".
-
Multi-Agent Debate for LLM Judges with Adaptive Stability Detection
Introduces a multi-agent debate framework for LLM judges with adaptive stability detection to improve evaluation reliability.
-
PPTBench: Towards Holistic Evaluation of Large Language Models for PowerPoint Layout and Design Understanding
Presents PPTBench, a benchmark for holistic evaluation of large language models on PowerPoint layout and design understanding.