AutoArena is an open-source tool that automates evaluations, using LLM judges to rank GenAI systems.
How do you use AutoArena?
To use AutoArena, install it locally and supply user prompts to evaluate generative AI systems.
AutoArena use cases
AutoArena core features
Automated head-to-head evaluations using LLM judges
Fine-tune custom judges
Generate leaderboards with Elo scores
Support for multiple judge models
Collaborate on evaluations in the cloud
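The Elo-scored leaderboard listed above can be illustrated with a minimal sketch in Python. This is the standard Elo update applied to head-to-head results, not AutoArena's own implementation. The function names, the K-factor of 32, and the starting rating of 1000 are assumptions chosen for illustration.

```python
from collections import defaultdict


def expected_score(rating_a, rating_b):
    """Probability that A beats B under the standard Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def elo_leaderboard(matches, k=32.0, initial=1000.0):
    """Build a leaderboard from head-to-head results.

    matches: iterable of (model_a, model_b, outcome), where outcome is
    1.0 if A won, 0.0 if B won, and 0.5 for a tie (as voted by a judge).
    k and initial are illustrative defaults, not AutoArena's settings.
    """
    ratings = defaultdict(lambda: initial)
    for model_a, model_b, outcome in matches:
        ra, rb = ratings[model_a], ratings[model_b]
        ea = expected_score(ra, rb)
        # Each side moves by k times (actual result minus expected result).
        ratings[model_a] = ra + k * (outcome - ea)
        ratings[model_b] = rb + k * ((1.0 - outcome) - (1.0 - ea))
    # Highest rating first.
    return sorted(ratings.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical judge votes between three hypothetical models.
    votes = [
        ("model-x", "model-y", 1.0),
        ("model-y", "model-z", 0.5),
        ("model-x", "model-z", 1.0),
    ]
    for name, rating in elo_leaderboard(votes):
        print(f"{name}: {rating:.1f}")
```

Because each update adds to one model exactly what it subtracts from the other, the total rating across all models stays constant. Sequential Elo updates depend on the order of the matches.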
AutoArena frequently asked questions
Is AutoArena free to use?
Can I run AutoArena locally?
What types of models can I use with AutoArena?