Aion-RP-Llama-3.1-8B ranks the highest in the character evaluation portion of the RPBench-Auto benchmark, a roleplaying-specific variant of Arena-Hard-Auto, where LLMs evaluate each other’s responses. It is a fine-tuned base model...
Best overall rank
-
Across all subsets · overall
Total arena votes
0
Across 0 (subset, category) entries
Providers on OR
0
None yet
First seen
-
Awaiting first snapshot
Trend
No snapshot history yet - trends appear after the first daily refresh.