On the Humanity’s Last Exam (HLE) benchmark, Kimi K2.5 scored 50.2% (with tools), surpassing OpenAI’s GPT-5.2 (xhigh) and ...
Kimi K2.5 handles up to 100 sub-agents and 1,500 tool calls, cutting task time 4.5x so you finish complex work sooner.
Results that may be inaccessible to you are currently showing.
Hide inaccessible results