Added Qwen3 Next to the Brokk Power Ranking Open Round (coding benchmark). It's ...

noahbp · 2025-09-12T21:46:32 1757713592

Is that the updated Kimi K2, or the old Kimi K2?

jbellis · 2025-09-13T13:22:08 1757769728

It's the original. I'll update the label to clarify.

SparkyMcUnicorn · 2025-09-12T23:18:53 1757719133

This would be a valuable benchmark if it included languages other than Java and let me see which models are best at the languages I work with.

My real-world usage does not line up with these results, but I'm not working with Java.