Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Limiting the model to only use 2000 tokens while also asking it to output ONLY HTML/CSS is just stupid. It's like asking a programmer to perform the same task while removing half their brain and also forget about their programming experience. This is a stupid and meaningless benchmark.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: