Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think the issue there is those smaller versions of those models. I regularly use Gemma3 and Qwen3 for programming without issue but in the 27b-32b range. Going smaller than that generally yields garbage.


I've tried 24-32b sizes as well and besides being even slower they were also unreliable.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: