I prefer it to work on fairly narrowly scoped atomic features, and for those my prompts are quite minimal, or I do a short back and forth in "plan" mode until I'm happy with the plan.

For more complex features I do spec out the prompt in more detail.

I believe the model can output at most 4k tokens per response, but I think Cline just runs inference in multiple steps, so that shouldn't be a real limitation (it only matters in terms of time and cost).
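To illustrate what I mean by running inference in multiple steps, here is a rough sketch. This is not Cline's actual implementation, just the general idea: call the model repeatedly and feed back what it has produced so far until it reports that it is done. The names `run_in_steps`, `fake_generate`, and `MAX_OUTPUT_TOKENS` are made up for the example, the model is a stand-in that counts words as tokens, and the 4k cap is just the figure I guessed above.

```python
from typing import Callable, Tuple

# Assumed per-call output cap (the "4k" figure above, not a verified limit).
MAX_OUTPUT_TOKENS = 4000

# Hypothetical model interface: (prompt, output_so_far, max_tokens) -> (chunk, finished)
GenerateFn = Callable[[str, str, int], Tuple[str, bool]]


def run_in_steps(generate: GenerateFn, prompt: str, max_steps: int = 10) -> Tuple[str, int]:
    """Call the model repeatedly until it says it is finished or max_steps is hit."""
    parts = []
    for _ in range(max_steps):
        chunk, finished = generate(prompt, "".join(parts), MAX_OUTPUT_TOKENS)
        parts.append(chunk)
        if finished:
            break
    return "".join(parts), len(parts)


def fake_generate(prompt: str, so_far: str, max_tokens: int) -> Tuple[str, bool]:
    """Stand-in for a real model: wants to emit 9000 one-word 'tokens' in total."""
    target = ["tok"] * 9000
    done = len(so_far.split())                # tokens already produced
    chunk = target[done:done + max_tokens]    # capped at max_tokens per call
    finished = done + len(chunk) >= len(target)
    return " ".join(chunk) + " ", finished


if __name__ == "__main__":
    text, steps = run_in_steps(fake_generate, "write a long file")
    print(steps, len(text.split()))
```

With the stand-in model, the 9000-token output takes three calls instead of one, which is the time/cost overhead I mentioned; the per-call cap itself never blocks the result.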