I prefer it to work on fairly narrowly scoped atomic features, and for those my prompts are quite minimal, or I do a short back and forth in "plan" mode until I'm happy with the plan.
For more complex features I do spec out the prompt in more detail.
The model can output at most 4k tokens I believe, though I think Cline will just run inference in multiple steps, so that shouldn't be a limitation that matters (only time/cost wise).
For more complex features I do spec out the prompt in more detail.
The model can output at most 4k tokens I believe, though I think Cline will just run inference in multiple steps, so that shouldn't be a limitation that matters (only time/cost wise).