Fine-tuning is a good technique to have in a toolbox, but in reality, it is feas...

coldtea · 2025-10-19T13:56:02 1760882162

>Fine-tuning is a good technique to have in a toolbox, but in reality, it is feasible only in some use cases.

Yes, 100s of housands of them

brulard · 2025-10-19T21:39:19 1760909959

Care to elaborate what are some of those use cases?

coldtea · 2025-10-19T23:40:28 1760917228

Almost everything, isn't it?

From fine-tuning for coding assist, to medical applications, customer support, legal and financial use cases, various classification tasks, for government work, statistics, language learning, music, education, even for role playing game character AI...

I'd rather have a fine-tuned model specialized to any of those tasks and countless others when I'm doing one of those tasks, than a jack of all trades...

libraryofbabel · 2025-10-19T13:08:46 1760879326

What would you say is an example of one of those “middle” tasks it can help with?

CaptainOfCoit · 2025-10-19T13:16:53 1760879813

An example I just found worked very well with fine-tuning: I wanted to extract any frame that contained a full-screen presentation slide from a various videos I've archived, only when it's full-screen, and also not capture videos, and some other constraints.

Naturally I reached for CLIP+ViT which got me a ~60% success rate out of the box. Then based on that, I created a tiny training script that read `dataset/{slide,no_slide}` and trained a new head based on that. After adding ~100 samples of each, the success rate landed at 95% which was good enough to call it done, and circle back to iterate once I have more data.

I ended up with a 2.2K large "head_weights.safetensors" that increased the accuracy by ~35% which felt really nice.