Ehh, is it cool and time-saving that it figured it out? Yes. But the solution was to get a “better” prebuilt wheel of PyTorch. That’s a relatively “easy” problem to solve (though figuring out that this was the problem does take time). But it’s (probably; I can’t afford one) going to be painful when you want to upgrade the CUDA version or pin a specific one. Unlike on a typical PC, you’re going to need to build a new image and flash it. I’d be more impressed when an LLM can do that end to end for you.
PyTorch + CUDA is a headache I've seen a lot of people have at my uni, and one I've never had to deal with thanks to uv. Good tooling really does go a long way in these things.
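For anyone curious, uv's PyTorch guide handles this by pinning torch to PyTorch's own wheel index in pyproject.toml. A minimal sketch (the index name and CUDA version here are just an example):

```toml
[project]
name = "demo"
version = "0.1.0"
requires-python = ">=3.10"
dependencies = ["torch>=2.4"]

# Extra index holding the CUDA-specific wheels; `explicit = true`
# means it's only consulted for packages pinned to it below.
[[tool.uv.index]]
name = "pytorch-cu124"
url = "https://download.pytorch.org/whl/cu124"
explicit = true

# Pull torch from that index instead of PyPI.
[tool.uv.sources]
torch = { index = "pytorch-cu124" }
```

After that, `uv sync` resolves a CUDA-matching wheel with no manual pip gymnastics.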
Although, I must say that for certain Docker GPU pass-through cases, the debugging logs just aren't as detailed.
Yup. The beauty of it is that the underlying AI accelerator/hardware is completely abstracted away. There’s a CoreML execution provider for ONNX Runtime, though I haven’t used it.
No more fighting with hardcoded cuda:0 everywhere.
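For reference, device selection in ONNX Runtime is just a priority-ordered provider list, so the same script runs on CUDA, CoreML, or CPU depending on what's available. A minimal sketch (the model path is a placeholder, not a file from this thread):

```python
import onnxruntime as ort

# Priority order: ONNX Runtime skips any provider that isn't
# available in this build/machine and falls back to the next one.
providers = [
    "CUDAExecutionProvider",    # NVIDIA GPUs
    "CoreMLExecutionProvider",  # Apple hardware
    "CPUExecutionProvider",     # always-available fallback
]

session = ort.InferenceSession("model.onnx", providers=providers)
print(session.get_providers())  # providers actually loaded on this machine
```

No cuda:0 anywhere; the provider list decides at runtime.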
The only pain point is that you’ll often have to manually convert a PyTorch model from Hugging Face to ONNX unless it’s very popular.
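The manual route is torch.onnx.export. A minimal sketch with a toy stand-in model (a Hugging Face checkpoint loaded via transformers exports the same way, and `optimum-cli export onnx` automates it for popular architectures):

```python
import torch

# Toy stand-in model; swap in a model loaded from Hugging Face.
model = torch.nn.Sequential(
    torch.nn.Linear(16, 32),
    torch.nn.ReLU(),
    torch.nn.Linear(32, 4),
)
model.eval()

dummy = torch.randn(1, 16)  # example input used to trace the graph
torch.onnx.export(
    model,
    dummy,
    "model.onnx",
    input_names=["input"],
    output_names=["logits"],
    # Keep the batch dimension dynamic so the exported graph
    # isn't locked to batch size 1.
    dynamic_axes={"input": {0: "batch"}, "logits": {0: "batch"}},
)
```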