Guidance on Training from Scratch or Fine-Tuning #164

Open
alfausa1 opened this issue Jan 15, 2025 · 12 comments
Labels
documentation Improvements or additions to documentation

Comments

@alfausa1

Hi,
I would like to ask how many images you would recommend for training a model from scratch, and what weights you would suggest starting with.

My use case is object segmentation on plain backgrounds. The general model currently works quite well for most cases, but there are a few specific scenarios that could be improved. This is why I’m considering training or fine-tuning.

I have a dataset of around 7,000 images at 2K resolution. What would you recommend in this case?

Thank you in advance for your help!

@ZhengPeng7
Owner

For common cases with no extremely complicated shapes, 500-1,000 images should be enough for training from scratch.
If your cases are very different from the training sets I used for the general-version weights, I suggest training from scratch when you have enough images; otherwise, fine-tuning could be the better option.

In your case, I recommend training from scratch. BTW, you can check the model efficiency part of the README; FP16 + compile==True + PyTorch==2.5.1 saves GPU memory, so you can apply less downscaling to your 2K data.
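A minimal sketch of that memory-saving setup, assuming a standard PyTorch training step (the tiny model, input shapes, and loss below are stand-ins, not BiRefNet's actual API; `torch.compile` is left commented out since it needs PyTorch >= 2.0 and a working compiler toolchain):

```python
import torch
import torch.nn as nn

# Stand-in for the real segmentation model (BiRefNet is loaded via the repo).
model = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.Conv2d(8, 1, 3, padding=1))

device = "cuda" if torch.cuda.is_available() else "cpu"
# CUDA autocast uses float16; CPU autocast only supports bfloat16.
amp_dtype = torch.float16 if device == "cuda" else torch.bfloat16
model = model.to(device)
# model = torch.compile(model)  # "compile==True" in the README's terms

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
# Gradient scaling is only needed for float16 on CUDA; it's a no-op otherwise.
scaler = torch.cuda.amp.GradScaler(enabled=(device == "cuda"))

x = torch.rand(2, 3, 64, 64, device=device)                  # batch of image crops
y = (torch.rand(2, 1, 64, 64, device=device) > 0.5).float()  # binary masks

# Forward pass runs in reduced precision; gradients are scaled for fp16.
with torch.autocast(device_type=device, dtype=amp_dtype):
    loss = nn.functional.binary_cross_entropy_with_logits(model(x), y)

scaler.scale(loss).backward()
scaler.step(optimizer)
scaler.update()
```

With less activation memory per pixel, larger crops of the 2K images fit on the same GPU, which is the point of the suggestion above.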

@Roshan-digi5

Hello,

First of all, thank you for your incredible work and contributions!

I want to train a model specifically for removing backgrounds from car images. I have a dataset of approximately 80,000 images. Could you guide me on the best practices to follow, which model and settings would be most suitable, and whether there are any tutorials available for training or fine-tuning a model?

@ZhengPeng7
Owner

I've written a fine-tuning guideline in my README. For fine-tuning settings, you can keep the defaults except for the number of epochs. If you still have problems after following it, please tell me.
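As a minimal illustration of the resume-from-pretrained-weights step that fine-tuning starts from (the tiny `Conv2d`, the file path, and the epoch/learning-rate values here are stand-ins; the real checkpoint and settings come from the README and config.py):

```python
import os
import tempfile

import torch
import torch.nn as nn

# Stand-in "pretrained" model; in practice this is the general BiRefNet checkpoint.
pretrained = nn.Conv2d(3, 1, 3, padding=1)
ckpt_path = os.path.join(tempfile.gettempdir(), "pretrained_stand_in.pth")
torch.save(pretrained.state_dict(), ckpt_path)

# Fine-tuning starts from those weights instead of a random init.
model = nn.Conv2d(3, 1, 3, padding=1)
model.load_state_dict(torch.load(ckpt_path, map_location="cpu"))

# Per the advice above: keep the defaults, but shorten the schedule.
epochs = 20  # far fewer than a from-scratch run
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-5)
```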

@Roshan-digi5

Thank you, I will let you know in case of any issue.

@alfausa1
Author

Hi,

Thank you so much for taking the time to reply!

I wanted to ask specifically about the configuration, losses, and backbone you would recommend for my use case. Are there any particular hyperparameters or architectures you find especially suitable for this type of task? Any additional guidance would be greatly appreciated.

Thanks again for your support!

@ZhengPeng7
Owner

In my mind, car segmentation should involve fewer fine contour details and no need for transparency. If so, you can train the model for fewer epochs with a higher weight on the IoU loss to accelerate convergence.
I may come up with more points in the future, but currently that's all.
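To give a concrete shape to "a higher weight on the IoU loss", here is one common formulation, a soft (differentiable) IoU term combined with BCE; the `iou_weight` value is an illustrative assumption, not the repo's default:

```python
import torch
import torch.nn.functional as F

def soft_iou_loss(logits, target, eps=1e-6):
    """Soft (differentiable) IoU / Jaccard loss on sigmoid probabilities."""
    prob = torch.sigmoid(logits)
    inter = (prob * target).sum(dim=(1, 2, 3))
    union = (prob + target - prob * target).sum(dim=(1, 2, 3))
    return (1.0 - (inter + eps) / (union + eps)).mean()

def combined_loss(logits, target, iou_weight=2.0):
    """BCE plus an up-weighted IoU term, favoring fast convergence on coarse shapes."""
    bce = F.binary_cross_entropy_with_logits(logits, target)
    return bce + iou_weight * soft_iou_loss(logits, target)
```

The IoU term rewards overlap of the whole predicted region at once, which is why up-weighting it helps when the target shapes are simple and per-pixel detail matters less.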

@alfausa1
Author

Sorry for not specifying earlier: my use case is object segmentation on a plain background (not cars). Many objects do have transparency, and some have small details like tiny holes.

@ZhengPeng7
Owner

That would be a general case. I'm not sure about it (otherwise, I would have added the updates to the default settings).

@alfausa1
Author

alfausa1 commented Jan 21, 2025

Thank you very much again! The model trained with DIS performs really well in most cases, but we have identified some corner cases where it fails. Would you recommend fine-tuning only on those specific cases where it fails (not the entire 7k, just the problematic ones), or fine-tuning on the entire dataset instead?

How much VRAM would I need? I have read it is around 25 GB with FP16?

@ZhengPeng7
Owner

If you find it works worse on some specific cases, training only on those would help a lot; hard negative samples usually teach the model more.
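A tiny sketch of how one might collect those failing cases for a focused fine-tuning set (the `mask_iou` helper, the `predictions` dict, and the 0.8 threshold are all hypothetical, just to illustrate the selection):

```python
import torch

def mask_iou(pred, gt, eps=1e-6):
    """IoU between two binary masks given as bool tensors."""
    inter = (pred & gt).float().sum()
    union = (pred | gt).float().sum()
    return ((inter + eps) / (union + eps)).item()

# Hypothetical: image name -> (predicted mask, ground-truth mask)
predictions = {
    "good.png": (torch.ones(4, 4, dtype=torch.bool), torch.ones(4, 4, dtype=torch.bool)),
    "bad.png":  (torch.zeros(4, 4, dtype=torch.bool), torch.ones(4, 4, dtype=torch.bool)),
}

# Keep only images where the current model scores below the threshold.
hard_cases = [n for n, (p, g) in predictions.items() if mask_iou(p, g) < 0.8]
print(hard_cases)  # → ['bad.png']
```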

Yeah, following the settings there with compile, FP16, and batch_size == 2, the training would take ~25 GB.

@Roshan-digi5

I'm following the guidelines you created but I'm still unable to understand some steps. I have updated my dataset paths as described, up to step 2. After that, what changes need to be made in config.py and in train.py? Is there any more guidance, or a Colab demo for fine-tuning?

@ZhengPeng7
Owner

OK, thanks for the suggestion. I'll try to record a video of ~1 min to start basic fine-tuning.

@ZhengPeng7 ZhengPeng7 added the documentation Improvements or additions to documentation label Feb 5, 2025